Gene Franean1_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3049 
Symbol 
ID5671428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3584655 
End bp3586379 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content69% 
IMG OID641241947 
ProductABC transporter related 
Protein accessionYP_001507367 
Protein GI158314859 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.511073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACAG AGCACGCGCT GCCGGTCGCG GGCGGACGTG AGACCGCTCG GGAGGTGTGG 
CGGCTCAGCC GTGGGCATCG ACGCAGTCTG GCCGCCCTCG TGGTGCTGGG AATCGCAAGC
ACCGCCATCG ACCTGATCGG ACCCGTCGCG ATCGGGTTCC TCATCGATCG GGTCCAGGAA
GGCGCCGCCG ACCTCGGTAC CGTGCTGACC GCCATCGCGA TCATGGCGGT CTCGGCCATT
CTCGGTGCCG CTGGCACGGC GGCGACGATC GTCCTGGCTA CTCGCATGTA CCACACCGTC
CTCGCCGGGC TGCGGGAGGA GCTGGTCTCC CGTGCCCTGA CGCTGCCGCA GCATGTCGTC
GAGTCTGCCG GCACCGGGGA TCTGATCTCG CGGTCCAGCG ACGACGTCAC CGCGGTTGCC
GATGCGGCTC CCGCGGTGAT CCCGGCGCTT ACCGTTACGT CCTTCACCAT CGTCATGTCG
CTGGGCGGGC TGGCCGCGGT GGAATGGCCC TACGCCGCCG CCCTTGCCGT CGTGCTGCCT
GTCTACGTGC TCTCCATGCG GTGGTACCTG CGAACAGGCC CGCGGGTATA CCTGGCCGAG
CGTGCAGCGA TGAGCGCGCG TGCTCAGCAG ATTCTGGAGT CGCAGCGCGG CTACGCCACT
GTGCTCGGAT TCAGGCTTGC CGAGCAGCGG CACCGCGCCG TGACCACCAC CTCCTGGGGC
GTATCGGTGC AGGCGTTGCG GGCGCGCACC GTGCAGAGCA TGCTCAACAC CCGCCTGAAC
CTCGGCGAGT GCCTGAGCCT GGCCGCCGTG CTCGTCGTCG GCTTCGTCCT CATCGACCAC
GGAGCCTCGA CTGTCGGGGG CGCGACCACC GCCATGCTGC TCGTGCTACG CCTGCTGAAC
CCGGTCAATC AGCTGCTGTT CGTCATCGAC ACTCTCCAGT CCGCCCTCGC GTCGCTGAAC
CGCATGGTCG GAGTCACCAC CATCCCCGTC GCGGACGCGC CAGGCATCGC AACGAGCAGC
AGCGTCCACC TCCGCGAGGT CTCCTTCCAC TACGGGATCG GCCCCCGTGT GCTCGTCGAT
GTCACGCTCG ACATCCCCAC CGGTCAGCGT GTCGCCGTCG TAGGTTCGTC GGGTGCCGGC
AAGTCCACGC TGGCCACGGT GGTCGCCGGC ATTCACCAGC CCGACGCCGG GACCGTGGCC
CGACCGGAGC GCACGGTGAT GATCACCCAG GAAGTACACG TGTTCGCGGG GACATTGCGA
GACAACCTCA CCCTCGCCGC ACCGGATGCC ACCGACGGTC AGGTAGGGGC TGCGCTGGAA
GTGGCCCAGG CTGTAGGGAT GCTCGACCTG CTGCCCGATG GCCTTGACAC GGTGCTCGGT
GGTGGCGGGT ACGAGCTGAC CGCCGCACAG GCACAACAGG TGGCGCTCAC TCGCCTGGTG
CTGGCCGACC CGGAACTGGC GATCTTCGAC GAGGCCACCG CCGAGGCAGG TTCCGCGTAC
GCCGGACTGC TCGACCGCGC CGCCGACGCT GCGCTGACCG GACGCACTGG ACTGGTGATC
GCGCACCGGC TCTCGCAAGC CGCCGCCTGC GATCTGGTCG TGGTGATGGA GCACGGCCGT
ATCGCCGAGC GAGGAACCCA TACAGAGCTG ATCGCCGCCG ACGGGGTGTA TGCCGCGCTT
TGGTCGGCAT GGCGGGCCGG GCAGGAAGCT GGAGCGAATG GGTAG
 
Protein sequence
MTTEHALPVA GGRETAREVW RLSRGHRRSL AALVVLGIAS TAIDLIGPVA IGFLIDRVQE 
GAADLGTVLT AIAIMAVSAI LGAAGTAATI VLATRMYHTV LAGLREELVS RALTLPQHVV
ESAGTGDLIS RSSDDVTAVA DAAPAVIPAL TVTSFTIVMS LGGLAAVEWP YAAALAVVLP
VYVLSMRWYL RTGPRVYLAE RAAMSARAQQ ILESQRGYAT VLGFRLAEQR HRAVTTTSWG
VSVQALRART VQSMLNTRLN LGECLSLAAV LVVGFVLIDH GASTVGGATT AMLLVLRLLN
PVNQLLFVID TLQSALASLN RMVGVTTIPV ADAPGIATSS SVHLREVSFH YGIGPRVLVD
VTLDIPTGQR VAVVGSSGAG KSTLATVVAG IHQPDAGTVA RPERTVMITQ EVHVFAGTLR
DNLTLAAPDA TDGQVGAALE VAQAVGMLDL LPDGLDTVLG GGGYELTAAQ AQQVALTRLV
LADPELAIFD EATAEAGSAY AGLLDRAADA ALTGRTGLVI AHRLSQAAAC DLVVVMEHGR
IAERGTHTEL IAADGVYAAL WSAWRAGQEA GANG