Gene Franean1_3584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3584 
Symbol 
ID5671953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4245067 
End bp4247688 
Gene Length2622 bp 
Protein Length873 aa 
Translation table11 
GC content72% 
IMG OID641242470 
ProductABC transporter related 
Protein accessionYP_001507890 
Protein GI158315382 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.282341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.46326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCTGG CCAAGAGCCT TCTGCCCCGT GTCCTCGCCG GCCCGCTGCT CGCCGTCGTG 
CTGCTGGTCG TCGTGCACGG AGAGCTGATA CCGGCATACC AGACCTACTC GCTGGCCCTG
GCGGCGACAT ACGCGGTCCT CGTGCTCAGC GTGGGCCTGC TGGCCGGATG GGCTGGCGTC
TGGTCGGTCG GCCACCCGGC CCTGTTCGCC ATCGGGGCCT ACACAGCGGC GTACGGCTCC
GCCCACGGAT GGGGGCTGGA GGTCACCGTG CTTGCGGCGA TGGCGCTGGC CGGAACCTGC
GGTGCCTTCC TGGGCTTCGC CGGTGCCCGT TTCTCCGTCC TCTATATCGC CCTGCTCACC
CTGGCCTTCA GCCTGGTCGC TCTCGAGGTG ATCAATCGGT GGACCGGTGT GACCGGCGGA
GACCAGGGTG TTCCTGTCCT CGAGCTGTCG AGCGTGCTCG GCCTGGGAAG CCTCGGTGGT
GGCAGCGCGG AGGCGATCGA CGCGGCGATC GTCACGGCTG GGGTCATGCT CACGGTCGCC
GCCCTCGCCC GGCCTATGGG CCTGCGGATG CGCATGGTCG CGGCGAAGTC GCATCCGCTG
GCGGCTCGCT CCATCGGAAT CGCCCCGGAG GCACAGACCG CGCTGGCCTT CGGGGCCAGT
GGGGCGGCCG CCGGGCTTGC CGGTGTGCTA CTAGCCCTGA TCACTGGCTA CGTCAGCCCA
GAGTCGTTCT CCCTGGTCTT CGGCATCAAC ACGATCGCCG CGGCAGTGCT CGGCGGTGTC
GGCACCATCG CCGGAGCCGT GGTCGGCGGC GCCTTCATGG CGTGGTCACC CACCCTCGCC
GACGACATCG GGGTCAGCCA GATGGTCGTC CAGGGCACCG TGCTGATCAT AGTTCTTCTC
CTGCTGCCCA GCGGTGTCGT GCCGGCCGTC GCGAGACTCG GGCGGGCGGT GTTCAGGCGG
GCGGTACTCC CGCGGGCACC GTGGCTGCGG CCGCGTCCCG GCCCGGCCTC CGGTTTCGCC
GACGGGATGG CCGACGGGAC GGCCGACCGA GCCACCGGCC CGGCCGCGCC GGGCCTCGTC
CCGGCCGCAC GCCCCGAGGA CGCCGGGGGG ACAGCGTCGG GCGGCGCGGA CGCCGAGACC
GTGCTCGAGA TCAGCGACCT TGCGGTGACA TTCGGCGGGC TAAGAGCGCT GGAGGGAGTC
TCGCTCAGCG TGCGGAAGGG CGAGGTGCTC GCGATCATCG GGCCGAACGG GGCCGGAAAG
ACCACGCTGG TGAACGCGCT GTCCGGGCTG CTGCCCAGCG GTCGGGTCAC CGGGTCGGCC
CGCTACCGCG GGCACGAGCT CCTCGGCCGC CGCGCGACCG GCCGTCGCGG TCTGGGTATC
GGGCGCACGT TCCAGCATGC CGAGCCCTTC GGCGAGCTCA GCATGCTCGA GAACTTGCTG
TGCGCCCACC GGTGGCCCAC TGTGCGGCGG CGCGCCGAAG CCTGGCAGCT GCTGGAGCAG
GTCGGCCTGT CCGACGTCGC GGACCGCTCC CCGCACGAGC TGCCGTTCGG GGTACGCAAG
CGGCTCGACC TGGCGCGCGC CATCGTCGCG CATCCGGACC TGCTCATCCT CGACGAGCCG
TTCGGCGGGC TCGACGCCGG CGAGCGCGCG CTGCTCGCGA CCCAGGTGCG CCGGCTTCGC
GACGAGGGTG TTGCGATAAT CATCATCGAT CACGTGATCG AGGACCTGTT CGCGGTCGCG
GACCGCGTCG TGGCCTTCGA CTTCGGCAGG CCCATCGGTA GCGGCACCCC GGACGTCGTG
CTGCAGGACG ATGCCGTCCG CACCTCTTAC CTGGGCGCGG CAACGGTACG GCCACGCGCC
GCGCTCGCCG CGGGACGTGG CGAGCCGCTG GTCACGCTCA CCGCCGTCGG CCACCGCTAC
GGCGGCGTGG TCGCGCTGGA CGGCGTGGAC CTGCGGATCC CGCGGGGCGG CATCCTCGCC
GCCGTCGGCG CCAACGGGGC TGGCAAGAGC ACTCTCGGCA AGGTGGTGCA CGGCACCGTC
GTGCCGACCC GCGGCACCCG CGAGGTCGTC CAGGTCGACG GGCGCGCTTT GCGCTGCAGC
CTCATGCCCG AGGGGCGCGC CCTGTTCAAG TCCCTGTCGG TGCGGGAGAA CCTGGACGTC
GCCGCCTATG CGGCGGGGGT GCGCGGCGCC CTGCTCCGCC AGCGCCGAGA CGAGACTATG
GACTGGCTGC CGGACCGGGT GCGCAGCCGC ATGTCGGTGT CGGCGGGCGC GCTGTCCGGT
GGCGAGCAGC AGCTCCTCGC GACGGCTCGG GCACTGATGG CCGGGCCGGA CCTGCTCGTG
CTCGACGAAC CCGCGCTCGG GCTGGCGCCC GCCATGGTCG ACGAGATCTA CGAGCGCATC
GCCGGGCTGG CCGAACAGGG GCTGACGGTA GTGCTGCTCG AGCAGCTCCT GAGCCGGGCC
CTCAGCCTTG CCACCGACAT GGTGGTGCTG CACGAGGGCA CTGTCGCGGT CACGGGCTCG
CCCGCCGACC CCGGATTCGC CGAGCTTGCC GAGCACGCCT ACTTCGGCGG TGCTGCGGCC
GTCGCCTCCG GCACCCCGCT AACCGAGGCG GTGAGCCGGT GA
 
Protein sequence
MVLAKSLLPR VLAGPLLAVV LLVVVHGELI PAYQTYSLAL AATYAVLVLS VGLLAGWAGV 
WSVGHPALFA IGAYTAAYGS AHGWGLEVTV LAAMALAGTC GAFLGFAGAR FSVLYIALLT
LAFSLVALEV INRWTGVTGG DQGVPVLELS SVLGLGSLGG GSAEAIDAAI VTAGVMLTVA
ALARPMGLRM RMVAAKSHPL AARSIGIAPE AQTALAFGAS GAAAGLAGVL LALITGYVSP
ESFSLVFGIN TIAAAVLGGV GTIAGAVVGG AFMAWSPTLA DDIGVSQMVV QGTVLIIVLL
LLPSGVVPAV ARLGRAVFRR AVLPRAPWLR PRPGPASGFA DGMADGTADR ATGPAAPGLV
PAARPEDAGG TASGGADAET VLEISDLAVT FGGLRALEGV SLSVRKGEVL AIIGPNGAGK
TTLVNALSGL LPSGRVTGSA RYRGHELLGR RATGRRGLGI GRTFQHAEPF GELSMLENLL
CAHRWPTVRR RAEAWQLLEQ VGLSDVADRS PHELPFGVRK RLDLARAIVA HPDLLILDEP
FGGLDAGERA LLATQVRRLR DEGVAIIIID HVIEDLFAVA DRVVAFDFGR PIGSGTPDVV
LQDDAVRTSY LGAATVRPRA ALAAGRGEPL VTLTAVGHRY GGVVALDGVD LRIPRGGILA
AVGANGAGKS TLGKVVHGTV VPTRGTREVV QVDGRALRCS LMPEGRALFK SLSVRENLDV
AAYAAGVRGA LLRQRRDETM DWLPDRVRSR MSVSAGALSG GEQQLLATAR ALMAGPDLLV
LDEPALGLAP AMVDEIYERI AGLAEQGLTV VLLEQLLSRA LSLATDMVVL HEGTVAVTGS
PADPGFAELA EHAYFGGAAA VASGTPLTEA VSR