Gene Franean1_3944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3944 
Symbol 
ID5672305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4713651 
End bp4715045 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content70% 
IMG OID641242823 
Productextracellular solute-binding protein 
Protein accessionYP_001508240 
Protein GI158315732 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.920276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0416082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACGAC GGCGCTTTGC CCGGGGCGTG GCGGCCGCCT GCGCTGCCGC GCTCGCCCTC 
ACCCTCGCCG CCTGCGGATC CGACGACGGC GCCGACGCGG GCGGGACGTC CACCGCCGGG
GTCGTGCCCG AGCTCGGGCC GGACCAGAAG GTCTCGATTG TCTTCGAGAG CTACAACCTC
GCCCAGCCCG GCCCGTGGAC GGACACCTTC AACGGCCTGA TCGCCGACTT CGAGAAGGCC
CACCCGAACA TCTCGGTCAC CGCGCAGAAG CCGCAGACCT CCACTCTGAA GGGCTACGGC
TCGGCGGCCA CCGCGAGCAT CCAGGCCCAG ATCGCCACCG GCAACGCCCC CGACGTCGCC
CAGCTCACCT TCGGCGACCT CGGCTACACC GCGACCGCGC TCGGCGCGAA GCCGCTCGAC
GACATCGTCG GGCGCGACGC CGTCCAGGAG AACTTCGCCG GCACCCACCC GTTCGCGCCG
ACGGCGCGCA CCCTCGGAGA TGTCGACGGC AAGACCTACG GCATGCCGTT CGTCTTCTCC
ACCCCGGTGC TGTACTACAA CGCCGACCTG TTCACCCAGG CCGGCCTCGA CCCGGAGAAG
CCGCCCACGA CCTGGGACGA GTTCAAGACC GCCGCGCTGG CCATCAAGGC GAAGACCGGC
AAGAATGGCG GTTACATCGA CTGCCTCACC AAGGTCTCCG GCGACTGGTG CTACCAGGCG
CTGGTCGCCT CCAACGGCGG CTCGGTGATC TCCGAGGACC GCACCAAGCT CACCTTCGCC
GAGGCGCCCG CGGTGCAGGC GGTCGAGATG GCGCAGGACC TGGTCAACTC CGGCGCCAGC
CCGAAGCTGT CACAGGACCA GGCCTACCCG GCGTTCGCCC GCGGTGAGAT CGGCATGATC
GTCGAGACCA GTGCGGCGCA GGGCACCTTC ATCAAGGGCG CCGGCGGCGC CAAGCCGCCG
TGGACGCTGC GCGCCACCGT CATGCCGAGC TTCACCGGCA AGCCGGTCGT GCCGACGAAC
TCCGGGGCGG CGCTGTTCAT GTTCGCCAAG GACGCGGCGA AGCAGCGGGC CGCCTGGGAG
CTGATCACCT ACCTGACCAG CGACGCGGCC TACACGCAGA TCACCAGCAA GATCGGCTAC
CTGCCGCTGC GCACCGGGCT GCTCGACGAC CCGAACGGCC TGAAGACCTG GGCCGAGCAG
AACCCGCTGG TCAAGCCGAA CGTCGACCAG CTCGCGAAGC TGAAGCCGTG GGTGTCCTTC
CCGGGCAACA ACTACGTCCA GATCCGCACC GGGATGCTCG AGGCGGTCGA GAGCGTCGTC
TACAGCGGCG CCGACCCGCA GAAGACGCTC ACCGACGCCC AGAACCAGGC CGCGAAGCTG
CTGCCCCGGT CCTGA
 
Protein sequence
MKRRRFARGV AAACAAALAL TLAACGSDDG ADAGGTSTAG VVPELGPDQK VSIVFESYNL 
AQPGPWTDTF NGLIADFEKA HPNISVTAQK PQTSTLKGYG SAATASIQAQ IATGNAPDVA
QLTFGDLGYT ATALGAKPLD DIVGRDAVQE NFAGTHPFAP TARTLGDVDG KTYGMPFVFS
TPVLYYNADL FTQAGLDPEK PPTTWDEFKT AALAIKAKTG KNGGYIDCLT KVSGDWCYQA
LVASNGGSVI SEDRTKLTFA EAPAVQAVEM AQDLVNSGAS PKLSQDQAYP AFARGEIGMI
VETSAAQGTF IKGAGGAKPP WTLRATVMPS FTGKPVVPTN SGAALFMFAK DAAKQRAAWE
LITYLTSDAA YTQITSKIGY LPLRTGLLDD PNGLKTWAEQ NPLVKPNVDQ LAKLKPWVSF
PGNNYVQIRT GMLEAVESVV YSGADPQKTL TDAQNQAAKL LPRS