Gene Franean1_2510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2510 
Symbol 
ID5670906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2988527 
End bp2990086 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content69% 
IMG OID641241427 
Productextracellular solute-binding protein 
Protein accessionYP_001506848 
Protein GI158314340 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGCAC AACTCCGAGC CGATTCGACG CTAATGGGCC GCCGTGGCTT CCTCGGTCTC 
GGCGTGCTCG CTGGCTCCTC CCTGCTGCTC GCCGCCTGCG GCGACGACAG CGGTTCGGTC
TCGTCCGCCG GCAAGGAGGG CGGGACGTTG CGGTGGGGCT GGTCCGCGGT CACCTCCTGG
GACCCGGTGA CCTCGTCCGC GGGCTGGGAC GTGCACGCGC TTTCATTGGC CTACGCCGCC
CTCACCAAGC TCGACGAGAA GGGCAACCCG GTACCGGCGC TGGCCGAGTC GTGGAAGTAC
AACGCGGACG GCACCCAGGT CGCCTTCACT CTGCGGGCCG GCCTGACGTT CAGCGACGGC
ACGCCGCTGA ACGCCACCGC GGTCAGCAAG AGCCTCGCGC GGGGCCGGGA CTTCGCCGGG
TCGCTCGTCG CCGCGCAGCT GGCCAACGTG AAGACGTTGG CCGCGGACGA CGACGCCCGC
ACCGTCACCA TCGGACTGGC CGCCACGGAC TACCAGATCC CGAGCCTGCT CGCCGGCAAG
ACCGGCATGA TCGTCAGTCC GACGGCCTTC GAGAAGGACG CGAAGGGGCT GGCCACCAAG
CCGGTAGGGG CCGGGCCGTT CCGGCTCACG GAGTACGTGC CGAACCAGTC GGCGAAGCTC
GTCCGGTTCC CCGAGTACTG GGACAAGGCG AACATCCACC TCGACGCCTT CGAGCTGTAC
CCGGCGCCCG AGGCGGCGAC GGCGGTCCCG GCGCTGCAGT CCGGGCGGCT CGACGTCGCC
CAGATCCCGG GCAGCCAGGT CGAGGCGGCG AAGGCCGCGG GCCTTGAGGT GCAGATCATC
CCCTCGCTGG TGACGACGGT GCTCGACGTG AACATCACGA TGAAGCCGTT CGACAACCCG
AAGGTCGTCG AGGCGTTCAA GCACGCCCTC GACCGCAAGG CGCTCGCCGA CACCCAGACC
TTCGGGCTGG GCGTGGTCAA CTACCAGCCG TTCCCGCCGG GGTACATCGG CCACGAACCG
AGCCTGGAGA ACGCGTTTCC GTACGACCCG GAGAAGGCGA AGAAGCTGCT CGCCGAGGCC
GGGTTCCCGG ACGGAGTGGA GGTGCCGCTG ACCACCACGG GCGCCAGCTC CGCTCTTGCC
GAGCAGGTGC AGGCGCAGCT CGCCAAGGTC GGCGTGAAGA TCACCATCGA GACGATCCCG
GCGGCGCAGG CCACCCAGAT CATGTACATC CAGCACTCGA GGGCGCTGGC CACGGACGGC
TTCGCAGGTC GTGACTCGGC CGTGCAGGCC TTCCAGGTGC TGTTCGGCGA GCAGGGCCTG
ATGAACCCCG GCCGGCAGAC GCCCCCCGAA CTGACCGCGG CGCTGCAGAA GGTGCGGGAG
ACGCCGTTGG ACGATCCGTC GTATCCGACG GTGCTCCGGG CCGCCACGAA GATCGCGGTC
GAGAAGATGC CGAACATCTT CCTCTTCACC ACGCCGCGCG TTCTCGCCCG CAAGAAGAAC
GTCTCGGAGC TGGGCAGCTA CCTGGCGGTA CAGCGCTTCG AGGGCGTCCG GGTCGGGTAA
 
Protein sequence
MNAQLRADST LMGRRGFLGL GVLAGSSLLL AACGDDSGSV SSAGKEGGTL RWGWSAVTSW 
DPVTSSAGWD VHALSLAYAA LTKLDEKGNP VPALAESWKY NADGTQVAFT LRAGLTFSDG
TPLNATAVSK SLARGRDFAG SLVAAQLANV KTLAADDDAR TVTIGLAATD YQIPSLLAGK
TGMIVSPTAF EKDAKGLATK PVGAGPFRLT EYVPNQSAKL VRFPEYWDKA NIHLDAFELY
PAPEAATAVP ALQSGRLDVA QIPGSQVEAA KAAGLEVQII PSLVTTVLDV NITMKPFDNP
KVVEAFKHAL DRKALADTQT FGLGVVNYQP FPPGYIGHEP SLENAFPYDP EKAKKLLAEA
GFPDGVEVPL TTTGASSALA EQVQAQLAKV GVKITIETIP AAQATQIMYI QHSRALATDG
FAGRDSAVQA FQVLFGEQGL MNPGRQTPPE LTAALQKVRE TPLDDPSYPT VLRAATKIAV
EKMPNIFLFT TPRVLARKKN VSELGSYLAV QRFEGVRVG