Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2510 |
Symbol | |
ID | 5670906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2988527 |
End bp | 2990086 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641241427 |
Product | extracellular solute-binding protein |
Protein accession | YP_001506848 |
Protein GI | 158314340 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGCAC AACTCCGAGC CGATTCGACG CTAATGGGCC GCCGTGGCTT CCTCGGTCTC GGCGTGCTCG CTGGCTCCTC CCTGCTGCTC GCCGCCTGCG GCGACGACAG CGGTTCGGTC TCGTCCGCCG GCAAGGAGGG CGGGACGTTG CGGTGGGGCT GGTCCGCGGT CACCTCCTGG GACCCGGTGA CCTCGTCCGC GGGCTGGGAC GTGCACGCGC TTTCATTGGC CTACGCCGCC CTCACCAAGC TCGACGAGAA GGGCAACCCG GTACCGGCGC TGGCCGAGTC GTGGAAGTAC AACGCGGACG GCACCCAGGT CGCCTTCACT CTGCGGGCCG GCCTGACGTT CAGCGACGGC ACGCCGCTGA ACGCCACCGC GGTCAGCAAG AGCCTCGCGC GGGGCCGGGA CTTCGCCGGG TCGCTCGTCG CCGCGCAGCT GGCCAACGTG AAGACGTTGG CCGCGGACGA CGACGCCCGC ACCGTCACCA TCGGACTGGC CGCCACGGAC TACCAGATCC CGAGCCTGCT CGCCGGCAAG ACCGGCATGA TCGTCAGTCC GACGGCCTTC GAGAAGGACG CGAAGGGGCT GGCCACCAAG CCGGTAGGGG CCGGGCCGTT CCGGCTCACG GAGTACGTGC CGAACCAGTC GGCGAAGCTC GTCCGGTTCC CCGAGTACTG GGACAAGGCG AACATCCACC TCGACGCCTT CGAGCTGTAC CCGGCGCCCG AGGCGGCGAC GGCGGTCCCG GCGCTGCAGT CCGGGCGGCT CGACGTCGCC CAGATCCCGG GCAGCCAGGT CGAGGCGGCG AAGGCCGCGG GCCTTGAGGT GCAGATCATC CCCTCGCTGG TGACGACGGT GCTCGACGTG AACATCACGA TGAAGCCGTT CGACAACCCG AAGGTCGTCG AGGCGTTCAA GCACGCCCTC GACCGCAAGG CGCTCGCCGA CACCCAGACC TTCGGGCTGG GCGTGGTCAA CTACCAGCCG TTCCCGCCGG GGTACATCGG CCACGAACCG AGCCTGGAGA ACGCGTTTCC GTACGACCCG GAGAAGGCGA AGAAGCTGCT CGCCGAGGCC GGGTTCCCGG ACGGAGTGGA GGTGCCGCTG ACCACCACGG GCGCCAGCTC CGCTCTTGCC GAGCAGGTGC AGGCGCAGCT CGCCAAGGTC GGCGTGAAGA TCACCATCGA GACGATCCCG GCGGCGCAGG CCACCCAGAT CATGTACATC CAGCACTCGA GGGCGCTGGC CACGGACGGC TTCGCAGGTC GTGACTCGGC CGTGCAGGCC TTCCAGGTGC TGTTCGGCGA GCAGGGCCTG ATGAACCCCG GCCGGCAGAC GCCCCCCGAA CTGACCGCGG CGCTGCAGAA GGTGCGGGAG ACGCCGTTGG ACGATCCGTC GTATCCGACG GTGCTCCGGG CCGCCACGAA GATCGCGGTC GAGAAGATGC CGAACATCTT CCTCTTCACC ACGCCGCGCG TTCTCGCCCG CAAGAAGAAC GTCTCGGAGC TGGGCAGCTA CCTGGCGGTA CAGCGCTTCG AGGGCGTCCG GGTCGGGTAA
|
Protein sequence | MNAQLRADST LMGRRGFLGL GVLAGSSLLL AACGDDSGSV SSAGKEGGTL RWGWSAVTSW DPVTSSAGWD VHALSLAYAA LTKLDEKGNP VPALAESWKY NADGTQVAFT LRAGLTFSDG TPLNATAVSK SLARGRDFAG SLVAAQLANV KTLAADDDAR TVTIGLAATD YQIPSLLAGK TGMIVSPTAF EKDAKGLATK PVGAGPFRLT EYVPNQSAKL VRFPEYWDKA NIHLDAFELY PAPEAATAVP ALQSGRLDVA QIPGSQVEAA KAAGLEVQII PSLVTTVLDV NITMKPFDNP KVVEAFKHAL DRKALADTQT FGLGVVNYQP FPPGYIGHEP SLENAFPYDP EKAKKLLAEA GFPDGVEVPL TTTGASSALA EQVQAQLAKV GVKITIETIP AAQATQIMYI QHSRALATDG FAGRDSAVQA FQVLFGEQGL MNPGRQTPPE LTAALQKVRE TPLDDPSYPT VLRAATKIAV EKMPNIFLFT TPRVLARKKN VSELGSYLAV QRFEGVRVG
|
| |