Gene Information |
Locus tag | Franean1_2792 |
Symbol | |
ID | 5671181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3300183 |
End bp | 3301268 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641241701 |
Product | hypothetical protein |
Protein accession | YP_001507121 |
Protein GI | 158314613 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGACG GGGGCGACTA CGAACTCGTC TGGCCACGGC ACCTGTTCGT CCACGAAGCG AGCAACCTGC TCAACCACCG CAGGATCCAC CGTGACTGGG ATGACCGCTG CCTACTACTT CTCGATCACG CGTTCGCGGG CCCTACTCCC CAGGACGACT TCCGTCAGGC AGCCGCACAG TCCCCGCCTC CACGTGGCCT GAGCAACGGG CAGGCATTCC TCCGCGACCT GATGTCCAGC TCCGAACAGC TCCGCGAGAC CACAACACCG CAGTACCGCC CGTACTGGTC CGAACGCCGG GCGGGTACCT CCCCTGACCG TGCCGGTCTG CGTGCCACAG CCCGCCAGTT CATCGATATC GTCAGCCATC TCAACAACCA CGGATACTTC GAGCAGGCGT TCGGTAAGGA CTGCGTCGAC GACCCCAGCG AGATCGACCC CTCGGCCGTC ATCGAGCGCG CCATCGGCGC CGCAGACCTG TGGCCGCTGA CGCCGGACCG ACTCGCACAG AACATCGACG TGTTCTGCGA CGTGGTCGAA GTGCTCCACG ATCTGGTAGC GCGCCCCCGA TCTCGCGGAC TACACGACTA CGACGGATGC GGCTGGCACT ACCGCGATTT CTCTCCCGCC ACAGGCCGCG TCGTCTACCG GTGGCGCGTC AATGGTCTGC TCGAACGAAG CGACCTCGGC CTCCACCTCG CGGACGAAGG CGAAGACGTC GGTCGCCTGG TCACCAGCAC CGATCCCGCC CGATCGGACC TCCTGAGCCG CATGGCCCAG CGAGAAAGCC CGGCCGCTGA CCGGCTCCGC CACGCCATCA GCCTGTACCG GGCACGGCAC GCCGACGAAC ACACCAAGAG ATCCGCGGTC GTCGTCCTCA GTGGCGTTCT CGAAGAACGC CGACAGCTGA TCAAAGATGA GCTGCTCAGC AAGGACGAAG GTGACCTCTT CACGATCGCG AACAAGTTCG CGATCCGGCA CCAGAACGAA CAACAGAAAA CCGACTATAG CGCCGAGTTC CTCGACTGGA TCTTCTGGTG GTACCTCGCG ACGATCGAGC TCACCGACCA TCTCCTCGCA CGCTAA
|
Protein sequence | MYDGGDYELV WPRHLFVHEA SNLLNHRRIH RDWDDRCLLL LDHAFAGPTP QDDFRQAAAQ SPPPRGLSNG QAFLRDLMSS SEQLRETTTP QYRPYWSERR AGTSPDRAGL RATARQFIDI VSHLNNHGYF EQAFGKDCVD DPSEIDPSAV IERAIGAADL WPLTPDRLAQ NIDVFCDVVE VLHDLVARPR SRGLHDYDGC GWHYRDFSPA TGRVVYRWRV NGLLERSDLG LHLADEGEDV GRLVTSTDPA RSDLLSRMAQ RESPAADRLR HAISLYRARH ADEHTKRSAV VVLSGVLEER RQLIKDELLS KDEGDLFTIA NKFAIRHQNE QQKTDYSAEF LDWIFWWYLA TIELTDHLLA R
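The length fields in this record can be cross-checked against the coordinates. The sketch below is only a consistency check, not part of the source database: it assumes the start and end positions are 1-based and inclusive, and that the final codon of this non-spliced CDS is a stop codon (the gene sequence above ends in TAA). The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 3300183
end_bp = 3301268
protein_length_aa = 361

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS is a whole number of codons; the last codon is the stop
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)  # 1086 0 361
```

Under these assumptions the computed values agree with the stated gene length of 1086 bp and protein length of 361 aa.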