Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5340 |
Symbol | |
ID | 5673674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6437373 |
End bp | 6438449 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641244198 |
Product | extracellular solute-binding protein |
Protein accession | YP_001509604 |
Protein GI | 158317096 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.927258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCACACC GTCACGCGCG CACGCCAGAC CCGTCCGGTC TGTGCTCAGG GAAAGGGAGT CTGATGGGCA GGGCGAGAAG AGCGCTGGCC GTGCTGGCGA CGGCCATCCT CGTGGGGGGC CTGTCGGCGG CGTGCGGCGG GGATGACGGG AAGACGCTCA CGCTCTACAA CGCGCAGCAT CGGGACCTGA TGCAGGTGCT GGTCGACGCG TTCACCAAGG AGACCGGCAT CAAGGTCGAG ATGCGTAACG GCGGTGACGC GGAGCTTGCG AACCAGATCG TCCAGGAGGG CGACAGCTCG CCCGCGGATC TGTTCGCCAC CGAGAACTCG CCGGCCATGA CGCTGGTCGA CCGGGCTGGC GGGTTCAGCC CGCTCGACCA GGCCACCCTC GACCAGATGC CCGACCAGTA CGTCCCGAGC TCCGGCACCT GGGTCGGCTT CGCGGCGCGG TCGACGGTGT TCATCTACAA CCGCGACCAG GTCGACAAGG ACGCGCTACC AACGTCGATC ATGGATCTGG CCCGGCCGGA GTGGCAGGGT CGAGTGGGCG TCGCGGCCGG TGGCGCCGAC TTCCAGGCCA TCGTCAGCGC TGTGCTCGCG GTGGAGGGCG AGGACGCTGC CGCCGACTGG CTCGCCGGAC TGAAGCGCAA CGCCAAGATC TACGACAACA ACATCGCCGC GCTGCGTGCC GTGAACGCCG GCGAGGTGCC CGCCGCCGTG ATCTACCACT ACTACTGGTA CCAGGACCAG GCGGAGTCGG GAGAGATCAG CAGGAACGTC GACCTGCACT TCTTCGGGAA CCAGGACGCG GGCGCGTTCC TCAGCGTCTC CGGCGTCGGC GTGATCGCGG CCAGCGACCA GCAGGCCGAG GCGCAGCAAC TGGTCAGGTT CCTCACCAGC GAAGCCGGGC AGCGGGCGCT CGTCGACAGC GGCGCCCTGG AGTACGCCGT GTCGGACAAG GCCCCCACAA ACCCCGCGCT GACGCCCCTG GCGGACCTCG ACGCACCGCA CATCGACATC TCGACCCTGA ACGGCCCGAA GGTCATCGAG CTGATGCAGC AGGCGGGTCT GCTCTGA
|
Protein sequence | MPHRHARTPD PSGLCSGKGS LMGRARRALA VLATAILVGG LSAACGGDDG KTLTLYNAQH RDLMQVLVDA FTKETGIKVE MRNGGDAELA NQIVQEGDSS PADLFATENS PAMTLVDRAG GFSPLDQATL DQMPDQYVPS SGTWVGFAAR STVFIYNRDQ VDKDALPTSI MDLARPEWQG RVGVAAGGAD FQAIVSAVLA VEGEDAAADW LAGLKRNAKI YDNNIAALRA VNAGEVPAAV IYHYYWYQDQ AESGEISRNV DLHFFGNQDA GAFLSVSGVG VIAASDQQAE AQQLVRFLTS EAGQRALVDS GALEYAVSDK APTNPALTPL ADLDAPHIDI STLNGPKVIE LMQQAGLL
|
| |