Gene Information |
Locus tag | Tmz1t_2587 |
Symbol | |
ID | 7873328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2791891 |
End bp | 2792904 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643699510 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002889566 |
Protein GI | 237653252 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
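The length fields in this record follow from the coordinates. The sketch below reproduces them, assuming that Start bp and End bp are 1-based and inclusive and that the final codon is a stop codon that is not translated. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 2791891
end_bp = 2792904

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1014 bp) and Protein Length (337 aa).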
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.390541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGA AAGTCCTCAT CGCCTCGATC ATCGCCGCCT TCGGTCTCAC CGGCACCGTC ACCGCCGCCG AGCTGAACGT CTACTCCGCC CGCCACTACC AGACAGACGA AGAGCTCTAC GGCAATTTCA CCAAGCAGAC CGGGATCAAG ATCAACCGCA TCGAGGCGAA GGAAGACGAA CTGCTCGAGC GCATCCGCAA CGAGGGCGCC AACAGTCCGG CCGACATCTT CGTCACCGTC GATGCCTCGC GCCTGGCGAA GGCCGACGAA CTCGGCATCT TCCGGCCGGT GAAGTCCGCC GCGCTCGAGG CCCGCATCCC CGCCCACCTG CGCGCACCGA ACTGGTTTTC CTACTCGACC CGCGCACGCG TGATCGTCTA CAACCCGGAC ATGGTCAAGG CCGAGCAGGT GCAGACCTAC GAGCAGCTCG CCGACCCCGC GCTCAAGGGC CAGGTGTGCA CCCGCTCGGG CAGCCACCCC TACAACCTCT CGCTCGGCGC CGCCATGATC AAGCACAACG GCGCGGAAGC AACCGAGAAC TGGGCCCGGG GCATCGTCGC CAACTTCGCA CGCGCGCCCA AGGGCGGCGA CACCGACCAG ATCCGCGCCG TGGCCGCGGG CGAGTGCGGC GTGGCCATCG CCAACAGCTA CTACCTCGCC CGCCTGATGA ACTCGGACAA GCGCGAAGAC CAGGCCGTGG TCGCGAAGAT CAAGGCGGTG TGGCCGAACC AGGCGACCTG GGGCACCCAC ATCAACGTGT CCGGCGCCGG CATGCTCGAG CACGCGCCGA ACAAGGAGGC CGCGGTCAAG TTCCTCGAGT ATCTCGCCTC GGACCAGGCG CAGGAATACT TCGCCAACGG CAACAACGAA TGGCCTGCGG TGCCGAGCGT GAAGGTGGAC AACCCCGCGC TGAAGAAGCT CGGCGAGTTC AAGGCCGACA CCCTGCCGAT CGGCGAGCTC GCCGACACGG TGGCCGAGGC CCAGCGCATC TTCGACCGCG CCGGCTACCG CTGA
Protein sequence | MSKKVLIASI IAAFGLTGTV TAAELNVYSA RHYQTDEELY GNFTKQTGIK INRIEAKEDE LLERIRNEGA NSPADIFVTV DASRLAKADE LGIFRPVKSA ALEARIPAHL RAPNWFSYST RARVIVYNPD MVKAEQVQTY EQLADPALKG QVCTRSGSHP YNLSLGAAMI KHNGAEATEN WARGIVANFA RAPKGGDTDQ IRAVAAGECG VAIANSYYLA RLMNSDKRED QAVVAKIKAV WPNQATWGTH INVSGAGMLE HAPNKEAAVK FLEYLASDQA QEYFANGNNE WPAVPSVKVD NPALKKLGEF KADTLPIGEL ADTVAEAQRI FDRAGYR
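The sequences above are printed in space-separated blocks of 10 characters, so the spaces need to be stripped before any analysis. A minimal sketch of computing GC content from such a listing follows; the function name `gc_fraction` and the sample input are hypothetical, and the sample holds only the first two blocks of the gene sequence, so its result will not match the 67% reported for the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    # Drop the whitespace that separates the 10-base blocks in the listing.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Hypothetical input: only the first two blocks of the gene sequence above.
sample = "ATGAGCAAGA AAGTCCTCAT"
print(f"GC content of sample: {gc_fraction(sample):.0%}")
```

Passing the full Gene sequence string to the same function would give the value to compare against the GC content field.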