Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0515 |
Symbol | |
ID | 7085129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 580622 |
End bp | 582298 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643697543 |
Product | hypothetical protein |
Protein accession | YP_002354185 |
Protein GI | 217968951 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0189041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGCA CCCACCCCGA AGGCTGGCGC CTGCTCCCCG CGGAGGGCGC GCGCGGGCGC GAGCTCGCCA CCCTCGCCCG CCTCGCCGCC GCGTTGCCCG AGGACACCAC GATCTACCAC AGCCTGCACT GGACGCGCGC CGAGGGTCGC ACGGCGGTGT TCGGCGAGGT CGATTTCGCC GTGGTCGGCC CGGGCGGGCG GGTGCTGCTG GTCGAACAGC AGGCCGGCTT CCTCGACGAG ACGCCTACCG GCCTGCTGCG CCCCGGCAGG CGCCGCGCGG CGCAGCTCAG CGTCGAGCTC GCGCGCAACG CCGACGCCCT GCGCGCCCGC CTGCGCCCGC TGCTCGACGG CACCGAGCCC GTGCTCGACG TGCTGCTGCA CTGCCCCGAC CACTGCGTGC GCCAGCCCGG CACCGCCGGA CTCGACCCCG AGCGCATCGT CGACGCCGGC CGGCGCGACC AGCTCCCCGC GATCGTCGCC GCGCTGACGC GGCCGCAGGA GGGCGACCCG GCGCTCGACG CCGCGCACCG CCAGGCCCTC CACCGCTTCT TCGCCGACCT GCTCGAGCTG GTGCCGGACG TGCAGTCGCA CATCGGCCAG GTCGAAGACC TCACCACCCG GCTGTCGGGC GGGCTCACCG AGTGGGCGCG CCGCATCGTG GTCGAGCCGC ACCGCCTGCG CGTGACCGCC ACCGCCGGCA GCGGCAAGAC CCAGCTCGCG CTCGCCGCCT ACACCGACGC GCTCGCCGCC GGCCGGCGCC CGCTCTACGT GTGCTACAAC CGTCCGCTCG CCGACCACTT CGCGCGTATC GCGCCTGCCG GCGGCGAGAT CGCCACCTAC CACCAGCTCT GCGACCGCCG CGTGCGCGAC GCCGGCCGCA AGCCGGCCTT CGGCGCGCCC GGCGCCTTCC GCCGCATGGA GGCGGACTTC GCCACCCTGG TGCGCGAGCA GCCCGATCCC GCCTGGCAGT TCGACGAGCT CATCATCGAC GAGGGCCAGG ACTTCACCGA GCCCTGGCGC GATGCCCTGC TCGCGCTGCT GAAGCCGGCG GGGCGCGCCT GGTGGCTGGA AGACCCGCTG CAGAACCTCT ACGACCGTCC GCCGGTGGCG CTGCCCGGCT GGGTGGGGCT GAGCGCCGAC ACCAACTACC GCACCCCGGC CGACGTGCTC GCACTCCTCA ACGCCCGCCT GCCACTGCCC GCCCCGGTGC GCGCCGGCAG CCCGGTGACC GACAGCGAAC CGGAGATCTT CGCCTGGCGC GACGAAGCCG GGCTGATGGA TGCCACCAAA CGCGCGATCA CGCGCGCCCT CGGCCTGGGC TTCCGCCGCG ACGCGATCGT GCTGCTGACC TTCCGCGGCC GCGAGCATTC GCGCTTCACC CCGCTCGACC ACCTCGGCCC CCACCGCCTG CGCGCCTTCA CCGGCCGCTA CGACCTGCTC GGCGAGCCGG AGCACTCGCA GGGCGAACTG CTGATCGATT CGGTGTATCG CTTCAAGGGC CAGTCGGCGC CCTGCATCCT GTTTACCGAG ATCGACTTCG AGACGACGGG CGGGCGAATG GACGAGCTCA CCATGCGCAA GCTTTTCGTC GGCGCCACGC GGGCGACGAT GAAGTTGATC CTGGTGGCGT CGGAGCGTGC GGCGGCCGCG CTCGCGCCGG TCGCGGAAGC GGGCTGA
|
Protein sequence | MARTHPEGWR LLPAEGARGR ELATLARLAA ALPEDTTIYH SLHWTRAEGR TAVFGEVDFA VVGPGGRVLL VEQQAGFLDE TPTGLLRPGR RRAAQLSVEL ARNADALRAR LRPLLDGTEP VLDVLLHCPD HCVRQPGTAG LDPERIVDAG RRDQLPAIVA ALTRPQEGDP ALDAAHRQAL HRFFADLLEL VPDVQSHIGQ VEDLTTRLSG GLTEWARRIV VEPHRLRVTA TAGSGKTQLA LAAYTDALAA GRRPLYVCYN RPLADHFARI APAGGEIATY HQLCDRRVRD AGRKPAFGAP GAFRRMEADF ATLVREQPDP AWQFDELIID EGQDFTEPWR DALLALLKPA GRAWWLEDPL QNLYDRPPVA LPGWVGLSAD TNYRTPADVL ALLNARLPLP APVRAGSPVT DSEPEIFAWR DEAGLMDATK RAITRALGLG FRRDAIVLLT FRGREHSRFT PLDHLGPHRL RAFTGRYDLL GEPEHSQGEL LIDSVYRFKG QSAPCILFTE IDFETTGGRM DELTMRKLFV GATRATMKLI LVASERAAAA LAPVAEAG
|
| |