Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0351 |
Symbol | |
ID | 7084857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 398059 |
End bp | 399225 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643697384 |
Product | hypothetical protein |
Protein accession | YP_002354032 |
Protein GI | 217968798 |
COG category | [R] General function prediction only |
COG ID | [COG3973] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0328547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAACA GAAGTTGGTG GCGTTCGAAA AACGAACTTG ATGAAGATCA GAAGGCCTTC ATTCAATTGC CAGCCCATGG TCGCTACTCC CTGGTCGGAC CGCCGGGTTC CGGAAAAACA AATCTCTTGC TGCTACGTGC GCAATTCATT GCCGGTATGG GTGAAAAGAA TGTCTTGATC GTTACCTACA CCAAGGCACT TGCCAATTTC ATCCGCTCCG GGATCGGCGC AAGCGGATTG ATTTCCCCGA ATCAAGTCAG AACCTACCAC TCATGGGCTT CATCGCACAT TCTCGAAAAT CTCGGGCAAC GTGCGGTTCC AAAAGGCGCT GACTTCGATG ACGCCACACG ACCAGCAATT CTTGAAATGC TGCGCGAGGC CAATGCCAAG CTGCCTACAA AACAGCTTTT CAGCGCCATC TTCGTCGACG AAGCACAAGA TCTTACCGCG GATGAACTCG AAGTTCTTTT GTCTCTAAGC GACAACGTCG CCATCTGTGG CGACGCGCGG CAGGGAATTT ATGACAAAAA TGGATTGTCG GCCGTCGAGA AACTTGCCTT GCAACCGCAC ACGTTGAAGC GACACTTCCG TATCGGCCAG AGGATCGCCA AGGTCGCAGA CAGGCTCATG CCACCTGAGA ATCCGGCGGA CAGCCTGGAA GCGATGTCCA ATTACGACCT GAAAGCACAG GGCGAATCGA GCGCACACAT GAATCCATGT GCGAACCGCG ACGAGCAGTT CGAGAAAATG CTCGAGAAAA TCGAGATTCA GCTCGATGCC TTCAAGGACG ACACCATCGG TATCTTCTGC GGCAAGCGCG AAACCCTCGA AGATCTGCGC ATCCGTTTTA ACAAGACCAA GCTCTCGAAG CAGGTCTGCG TACACGGCGT GGATGACGAC TCTAGTTTTT CTGACAACAA GCCCATCCAT GTGCTTACTA TTCACGCTTC GAAGGGCACC GAGTTCCGTG CGGTTCACCT GTTCGCAGTC GAAGAGCTGG CCAGCTACCC ACTGAACCGC CGCCGTCTGG GCTTCACGGC CATCACTCGT GCCAAGACGG CCCTCAACGC GTACCGCACC GGGGACACGA ATCAACCGCT GGAAAATGCG TTTGCGCAAC CCCAGCACAT GGAGCTGGAC GACCTTTTCC CTGGTGACAA ATCATGA
|
Protein sequence | MMNRSWWRSK NELDEDQKAF IQLPAHGRYS LVGPPGSGKT NLLLLRAQFI AGMGEKNVLI VTYTKALANF IRSGIGASGL ISPNQVRTYH SWASSHILEN LGQRAVPKGA DFDDATRPAI LEMLREANAK LPTKQLFSAI FVDEAQDLTA DELEVLLSLS DNVAICGDAR QGIYDKNGLS AVEKLALQPH TLKRHFRIGQ RIAKVADRLM PPENPADSLE AMSNYDLKAQ GESSAHMNPC ANRDEQFEKM LEKIEIQLDA FKDDTIGIFC GKRETLEDLR IRFNKTKLSK QVCVHGVDDD SSFSDNKPIH VLTIHASKGT EFRAVHLFAV EELASYPLNR RRLGFTAITR AKTALNAYRT GDTNQPLENA FAQPQHMELD DLFPGDKS
|
| |