Gene Information |
Locus tag | Tmz1t_3876 |
Symbol | |
ID | 7873527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4273343 |
End bp | 4274773 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700818 |
Product | protein of unknown function DUF404 |
Protein accession | YP_002890841 |
Protein GI | 237654527 |
COG category | [S] Function unknown |
COG ID | [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACGCCG CCAAGCCTTT CGATGAGATG CACGCCGCCG ACGGCACGAT CCGGACGCAC TACCAGCCCT ACAGCGAGTG GCTCGGGCAC ACCCCGCGCG AGCTGATCGC GCGCAAGCGC AAGGAGGCCG ATCTCGCCTT CCACCGCGTC GGCATCACCT TCAACGTCTA TGGCGCCGAC GGCGGCAAGG AGCGCCTGAT CCCCTTCGAC CTGCTGCCGC GCATCATCCC CGGCGACGAA TGGCGCACGC TCGAGGCCGG CCTGCGCCAG CGCGTGCGCG CGCTCAACGC CTTCCTCGCC GACATCTACC ACGGCCAGGA GATCCTGCGC GCCGGACGCA TCCCCGCCGA CCAGGTGCTG GACAACGCCC AGTTCCGCCC CGAGATGAAG GGTGTGCACG TGCCCGGCGG GCTGTACGCG ATGATCGCCG GCATCGACCT GGTGCGTGCC GCGGGTGCCG ACGGCAAGGG CGACTACTAC GTGCTGGAGG ACAACCTGCG CGTGCCCTCG GGGGTGAGCT ACCTGCTGGA GAACCGCAAG ATGATGATGC GGCTCTTCCC CGACCTGTTC TCGCGCTACG ACGTGCAGCC GGTCGAGCAC TACCCCGACC TGCTGCTCGA GACCCTGCGC GCGGTGGCGC CGGCCGGCGT ACTCGACCCC ACCGCGGTGC TGCTCACCCC GGGCGCCTTC AACAGCGCCT ACTTCGAGCA CAGCTTCCTC GCCCAGCAGA TGGGCATCGA GCTCGTCGAG GGCCAGGACC TCTTCGTCGA GGACGACACC GTGTTCATGC GCACCACCCA GGGGCCGCGG CGGGTGGATG TGATCTACCG GCGGCTGGAC GACGACTTCC TCGACCCCGA GGTCTTCCGC GCCGACTCGA TGCTCGGTGT GCCGGGCCTG ATGTCGGCCT ACCGCGCCGG CCGGGTGACG CTGGCCAATG CGGTCGGCAC GGGGGTGGCG GACGACAAGT CGATCTACCC CTACGTGCCC GAGATGGTGC GCTTCTACCT CGGCGAGGAG CCGATCCTCA ACAACGTCCC GACCTGGATG CTGCGCGAAC CCGACGACCT CGCCTACACG CTCGCCCACC TGCCCGAGCT GGTGGTCAAG GAGGTCCATG GCGCCGGCGG CTACGGCATG CTGGTGGGGC CGGCGGCCAC CAAGGCCGAG ATCGAGGAGT TCCGCCAGCG CATCCTCGCC GCGCCGGAGA AGTACATCGC CCAGCCGACC TTGTCCTTGT CCACCTGCCC GACCTTCGTC GACGCCGGCA TCGCGCCGCG CCACATCGAC CTGCGACCCT TCGTGCTCTC GGGCGGACGC GAGATCCGCA TGGTGCCGGG CGGCCTCACC CGCGTCGCGC TCAAGGCCGG CTCGCTGGTG GTCAACTCCT CGCAGGGCGG GGGCACCAAG GACACCTGGG TGGTGGGCTG A
Protein sequence | MNAAKPFDEM HAADGTIRTH YQPYSEWLGH TPRELIARKR KEADLAFHRV GITFNVYGAD GGKERLIPFD LLPRIIPGDE WRTLEAGLRQ RVRALNAFLA DIYHGQEILR AGRIPADQVL DNAQFRPEMK GVHVPGGLYA MIAGIDLVRA AGADGKGDYY VLEDNLRVPS GVSYLLENRK MMMRLFPDLF SRYDVQPVEH YPDLLLETLR AVAPAGVLDP TAVLLTPGAF NSAYFEHSFL AQQMGIELVE GQDLFVEDDT VFMRTTQGPR RVDVIYRRLD DDFLDPEVFR ADSMLGVPGL MSAYRAGRVT LANAVGTGVA DDKSIYPYVP EMVRFYLGEE PILNNVPTWM LREPDDLAYT LAHLPELVVK EVHGAGGYGM LVGPAATKAE IEEFRQRILA APEKYIAQPT LSLSTCPTFV DAGIAPRHID LRPFVLSGGR EIRMVPGGLT RVALKAGSLV VNSSQGGGTK DTWVVG
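
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is an illustration only, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table (Tmz1t_3876).
start_bp, end_bp = 4273343, 4274773

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1431, matches "Gene Length"
print(protein_length_aa(length))  # 476, matches "Protein Length"
```

Under these assumptions, both computed values agree with the Gene Length (1431 bp) and Protein Length (476 aa) fields reported above.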