Gene Information |
Locus tag | Tmz1t_3878 |
Symbol | |
ID | 7873529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4275906 |
End bp | 4278512 |
Gene Length | 2607 bp |
Protein Length | 868 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643700820 |
Product | protein of unknown function DUF404 |
Protein accession | YP_002890843 |
Protein GI | 237654529 |
COG category | [S] Function unknown |
COG ID | [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
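The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the function name `feature_lengths` is hypothetical and is not part of the record or of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1          # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = feature_lengths(4275906, 4278512)
print(gene_len, protein_len)  # 2607 868
```

Both results agree with the Gene Length (2607 bp) and Protein Length (868 aa) fields. The strand ("-") does not affect the length arithmetic; it only determines which strand of the replicon is read.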
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGTGC AGCCGCACAG CCCGCCCGCG CTGCTCGCCC GCTACGTCGC GCGGCCCGAC CGCTACGACG AGCTCTGCCA ACGCATCGGC AGCGTCGTGG TGCTGCGGCC GCACTGGCGG GACTTCTTCC ACGGCCTCGC CGCCATGCCC GCCGACGAGC TGTCGGCACG GCGGGCCGCG CTCGGCCGGC AGATCCACGA GAACGGCATC ACCTACAACG TGTATGCCGA TCCGCGCGGC TTCGCGCGCC CGTGGGAGCT CGACCTGCTG CCCTACATGC TGCCCGCGGC GGAGTGGGCG CAGATCGAGG CCGCGGTGAT CCAGCGCGCC ACCCTGTTCG ACCGCATCCT CGCCGACCTC TACGGCGAGC AGACCCTGCT CGAGCAGGGC CTGGTGCCGC CCGCGCTCAT CTATGGCCAT AGCGGCTTTC TGCGCCCGCT CGTGGGCGCG CGGCCGGCGG GCGGGCGCTT CCTGCACCTG TACGCGGTGG ACCTGGCGCG CTCGCCCGAC GGGCGCTGGT GGGTGGTGGC CGACCGCACC CAGGCGCCCT CGGGTGCGGG CTACGCGCTG GAGAACCGCA GCGTGGTCGC GCGCGTGCTG CCCGAGCTCT ACCGCGCCGC CGGCGTCGAG CCGCTGGTGC CCTTCTTCGA GGGCTTCCGC GACGGCCTCG TCGAGCTCGC GCCGGCCGAC GGTGCGGCGG CCGGGGAAGA TCCCCTGATC GTGGTGCTCA CGCCCGGGCC CTACAACGAG ACCTACTTCG AGCACGCCTT CCTCGCCCGC GAGATGGGTT TCCCGCTCGT CGAGGGCCAG GACCTCACCG TGCGCGACGA CAAGGTCTGG TTGCGCACGC TCGAGGGCCT GCGCCGGGTG CATGTGATCC TGCGCCGGGT GGACGACCTG TGGTGCGATC CGCTCGAACT GCGCGAAGAC TCCGCGCTCG GCGTCGCCGG CCTGGTGGCG GCGGTGCGTG CCGGCATGGT GACGGTGGCC AACGCGCTCG GCAGCGGCAT CCTCGAGACC GGGGCGTTGC TCGGCTACCT GCCGCGGCTG TGCGAGCACC TGCTCGGCAG CAAGCTGCGC ATGCCCTCGG TGGCGACCTG GTGGTGCGGC GAGCCGGCGG CCTGCGAATA CGCGCTGAAG CACCTGCGCG AGCTGGTGAT CAAGCCGGCC TACCCCACCC TGGGTGCGCG GCCGGTGTTC TGCGGCGACC TGCCGGTGGC AGAGCTCGAC GCGCTGGCGG CACGGATCGC GCTGCGGCCC TTCGACTTCG TCGCGCAGGA GATGGTCAAC CTCTCGCAGG CCCCGGTGCT GGCGGACGAG GCGGGCGGCG CCGGGAACAA GGCGCCCGCG CGCGACGCGC GCGCGCTGGT CGCGCGCAAC ATCGGGCTGC GCGTGTTCGC GGCGGCGGGG GCGGAGGGGT TTCGCGTGCT GCCGGGCGGG CTCGTCCGGG TGGCCTCGGA GGCCGACATG CGCATCGTGT CGATGCAGCA CGGCGGCGGC AGCAAGGACG CCTGGGTGCT CGGCTCGCCG GCGCGGCCGC GTCCGGTGCC GCGCATCGTG CCCGGCCAGC TCGCGCCCTT TGCCGGGGGG GCGCGCGTGC TCGGCCTGTC GAGCCGGGTG GCGGAGAACT TCTTCTGGCT CGGACGCTAC AGCGAGCGCG CCGACGCTGC CGCCCGCCTC GGCCGCGAGA CGCTGGCGCG CCTGGGCGAG GCCGGCGAGG CCTCGTGGCT GGGCGAGGAT GGCAAGGTGA CCGCGGCCCT GCAGGCGCTG TGCCGGCAGC GCGGCCTGCT CGCCACCGTG 
GCGGGGGAGG CCGACGCGGC ACCCACACCC CCCGCGCCGC TCGCTCAGGC GCTCTTCGAT CCCGCCCAGC CCGGCAGCGT GGTGGCCAAC CTGCGCCAGG TGCTGCGCGT CGCAGCCCTG GTGCGCGACC GCCTGTCGCC GGACAGCTGG CGCATCTACA ACCGCCTGTC GGAGTTTGCC GCGGCGCCGG CTCAGGCCCC GCGCCTCGGC GAGGCGCTGC AGCGCCTGGA CGAGTCGTTG CTGTCGCTGG TGACGCTGTC GGGTTTCGTC ATCGAGAGCA TGCCGCGCGA CGCCGGCTGG CGCTTCCTGT CGATCGGGCG GCGCATCGAG CGCCTGCAGT TTCTCGCCGC CGCGCTGTCG GCGCTGCTGC TCGAGCCCTC GCAAGGCGGG CTCAAGGTGC TGCTCGCGAT CACCGACGCC GAGCTGCGCT ACCGCAGCCG CCACGCGCGC GGCCTGGCCC CGCAGCCGGT GGCCGAGCTG GTCATGCTCG ACGGCGACAA CCCGCGCGCG CTGCGCTACC AGCTCGATTC GCTCGCCGAG CACGTCGCCC TGCTGCCCGA CGGCGCCGCG CTCGCCGCGC CGCTGCGCGA ACGCCTGGCC GCGCTCGACG CGCTCGCGCT GTGCGACTGC TTCGTGCACG AGCTGGCCGG CGCGCGTAGC CGCGCGGCCG CACAGGCGCG CCACTGCGAG CCCCTGCGGC AGGTGCTGGA CGACATCTGG CGCGGCGGCA ACGAGCTCGC CGATGCGCTG GCGCGGCGCT ACTTCACCCA CCTCGACGCC CGCAGCCGGG CGACCGTGTC GCTGTAG
|
Protein sequence | MQVQPHSPPA LLARYVARPD RYDELCQRIG SVVVLRPHWR DFFHGLAAMP ADELSARRAA LGRQIHENGI TYNVYADPRG FARPWELDLL PYMLPAAEWA QIEAAVIQRA TLFDRILADL YGEQTLLEQG LVPPALIYGH SGFLRPLVGA RPAGGRFLHL YAVDLARSPD GRWWVVADRT QAPSGAGYAL ENRSVVARVL PELYRAAGVE PLVPFFEGFR DGLVELAPAD GAAAGEDPLI VVLTPGPYNE TYFEHAFLAR EMGFPLVEGQ DLTVRDDKVW LRTLEGLRRV HVILRRVDDL WCDPLELRED SALGVAGLVA AVRAGMVTVA NALGSGILET GALLGYLPRL CEHLLGSKLR MPSVATWWCG EPAACEYALK HLRELVIKPA YPTLGARPVF CGDLPVAELD ALAARIALRP FDFVAQEMVN LSQAPVLADE AGGAGNKAPA RDARALVARN IGLRVFAAAG AEGFRVLPGG LVRVASEADM RIVSMQHGGG SKDAWVLGSP ARPRPVPRIV PGQLAPFAGG ARVLGLSSRV AENFFWLGRY SERADAAARL GRETLARLGE AGEASWLGED GKVTAALQAL CRQRGLLATV AGEADAAPTP PAPLAQALFD PAQPGSVVAN LRQVLRVAAL VRDRLSPDSW RIYNRLSEFA AAPAQAPRLG EALQRLDESL LSLVTLSGFV IESMPRDAGW RFLSIGRRIE RLQFLAAALS ALLLEPSQGG LKVLLAITDA ELRYRSRHAR GLAPQPVAEL VMLDGDNPRA LRYQLDSLAE HVALLPDGAA LAAPLRERLA ALDALALCDC FVHELAGARS RAAAQARHCE PLRQVLDDIW RGGNELADAL ARRYFTHLDA RSRATVSL
|
| |
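The Gene sequence and Protein sequence fields can be related to each other, and to the GC content field, with a short script. This is a sketch under stated assumptions rather than the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon assignments are those of translation table 11 (identical to the standard code for internal codons), and any special handling of alternative start codons is ignored because this gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the Gene sequence field, as displayed in the record.
print(translate("ATGCAGGTGC AGCCGCACAG CCCGCCCGCG"))  # MQVQPHSPPA
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence value once its grouping spaces are removed, and `gc_content` applied to the same value can be compared with the GC content field.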