Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1342 |
Symbol | |
ID | 7084463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1488083 |
End bp | 1489132 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643698359 |
Product | allantoicase |
Protein accession | YP_002354997 |
Protein GI | 217969763 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG4266] Allantoicase |
TIGRFAM ID | [TIGR02961] allantoicase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0206885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAGCC ACACCCCCGG CGCCCCCGTC TTCGCCCCCG CTGCCGAGCT GCCCGACTGG GCGCTGCGCT CGGTCGACCT CGCCAACCCG CGCCTCGGCG CCAAGGCGAT CGCCGCCTCC GACGACTTCT TCGCCGAAGT CGGCCGCATG CTCAACCCCG AACCCGCGCA GTTCGTCCCC GGCAAGTTCG ACACCAACGG CAAGTGGATG GACGGCTGGG AGTCGCGCAG GAAGCGCGTC GCCGGCCACG ACTGGGCGCT CGTGAAACTC GGCGTCAAGG GCGTGATCCG CGGCTTCGAC GTCGACACCT CCCACTTCAC CGGCAACTAC CCGCCCGCGG TGTCGATCGA GGCCTGCGTG TCCGAGACCG ACGACGTCGC CGCCCTGCAG GCGGCCGAGT GGACCGAGAT CCTCCCCGCC AGCCCGATGG GCCCCAACAG CCACCACCTT CTCGAGTGCG CCAGCACCGC GGCGTGGACC CACCTGCGCG TGAACATGTA CCCCGACGGC GGCATCGCCC GCCTGCGCGT CTACGGCCGC CCGGTGGGCA CACTCGCGGC CAAGGCCGCC GGCAGCGAGC TGGTCGACCT GGTAGCGATG GAGAACGGCG GCCGCGCGCT GTCGTGGAAC GACGCCAGCT TCGGCTCATC CGCCGCCGCC CTGCTGCTGC CCGGGCGCGG CATGAACATG GGCGACGGCT GGGAGACCCG ACGCCGCCGC GAGCCCGGCA ACGACTGGTG CGTGCTCGAG CTCGGCGCCC CCGGCACGGT CGAGCGCATC GAGGTCGACA CCGCCTTCTT CAAGGGCAAC TACCCCGACC GCTGCTCGGT GCAGGCCGCC TACGTCAGCG GCGGCACCGA CCGCTCGATC ACCACCCAGT CGATGTTCTG GCAGACCCTG CTGCCCGAGC AGAAGCTGGA GATGGACGCG ATCCATACCT TTACGGAGCA GGTCGCGAAA CTGGGGCCGA TCACCCACGT CCGCTTCAAC ATCTTCCCCG ACGGCGGCGT CTCGCGCCTG CGCCTGTGGG GCAAGGTCGA GGCGAAGTGA
|
Protein sequence | MASHTPGAPV FAPAAELPDW ALRSVDLANP RLGAKAIAAS DDFFAEVGRM LNPEPAQFVP GKFDTNGKWM DGWESRRKRV AGHDWALVKL GVKGVIRGFD VDTSHFTGNY PPAVSIEACV SETDDVAALQ AAEWTEILPA SPMGPNSHHL LECASTAAWT HLRVNMYPDG GIARLRVYGR PVGTLAAKAA GSELVDLVAM ENGGRALSWN DASFGSSAAA LLLPGRGMNM GDGWETRRRR EPGNDWCVLE LGAPGTVERI EVDTAFFKGN YPDRCSVQAA YVSGGTDRSI TTQSMFWQTL LPEQKLEMDA IHTFTEQVAK LGPITHVRFN IFPDGGVSRL RLWGKVEAK
|
| |