Gene Information |
Locus tag | Tmz1t_0320 |
Symbol | |
ID | 7085621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 361755 |
End bp | 363740 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697357 |
Product | hypothetical protein |
Protein accession | YP_002354005 |
Protein GI | 217968771 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3843] Type IV secretory pathway, VirD2 components (relaxase) |
TIGRFAM ID | |
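
The coordinates and lengths listed above agree with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(361755, 363740)
print(length_bp)                  # 1986
print(protein_length(length_bp))  # 661
```
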
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACC GCCGCGACGA CGATTTCCGC GTGCGCCCCA GCGCCCCGAA GAACCGGGGC AAGGGCCAGG GGCAGAGCTT CGTTTCCAAG GTACTCAAGC AGGCGGGCAA GGCCAGCAGC GGCAAGTCCA CGGTGCGCCT CCCAGCATCG GCGCGTGGCA CCGGCCAGCG CCCCGGCTCG CGCCTGGGGC GCGGCCACAC GGCGGCGCGC TTCGCCAGGG CGAAGCTGAC GCCCATGTCG CGGCGCGTGA CCATCAAGAC GCTGCTGGTG AATCACCAGC GGGCCAGCCC GCAGTCGCTC GCCAAGCACC TGCGCTACAT CGAGCGCGAT GGCGTGGGGC GCGACGGCGA GCCGGGCCAA GCCTACGGGC CGCAGACCGA TACCGCCGAC CTCGACGCCT TCAAGGAACG CTGCGCCGAC GACCGGCACC ATTTCCGCTT CATCCTCTCG CCCGAGGATG GCGCGGAGCT GGAAGACCTG CGCACCTACA CGCGGCACCT CATGGGCCGC ATGGAGGCCG ACCTGGGCAC GGGCCTGGAT TGGGTGGCCG TCAACCACTG GAACACCGAC AACCCGCACA CGCACATCGT CGTGCGCGGG CGCGACGACA TCGGCAAAGA CCTCATCATC GCGGGCGACT ACATCGCCGA CGGTTTCCGC CACCGCGCCG CAGAGCTGGC GACCGAATGG CTGGGGCCGC GCACCGAACT GGAGATCCAG CAGACCTTGC AGCGCGAGGT GGAGCAAGAG CGGTGGACGA GCCTGGATCG CACCTTGAAG CGCGAAGCCG GCGACGATGG CCTGTTGCAT GTCGAACGGC TCAACGAACC CCACTTGCAG CGCCAGCGCC TGCTGCTGAT CGGCCGCCTG CAACGCTTGC AGCGCCTGGG CCTGGCCGAC GAGACGCAGC CCGGCACCTG GGCCGTCCAT GCCGATGCGG AAAAGACCCT GCGTGCCCTG GGCGAGCGCG GCGACATCAT CCGCACCATG CAGCGGGCCA TGCGCGGCGA GCCGCGCGAG CTGGCGGTGT TCGAGCCGGG CGACGACGGG CAAACCATCC TCGGGCGCGT GGCCGCGAAG GGACTGGCCG ACGAGCTGCG CGACCGGGGC TATCTGGTCA TCGACGGCGT GGATGGCAAG GCGCACTACG TCGCGCTCAA CGCCCACGAC GAACTGGCGA ACTATCCGAC CGGCGCCGTG GTGGAGGTCA AGGGATCGGC CGACGTGCGC GCAGCCGACA GGAACATCGC CGCGCTGGCG AGCGATGGCC TGTACCGCAC CGATCACCAC TTGGCGATCG AGCAAGGCCG AGCCAAAGCC GGACGCGATC CGCAGGAGGT TGTCGCCGCC CACGTCCGCC GGCTGGAAGC CCTGCGCCGG GCCGGCATTG TGGAGCGCGT GGCGGAAGGG CTATGGAAGG TGCCGGACGA CCTGGCCGAG CGTGGCCGCC AGTACGACGC GCAGCGCCTG GGCGGCGTGG CTGTGGAACT GAAATCGCAC TTGCCCATCG AGCGGCAGGC CCGCGTGATC GGTTCCACCT GGCTCGACCA GCAGTTGATC GGTGGCGGCT CGGGCCTGGG CGACCTGGGC TTTGGCGGCG AGGCCAAGCA GGCCATGCAG CAGCGCGCCG ACTTCCTGAC CGAACAAGGG CTGGCCGAGC GGCGCGGGCA GCGCGTGATC CTCGCCCGGA ACCTGCTGGG CACGTTGCGC AATCGGGAAC TGGCGCAGGC CGCCAAGGAC ATTGCCGCCG AATCCGGCTT GGAGCACCGT CCGGTGGCCG ACGGCCAGCG CGTGGCCGGT 
ATCTACCGGC GCAGCATCAT GCTCGCCAGC GGGCGCTACG CGATGCTCGA TGACGGCATG GGGTTCAGCT TGGTGCCGTG GAAGCCAGTG ATCGAACAGC GGCTGGGCCA GCAGCTCGCG GCCACAGTGC GCGGTGGCGG GGTGTCTTGG GAGATTGGGC GCGCACGCGG GCCTGCTATC AGTTGA
Protein sequence | MSDRRDDDFR VRPSAPKNRG KGQGQSFVSK VLKQAGKASS GKSTVRLPAS ARGTGQRPGS RLGRGHTAAR FARAKLTPMS RRVTIKTLLV NHQRASPQSL AKHLRYIERD GVGRDGEPGQ AYGPQTDTAD LDAFKERCAD DRHHFRFILS PEDGAELEDL RTYTRHLMGR MEADLGTGLD WVAVNHWNTD NPHTHIVVRG RDDIGKDLII AGDYIADGFR HRAAELATEW LGPRTELEIQ QTLQREVEQE RWTSLDRTLK REAGDDGLLH VERLNEPHLQ RQRLLLIGRL QRLQRLGLAD ETQPGTWAVH ADAEKTLRAL GERGDIIRTM QRAMRGEPRE LAVFEPGDDG QTILGRVAAK GLADELRDRG YLVIDGVDGK AHYVALNAHD ELANYPTGAV VEVKGSADVR AADRNIAALA SDGLYRTDHH LAIEQGRAKA GRDPQEVVAA HVRRLEALRR AGIVERVAEG LWKVPDDLAE RGRQYDAQRL GGVAVELKSH LPIERQARVI GSTWLDQQLI GGGSGLGDLG FGGEAKQAMQ QRADFLTEQG LAERRGQRVI LARNLLGTLR NRELAQAAKD IAAESGLEHR PVADGQRVAG IYRRSIMLAS GRYAMLDDGM GFSLVPWKPV IEQRLGQQLA ATVRGGGVSW EIGRARGPAI S
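
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the sequence is read in frame from its first base on the + strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The names `translate` and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignment,
# which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGAGCGACC GCCGCGACGA CGATTTCCGC"))  # MSDRRDDDFR
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.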