Gene Information |
Locus tag | Tmz1t_1882 |
Symbol | |
ID | 7084305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2123683 |
End bp | 2125857 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643698905 |
Product | tail sheath protein |
Protein accession | YP_002355530 |
Protein GI | 217970296 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
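The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about how this record is encoded, not something the record states. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2123683
END_BP = 2125857
GENE_LENGTH_BP = 2175
PROTEIN_LENGTH_AA = 724


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed gene length and protein length agree with the coordinates; a mismatch in another record would suggest a different coordinate convention, a partial CDS, or a pseudogene.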
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.138498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGAGT ACCTCGCTCC AGGCGTCTTT GTCGAAGAGA CGAGCTTCCG CAACAAATCC ATCGAGGGGG TCGGCACCAG CGTCGCCGCG CTGGTGGGCC CCACCCGCAG CGGCCCGCTC CGCGGTCTCC CCGAGGTGCT GACGAGCTAC GCGGACTTCG AACGCATCTA CGGCGACGCG CACGACCTGG CCTTCGGCGC GGACGGCGAG GCCGTGCTCA ACCACACCGC GCACGCGGCG CGCGCCTTCT TCGACAACGG CGGCAAGCAG CTCTTCGTCG CCCGCGTCAT GCAGGGCGTC AACGGCCTCG CGGCGGACGC GGCCGGCGTG TGGCCCGGTG CCGCCATGAA GGCCGACGCC GGAAACACGC TGAGGTTCTT CAGCCGCTTT CCGGGCCGGA TGGGACGGTA CACCCTCGAG CTGCGCTGGC GCGACAGCGA GAACCTGCTG CGCAGCGAGA CCGCGGCAAG TGCCCGCGCA GGCGCGCTCT ACTTCCTCGA GGCGCGGGGC GTGTCGCTCG CCGCGAAGGC AGCCGGTCCG ATCGACGACG CGGCCTTCCC GGTCGACCTG CGGGCCATCG TCCGGCTCGA CGGCGCCAAC CTGGCGATCC AGGACGGGCG CGCCACGATC GTGTCGAGGG CCGACGAGGA TGCCCCCGCG GCGCTCGCGG CAAACCAGAT CACCAGCCTG ATCCCGGCCC AGCTGCCCGC CGGCGCGAGG CTCACCCGCG TGTTCGCGCG TCCGCCCGCA TCCGGCGCAC TGGCGGCGGG CACGCCGGCC GAATTGCGCC TGGCCGAGGT CGTCGATCTC GCCGCCTTCA CCGGCGGTGC GGGCTGGGGC AGCCGCAAGG CAGTGCGCGG CACACTGAAT GCCGCGGGCG AGGTCTTCAC GGCCACACCC GCGCTCAATC CGGGCCTCGC CGAAGCCGTC ACGCTGCCCC TCGCCGCACT CGCCGCGGCG CCCGGTGCCG CGCGCGCGCT GTTCGTGCTG CGCAGCTTCG ACGTCGATGT GCGCAACGGC GGCAAGGACG GCGAAGTGAT CCGCACCTAC CCCGGCCTGA CGACCGCCGC CACGGGGCCG ACCAGCCTCG CCGCCGGGCT TGCGGTCACA CCCGACCGTC GCGCCGAGGC GCTGAGCTCG CCGGTCGCCT GCCGCCTCGC CGCGAACGCG ACCGATGACG CGATCCTCGC CGCGCTGCTG TCCATCTGTG CCCCCGCCGC GCGCGACCCG GGCGAGGCCT CGCTCGACGA ACCGCGCTAC CTGATCGAGC TCGAGGGCGG CAGCGACGGC GACGAGCCGG GCGCGATCGA CTACGCCGGC GAAGCCGACG AGACCAAGGG CAGCACCGGA CTGAAGGCGC TCGAGGACGT CGAGGACGTC GCCATCGTGA TGACGCCCGC AGCCGCGGGC AGCGCCGAGG ACGTGCACAA GGCGGTCGTC ATGGAGGTGC AGAAGCACTG CCGGCGCATG CGCTACCGCA TCGGCATCGT GGACGCGCGC GCCGGGCAGA GCCTCGGCGA GCTGCGCGCC TTCGCCGGCA ACTTCGACGA CTCCCGCCTC GCGCTCTACC ACCCCTGGGT GGTGATCCCC GACCCGACCC GCACCCGGCG CGACATCACG GTGCCGCCCG CCGGCTTCAT CGCCGGCGTG TATGCGCGCA CCGATGTCGA CCGCGGCGTG CACAAGGCGC CCGCGAACGA GATCGTCATG GGTGCGCTGC GCTTCGACCA GCAGATCAAC GCCTTCCAGC AGGAGCTCCT CAATCCGAAC GGCATCAACT GCCTGCGCTC CTTCGCCGGA 
CGCGGCCACC GCGTGTGGGG CGGACGCACG CTCTCCAGCG ATCCCGAATG GAAGTACGTC AACGTGCGGC GCTACTTCCT CTATCTCGAG CGCTCGATCG AGAAATCCAC CCAGTGGGCG GTGTTCGAGC CGAACGGCGA GGCACTGTGG GCCAACATCC GTTCCAGCGT GGAGGACTTC CTCTTCGCCG AATGGCGCAA CGGTCGCCTG CTGGGCGCCA CGCCGAAGGA GGCCTTCTTC GTGCGCTGTG ACCGCTCGAC GATGACCCAG AACGACATCG ACAACGGCCG CATGGTGTGC CTCGTCGGCG TCGCCGCGCT CAAGCCCGCC GAATTCGTCA TCTTCCGCAT CGGTCAGAAG ACCGCCGACG CCTGA
|
Protein sequence | MPEYLAPGVF VEETSFRNKS IEGVGTSVAA LVGPTRSGPL RGLPEVLTSY ADFERIYGDA HDLAFGADGE AVLNHTAHAA RAFFDNGGKQ LFVARVMQGV NGLAADAAGV WPGAAMKADA GNTLRFFSRF PGRMGRYTLE LRWRDSENLL RSETAASARA GALYFLEARG VSLAAKAAGP IDDAAFPVDL RAIVRLDGAN LAIQDGRATI VSRADEDAPA ALAANQITSL IPAQLPAGAR LTRVFARPPA SGALAAGTPA ELRLAEVVDL AAFTGGAGWG SRKAVRGTLN AAGEVFTATP ALNPGLAEAV TLPLAALAAA PGAARALFVL RSFDVDVRNG GKDGEVIRTY PGLTTAATGP TSLAAGLAVT PDRRAEALSS PVACRLAANA TDDAILAALL SICAPAARDP GEASLDEPRY LIELEGGSDG DEPGAIDYAG EADETKGSTG LKALEDVEDV AIVMTPAAAG SAEDVHKAVV MEVQKHCRRM RYRIGIVDAR AGQSLGELRA FAGNFDDSRL ALYHPWVVIP DPTRTRRDIT VPPAGFIAGV YARTDVDRGV HKAPANEIVM GALRFDQQIN AFQQELLNPN GINCLRSFAG RGHRVWGGRT LSSDPEWKYV NVRRYFLYLE RSIEKSTQWA VFEPNGEALW ANIRSSVEDF LFAEWRNGRL LGATPKEAFF VRCDRSTMTQ NDIDNGRMVC LVGVAALKPA EFVIFRIGQK TADA