Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2457 |
Symbol | |
ID | 7874141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2649881 |
End bp | 2650861 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643699380 |
Product | hypothetical protein |
Protein accession | YP_002889437 |
Protein GI | 237653123 |
COG category | [S] Function unknown |
COG ID | [COG0392] Predicted integral membrane protein |
TIGRFAM ID | [TIGR00374] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.379583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCG AAGCCCGGCG CACTCGACGC CCGCTGCCCG TCCTGCGTGC GCTTGCGAGC CTCGGGCTGC TCGCCGGCGT GCTGTGGTGG ATCGAACCGC GCGCCCTCAT CGCCGCCTTC GGCGCGCCTG CGCCCGCCTG GCTGGCGCTC GCGCTCGCGA TCGGCGTGCT GCAGACGGCC TTGTCGGCGT GGCGCTGGCA GCTCACCGCC GGCCGCCTCG GCGCCCCGCT CGGCTTCGCC CATGCGCTGC GCGAGTACTA CCTCGCGAGC TTTCTCAACC AGATCCTGCC GGGCGGGGTG ATGGGCGACG CCGCGCGCGC CTGGCGCCAC GCCAGTCTGC CGGGTACAGC CCGCGACGCA GCCTGGCAGG CAGTGGTGAT CGAGCGCGGC GCGGGCCAGC TCGCGCTGCT GCTGGTGGTG GTCGCGAGCG TGCTCGCCGC CCCCGCGCTG CAGGTTGTGC CGGGACGCCT GGGCGACGCC GTCGACCTCC GGGGACTGCC GTGGATGGGC GTGCTCGCCG TGCTCGCCCT CGTCGCGACA GCACTCGCAG GCAGCACCAG GACGGCCCTG CGCAGCCTGG CCAGCGCCAC CCGGCAGGCG CTGCTCGGGC GCGCGGTGCT GCTGCGCCAG CTGCTCGCCT CGCTGCTGAT CGTGGCCAGC TACGTCGCGG TGTGGCTGTG CTGCATGCGC ATGAGCGGGC TCGCCACGCC GCCAGCCCAG GCGGCGGCGC TGGTGCCCTG GGTGCTGCTG GCGATGGCGA TCCCGCTCTC GGTGGCGGGC TGGGGCATCC GTGAAGGCGC GGCCGCGCTG GTGTGGCAGG CGGCCGGGCT GGACGCCGCC GAAGGCGTGG CGGCCTCGGT GAGCTACGGC GTGGTCGTGC TGCTGTCGAC GCTGCCGGGC GCGCTGGCGC TGCGGCGCCA CGAGCCCGGC GCGCGCTCCA CCGCCGAGGC GACGCCTACT TGCCGGGCGG ACGGGCCTTG A
|
Protein sequence | MSGEARRTRR PLPVLRALAS LGLLAGVLWW IEPRALIAAF GAPAPAWLAL ALAIGVLQTA LSAWRWQLTA GRLGAPLGFA HALREYYLAS FLNQILPGGV MGDAARAWRH ASLPGTARDA AWQAVVIERG AGQLALLLVV VASVLAAPAL QVVPGRLGDA VDLRGLPWMG VLAVLALVAT ALAGSTRTAL RSLASATRQA LLGRAVLLRQ LLASLLIVAS YVAVWLCCMR MSGLATPPAQ AAALVPWVLL AMAIPLSVAG WGIREGAAAL VWQAAGLDAA EGVAASVSYG VVVLLSTLPG ALALRRHEPG ARSTAEATPT CRADGP
|
| |