Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3270 |
Symbol | |
ID | 7874491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3581110 |
End bp | 3582720 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700204 |
Product | exosortase 1 |
Protein accession | YP_002890242 |
Protein GI | 237653928 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02602] eight transmembrane protein EpsH (proposed exosortase) [TIGR02914] EpsI family protein [TIGR03109] exosortase 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.648481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATGA AAGGAACCGC GATGCTTCAA TCCGTGCGGC CGGGCGCGAT GCCGACCCAC CTGCAGGCGA CCGACGCCCG CCCTGCGACG CTCGGTGATC CGCGCACGCG CAGGCTTCAC CTGGCGGTGC TGGCGCTGGG ACTGCTGTGG CTGCTGTTCT GGTATCGCGA CACCTTCCTG GCCATGGTGG GCATCTGGGA CCGCTCGGAC ACCTTCGCCC ACGGCTTCGT GATCGCGCCG ATCTCGGCCT GGCTGGTATG GCGGCGCCGG CACTTCATCG ACCATGTGCC GATCGCGCCC TCGCTGCTCG GCATCTGCGC CGGACTCGTC GCCGGCGCGG CCTGGTTGCT CGGCGAGCTC GCCAGCGTCG ATGCGGTGTC GCAGTTCGCC TTCGTGGGCA TGCTGGTGGG CTTCGTCTGG GCGGTGATGG GCAGCGCGGT GGTGCGCACC TACGCCTTCC CGATCGGCTT CCTGTTCTTC ATGGTGCCGT TCGGCGAGTT CCTCTTCCCC ACCATGATGC AGTGGACCGC CGACTTCATC ATCTGGGCGG TGCGCGCCTC CGGAGTGCCG GTGTACGCCG AAGGCTTCCT GCTGGTGATC CCCTCCGGGC GCTGGCAGGT GGTGGAGGGG TGCAGCGGCG TGCGCTACCT GATGGCCTCG ATCGTCGTCG GCAGCCTGTA CGCTTACCTG AACTACCGCA GCACGCAGAA GCGCCTGCTT TTCGTCGCCG CCTCGATCGT GCTGCCGGTA TTCGCCAACT GGCTGCGCGC CTACGGCATC GTGATGCTCG GCCACCTCAC CGACAACCGC CTCGCTGCCG GGGCCGACCA CCTGATCTAC GGCTGGGTCT TCTTCGGCAT CATCATCCTC GCGCTGTTCT GGATCGGTTC GCGCTGGCAG GAGGACGAGG AAGAGGCCCC GGCCATGCCC GTCGGCTCCG CGCGATCGTC GGGCGTACGC ACCGGTGGCC TGGGCTGGCT GGCCGTGGCG GTGGTCGCGG TGGGGCTGTG GGTGCCGGTC CTCGGCCACC TCGAGCGCCA GGGCGAGCAG GGGCCGGTGC GCTTCGCCGC GCTGGAGGGG CAGGGCGCGT GGCAGCGGAC GGAAGCGGGC GAATTGCCGC CGTGGTCGCC GAGCTATTCC GGCATGCGCG ACACCTTCCG TTCGACCTGG CGCGCGGGGA GCGTTCCGGT GGGGGTGTAC ATCGGCTATT ACCGCGACCA GGGACCGGGC AGGGAGTTGA TCAATTCCGA GAACCGCGTG CTGATCAGCA AGGACCCGGT GTGGCGCATG ACCGCCGCGG GTCCGGTCCG GGTGCAACTC GGCGACCAGG CGCAGGCGTG GCGCACGCTG GAGATGGCGA GCGAGCATGC ACGCATGGTC GTCTGGTACG CGTACTGGAT CGGCGGCCGG TGGACCACCA GCGATCATCT TGCCAAGGTC TATCTGGCGC TGAGCCGCCT GCGCGGCGAA GGGGACGATT CGGCGGTGGT CATGCTGCAT GCGCCGCAGG GTCAGGGCGG GCACGCCGGC ACGATCGCCG CGCTTGAAGA CTTCGCACGC GAGATGGGCG GGCCCCTGCA GGCCATGCTG GATCGCACCG CGAACCCTTG A
|
Protein sequence | MNMKGTAMLQ SVRPGAMPTH LQATDARPAT LGDPRTRRLH LAVLALGLLW LLFWYRDTFL AMVGIWDRSD TFAHGFVIAP ISAWLVWRRR HFIDHVPIAP SLLGICAGLV AGAAWLLGEL ASVDAVSQFA FVGMLVGFVW AVMGSAVVRT YAFPIGFLFF MVPFGEFLFP TMMQWTADFI IWAVRASGVP VYAEGFLLVI PSGRWQVVEG CSGVRYLMAS IVVGSLYAYL NYRSTQKRLL FVAASIVLPV FANWLRAYGI VMLGHLTDNR LAAGADHLIY GWVFFGIIIL ALFWIGSRWQ EDEEEAPAMP VGSARSSGVR TGGLGWLAVA VVAVGLWVPV LGHLERQGEQ GPVRFAALEG QGAWQRTEAG ELPPWSPSYS GMRDTFRSTW RAGSVPVGVY IGYYRDQGPG RELINSENRV LISKDPVWRM TAAGPVRVQL GDQAQAWRTL EMASEHARMV VWYAYWIGGR WTTSDHLAKV YLALSRLRGE GDDSAVVMLH APQGQGGHAG TIAALEDFAR EMGGPLQAML DRTANP
|
| |