Gene Tmz1t_1517 details


Gene Information

Locus tag: Tmz1t_1517
Symbol:
ID: 7083599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1695128
End bp: 1696276
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 68%
IMG OID: 643698534
Product: Extracellular ligand-binding receptor
Protein accession: YP_002355171
Protein GI: 217969937
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
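
The start, end, gene length, and protein length fields are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based inclusive coordinates (the record does not state its convention) and a CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1695128, 1696276)
print(length_bp)                  # 1149
print(protein_length(length_bp))  # 382
```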


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.702531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTGC GTAACACCCT GCTGGCCGCC CTCGGCCTCG CCTTCGCCGC CACTGCCGCC 
CATGCCGAGA TCAAGGTCGG TGTCGTGCTC TCGGCCACCG GCCCCGCCGC TTCGCTGGGT
ATCCCGGAAA AGAACACGAT CGCGCTGCTG CCGGCCACCA TCGGCGGCGA AAAGGTGAGC
TACATCGTGC TCGACGACGC CTCCGACACC ACCACGGCGG TCAAGAACGC CCGCAAGCTG
ACCGTCGAGG ACGGCGTCGA TGTGATCATC GGCTCCACCA CCAGCCCCGC TTCGCTGGCC
ATGGTCGACG TCGCCGCCGA GACGAAGACG CCGATGATCT CGATGGCCGC GTCGGCGCGC
ATCGTCGCGC CGATGGACGA CAAGAAGCGC TGGGTCTTCA AGACCCCCCA GAACGACCAG
CAGATGGCGT CGGCGATCGT CGAGCACATG GTCGCCAACA AGGTGAAGAA GGTGTCCTTC
ATCGGCTTCG CCAACGCCTA CGGCGAGGGC TGGTACGAGC AGTTCAAGAA GCTGGCCGAA
GCCAAGGGCA TCGAGATCGC CGCCAGCGAG CGCTTCAACC CGGCCGACAC CTCGGTGACC
GGCCAGGCGC TCAAGCTGAT GTCGGTGAAG CCGGACGCGG TGTTCATCGC CGGCTCGGGC
ACGCCCTCGG CGCTGCCGCA GAAGACGCTG CGCGAGCGCG GCTACAAGGG GCCGATCTAC
CAGACCCACG GCGTGGCCAA CAACGACTTC CTGCGCATCT GCGGCAAGGA CTGCGAAGGC
ACGCTGCTGC CGGTCGGCCC GGTGCAGATG GCGCGCAGCC TGCCCGACAG CCACCCCGTC
AAGGCGAGCG CGCTGGCCTA CGTGGAGAAG TACGAGGCCG CCAACGGCGC GGGCTCGGTG
TCGAGCTTCG GGGCCTACGC GTGGGACGCC GGGGTGTTGC TGCAGGCGGC CGTCCCTGCC
GCGCTCAAGG CGGCCAAGCC GGGCTCGGCG GAGTTCCGCA CGGCGCTGCG CGATGCGCTC
GAGGGCGTGA AGGAAGTCGC CGGCGCCACC GGCATCTACA CGATGAGCCC CGACGATCAC
CTCGGCCTGG ACGACCGCTC GCGCGTGATG ATCGAGATCC GCAACGGCAC CTGGTCGCTG
CTGAAGTAA
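
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields reported above. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short example.
first_line = "ATGAAGTTGC GTAACACCCT GCTGGCCGCC CTCGGCCTCG CCTTCGCCGC CACTGCCGCC"
seq = clean_sequence(first_line)
print(len(seq))                        # 60
print(round(100 * gc_fraction(seq)))   # GC percentage of this line only
```

Passing the full listing to the same two functions would give values to compare against the reported gene length and GC content.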
 
Protein sequence
MKLRNTLLAA LGLAFAATAA HAEIKVGVVL SATGPAASLG IPEKNTIALL PATIGGEKVS 
YIVLDDASDT TTAVKNARKL TVEDGVDVII GSTTSPASLA MVDVAAETKT PMISMAASAR
IVAPMDDKKR WVFKTPQNDQ QMASAIVEHM VANKVKKVSF IGFANAYGEG WYEQFKKLAE
AKGIEIAASE RFNPADTSVT GQALKLMSVK PDAVFIAGSG TPSALPQKTL RERGYKGPIY
QTHGVANNDF LRICGKDCEG TLLPVGPVQM ARSLPDSHPV KASALAYVEK YEAANGAGSV
SSFGAYAWDA GVLLQAAVPA ALKAAKPGSA EFRTALRDAL EGVKEVAGAT GIYTMSPDDH
LGLDDRSRVM IEIRNGTWSL LK
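
The protein sequence can be reproduced from the gene sequence by translating it in frame 1. The sketch below is one way to do that without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. Since this gene begins with ATG, no special handling of alternative start codons is included. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code; the tables
# differ only in which codons may act as start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAAGTTGC GTAACACCCT GCTGGCCGCC"))  # MKLRNTLLAA
```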