Gene Tmz1t_3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3194 
Symbol 
ID7874334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3476982 
End bp3478229 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content71% 
IMG OID643700123 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_002890166 
Protein GI237653852 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.100454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCCT TCCTCGCGCT GCTCGGGCGC GCCTTCGTGA CGCTTCTGCT CCTGGCGCCG 
CAGTTGTCGG CACAGGCCGT CCCGGCAGAG GCCGTGGGCG CGGGCGCACG CCCGAGCGGC
GGCGAGCGGG CACCGATCCG GGTGGGCGTG TCGGGGCCCT TCTCCGGTCC GTCGGCGCCG
ATGGGGCTGT CGATGCGCGA GGGCATCCGC ATCGCCGCCG AGGAGCTCAA CGCCGGCGGC
GGCCTGCTCG GTCGGCGCAT CGAGCTCGTC GAGCGCGACG ACGAGGCGAG CAACGAGCTT
GGTGCGCAGA TCGTGCGCGA CTTCATCCAT CGCGAACGTG TCACCGCCGG ACTGGGCATC
GTCAATACCG GTGTCGCGCT CGCCAGCCAG CGCCATTACC AGATGGCGCG CATCCCGGTG
ATCACCTCGG TCGCCACCGG CTCGCTGATC ACCAAGCAGT TCCAGCCCCC CGATTTCACC
GAGAACTACG TGTTCCGCGT CTCCGCCAGC GACACCTTGC AGGCCGCGGT GATCGTCGAG
GAGGCTGTCG GCCGGCGCGG CCTGACCCGG CTCGCGATCC TGCACGACGC CACCAACTAC
GGGGTGCTGG GAAGCCAGGA TCTCATCGCA GCCCTGGGCA CGCGCGGGCG GACCGCGGTG
GTGGTCGAGC GCTTCCAATT GCGCGAGACC GACATGCGAC CGCAGCTCGA ACGCGCACGC
GCCGCCGGTG CGCAAGCCGT GCTCACCTAT GGCATCGGCC CCGAGCTCGC CCACATCGCC
AACTCGATGG CCCGCCTCGG ATGGCAGGTG CCGATCATCG GCAGCTGGAC GCTGGCCATG
TCGAGCTTCA TCGAACTCGC CGGCCGCAAC GCCGAGGGCG CACGCATGCC ACAGACCTTC
ATCGCCGAGG CACGCAGCCC GGCGCAGGCG GCCTTCCTCG CGGCGTGGGA GCGCGCCACC
GGCAGCGTGC GTATCCCGGT GCCACCGGCG GCCGCACAGG GCTACGACTC GATGCAGCTG
CTCGCCGCGG CGATCCGCCA GGCCGGCAGC CTCGACGGTC CGCGCATCCG CGAGGCGCTC
GAAAACCTGG ATGCGGAGGT CGATGGCGTG ATCATGCGCT ATCGCCGCCC CTTCTCCCGC
GACAACCACG AAACCCTGCG CAGCGCGCGC CAGATCCACC TCGGCGAGAT CCGCGACGGC
GCCGTGGTGT TCGCCCACGA GGCGGCGCCG GAGAGGCCGG GACCATGA
 
Protein sequence
MKPFLALLGR AFVTLLLLAP QLSAQAVPAE AVGAGARPSG GERAPIRVGV SGPFSGPSAP 
MGLSMREGIR IAAEELNAGG GLLGRRIELV ERDDEASNEL GAQIVRDFIH RERVTAGLGI
VNTGVALASQ RHYQMARIPV ITSVATGSLI TKQFQPPDFT ENYVFRVSAS DTLQAAVIVE
EAVGRRGLTR LAILHDATNY GVLGSQDLIA ALGTRGRTAV VVERFQLRET DMRPQLERAR
AAGAQAVLTY GIGPELAHIA NSMARLGWQV PIIGSWTLAM SSFIELAGRN AEGARMPQTF
IAEARSPAQA AFLAAWERAT GSVRIPVPPA AAQGYDSMQL LAAAIRQAGS LDGPRIREAL
ENLDAEVDGV IMRYRRPFSR DNHETLRSAR QIHLGEIRDG AVVFAHEAAP ERPGP