Gene Tmz1t_3683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3683 
Symbol 
ID7873188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4045355 
End bp4046638 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content70% 
IMG OID643700629 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_002890653 
Protein GI237654339 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGCC GTAACGAAGC CCTCTTCCAG CGCGCCCAGC GCAGCATCCC CGGTGGCGTA 
AATTCGCCCG TCCGCGCCTT CCGCTCGGTG GGCGGCACGC CGCGCTTCCT TGCGCGCGCC
GAAGGTGCCC GGGTGTGGGA TGCGGATGGC AAGGAGTACA TCGACTACGT CGGCTCCTGG
GGCCCGGCGA TCGCCGGCCA CGCCCACCCG GCGATCATCG AGGCGGTGCG CGAGGCGGCG
CTGAAGGGCC TGTCCTTCGG TGCGCCCACC GAGAGCGAGG TCGACATGGC CGAGCTGATC
TGCGCCATGC TGCCCTCGGT GGAGATGGTG CGCCTGGTCA GTTCGGGCAC CGAGGCGACC
ATGAGCGCGA TCCGGCTGGC GCGCGGCTTC ACCGGCCGCG ATGCGATCGT GAAGTTCGAG
GGCTGCTACC ACGGCCACGC CGACAGCCTG CTGGTGAAGG CCGGCTCCGG CCTGCTGACC
TTCGGCAACC CGTCCTCGGG CGGCGTGCCG GCGGACTTCG CCAAGCACAC CATCGTGCTC
GACTACAACG ACCTGCAGCA GGTCGAGGAC GTGTTCAAGG CGCGCGGCGA CGAGATCGCC
GCGATCATCG TCGAGCCGGT GGCCGGCAAC ATGAACCTGA TCAAGCCGCA GCCCGGCTTC
CTCGAAGGCC TGCGCCGCAT CTGCACCGAG TACGGCGCGG TGCTGATCTT CGACGAGGTG
ATGACGGGGT TCCGCGTCGG CCCGCAGGGC GTGCAGGGCC TCTACGGCAT CACCCCCGAC
CTGACCACGC TCGGCAAGGT GATCGGCGGC GGCATGCCGG TGGGCGCCTT CGGCGGTCGT
CGCGACATCA TGGAAAAGAT CGCCCCGCTG GGCAGCGTTT ACCAGGCCGG CACCCTGTCG
GGCAGCCCGG TGGCAGTGGC GGCGGGCATG GTGTCGCTGC AGCTGACGCG TGAGGCCGGC
TTCTACGAGC GGCTGACGGC GAGCACGCAG CGACTGGTTG CGGGTCTCGC CGCGGCGGCC
AAGGATGCCG GCGTCACGTT CAGCGCGGAC TCGGTCGGCG GCATGTTCGG CGTCTATTTC
GCCGACAAGG TGCCGGCCTC CTTCGCCCAG GTGATGGCTA CGGACAAGGA GCGCTTCAAC
CGCTTCTTCC ACGCCATGCT GGAAGCCGGC CACTACTTCG CGCCCTCGGC CTTCGAGGCC
GGCTTCGTGT CGGTGGCGCA TGGCGAGGCC GAGATCGACG CCACGGTGGC CGTGGCGCGC
GCGGTCTTCG CCCAACTGGG CTGA
 
Protein sequence
MTSRNEALFQ RAQRSIPGGV NSPVRAFRSV GGTPRFLARA EGARVWDADG KEYIDYVGSW 
GPAIAGHAHP AIIEAVREAA LKGLSFGAPT ESEVDMAELI CAMLPSVEMV RLVSSGTEAT
MSAIRLARGF TGRDAIVKFE GCYHGHADSL LVKAGSGLLT FGNPSSGGVP ADFAKHTIVL
DYNDLQQVED VFKARGDEIA AIIVEPVAGN MNLIKPQPGF LEGLRRICTE YGAVLIFDEV
MTGFRVGPQG VQGLYGITPD LTTLGKVIGG GMPVGAFGGR RDIMEKIAPL GSVYQAGTLS
GSPVAVAAGM VSLQLTREAG FYERLTASTQ RLVAGLAAAA KDAGVTFSAD SVGGMFGVYF
ADKVPASFAQ VMATDKERFN RFFHAMLEAG HYFAPSAFEA GFVSVAHGEA EIDATVAVAR
AVFAQLG