Gene Clim_1067 details


Gene Information

Locus tag: Clim_1067
Symbol:
ID: 6354717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1170346
End bp: 1171425
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 57%
IMG OID: 642668684
Product: L-threonine-O-3-phosphate decarboxylase
Protein accession: YP_001943115
Protein GI: 189346586
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01140] L-threonine-O-3-phosphate decarboxylase
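
The coordinates and lengths listed above agree with one another, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The record does not state these conventions, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1170346
end_bp = 1171425

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1080
print(protein_length_aa)  # 359
```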


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.748471
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGTC TTTATTATCA TGAACACGGC GGAGAAACTG AACGTCGTTT CGGTGCTAAA 
CCCGCCGGTT TGCTCGATTT CAGCGTCAAC ATCAGCCCGC TTTTTCCGCT TCAGGAGCCT
TTGGCGATCG ACAGTACCGA TCTGCAGACT TACCCTTCAA TAGACGGAAA GGGAGTGTGC
GGATTCTACG CCAGAAAGTT CGGGCTGGAT GCAGCGTCTG TGATCGCCCT TAACGGGGCT
GTAGAGGGAA TCTACCTCCT GCCGAGGGCG TTGGGCATAC GCCGGATGCT GCTGCTTGCG
CCATCGTTTT ACGAATACGA ACGGGCCGCC CGTATTGCCG GCGCCGAAAT CGGATTTGTC
GAACTTGTCG CCGGGGACGG GTTCGCTCTC CCTGCAATCG GCGAACTGGC GGCCAGACTG
CAGCACTACG ATGCGTTTTT TGTCGCGAAT CCCAACAACC CGACCGGTAC TCTGTTTCCT
CCCGAAGTGA CCATGGCGCT TGCAAGCCGG TTTCCCGACA AGTGGTTTTT CGTTGACGAA
GCCTTTATAC AGTTTCAGCC GGATTTTCCG GAAGTGTCGC TGATGCGCCG TATTCCGGCT
TTCCGCAATA TCGTTGTCGT GCATTCGCTG ACGAAATTCT ATGCGCTTCC GGGACTGCGA
CTCGGTGCGC TCATAGCTCA TCCGGATACG ACCAGAAGAC TCTACGATTT CAAGGAGCCC
TGGACGGTCA ATGCCGTTGC CGAAAGGGTT GCGGGCGAAC TGGCCGGGTG CTTCGCTTAT
GAAGCGGCTC TCCGTTCGAT GATCGATTGC GAAAGAGGAC GGCTCGCTGA GGCTCTGACG
GAAATCGAAG GGGTGCGCAT TGCCGGGGGA GCGGCGAACT TTTTTCTCGC CCAATGGCGC
CGTTCGAGTT CGCTGGATGA ATTGATTGCA CATTTTCTGT CGCAGGGCAT AAAGGTGCGG
GACTGCAGGA ATTTCAGGGG TCTCGAGGCC GACTATTTCC GTTTTGCCGT CCGCACGCCG
CAGGAGAACG ACCGTTTTCT CGAAGCGCTT CGTGCCGTTC CGGCGCTGCA ATGGGCGTGA
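
A GC content figure such as the one listed under Gene Information can be recomputed from the gene sequence. The record does not say how its value was derived; the sketch below assumes it is the percentage of G and C bases over the whole gene sequence. The function name `gc_content` is hypothetical, and the usage line uses only the first row of the sequence for brevity.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the spaces and line breaks used in the listing above)
    is ignored, and the input is treated case-insensitively.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence only; paste the full listing to check
# the value reported for the whole gene.
first_row = "ATGAACAGTC TTTATTATCA TGAACACGGC GGAGAAACTG AACGTCGTTT CGGTGCTAAA"
print(round(gc_content(first_row), 1))
```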
 
Protein sequence
MNSLYYHEHG GETERRFGAK PAGLLDFSVN ISPLFPLQEP LAIDSTDLQT YPSIDGKGVC 
GFYARKFGLD AASVIALNGA VEGIYLLPRA LGIRRMLLLA PSFYEYERAA RIAGAEIGFV
ELVAGDGFAL PAIGELAARL QHYDAFFVAN PNNPTGTLFP PEVTMALASR FPDKWFFVDE
AFIQFQPDFP EVSLMRRIPA FRNIVVVHSL TKFYALPGLR LGALIAHPDT TRRLYDFKEP
WTVNAVAERV AGELAGCFAY EAALRSMIDC ERGRLAEALT EIEGVRIAGG AANFFLAQWR
RSSSLDELIA HFLSQGIKVR DCRNFRGLEA DYFRFAVRTP QENDRFLEAL RAVPALQWA
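
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal illustration that does not rely on any external library. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, no special start handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAACAGTC TTTATTATCA TGAACACGGC"))  # MNSLYYHEHG
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence listed above once the spaces are removed.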