Gene Clim_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1965 
SymbolglyA 
ID6355020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2180109 
End bp2181434 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content56% 
IMG OID642669563 
Productserine hydroxymethyltransferase 
Protein accessionYP_001943976 
Protein GI189347447 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.128494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGATA CCGACATCCT GAGGATGCAG GATAAAGAGG TTTTCGAGGC GATAGCCGGC 
GAAACCCTGA GGCAGACAGA AACCCTCGAA CTCATCGCAT CCGAGAACTT CACCAGCAGG
GCCGTCATGC AGGCCTGCGG CTCGCTCATG ACCAACAAAT ATGCCGAAGG TTATCCCGGA
AAGCGCTATT ACGGAGGGTG CGAGTTTGTC GATATTGCTG AAAATCTTGC CCGCGATCGT
GCAAAAAAAC TTTTCGGCTG CCAGTATGTC AACGTTCAGC CGCATTCCGG TTCGAGCGCC
AACATGGCGG TGCTTTTTTC GGTGCTCAAG CCGGGCGACC GCATTATGGG CCTCGATCTC
TCGCATGGAG GCCATCTTAC GCACGGCAGC CCGGTGAACT TTTCAGGGCA GCTTTTTGAT
GCACACTCCT ACGGCGTCGA CCGTGAGACC GGCTGCATCG ACATGAACCG GGTCGAAGAA
CTGGCGCTTG AGGTCCGTCC TAAACTCATC ATCTGCGGTG CGAGCGCCTA CTCTCAGGGG
TTTGATTTCA AGGCATTCAG GGAGATCGCC GACAAGGTCG GTGCCCTTCT GATGGCCGAT
ATCGCCCACC CTGCAGGTCT GATTGCCGCC GGGCTGCTCA GCGACCCCAT GCCGCACTGT
CATTTCGTTA CCACGACTAC CCACAAGACG CTCCGCGGCC CCAGAGGGGG TATGATCATG
ATGGGCAGCG ACTTTGAAAA TCCTCTCGGC ATTACCATCA AAACGAAAAC CGGATCGAGG
GTGAAAATGA TGTCGGAGGT CATGGATGCC GAAGTGATGC CCGGTATTCA GGGTGGTCCG
CTCATGCACA TCATAGCGGG AAAGGCCGTT GCCTTCGGCG AGGCGCTGCA GCCGGCATTC
AGGGAGTATG CCGTGCAGGT CAGGAAAAAT GCAGCTGCAA TGGCCGAAAG TTTTGCCGGT
CTCGGTTATA ATATTGTCAG CGGCGGCACC AAAAACCATC TCATGCTGCT CGATCTGCGC
AACAAGGAGG TTAACGGCAA GGTGGCGGAA AATCTGCTGC ATGAGGCAGG CATCACGGTC
AACAAGAATA TGGTGCCGTT TGACGATAAA TCGCCTTTCG TTACCAGCGG CATCAGGATC
GGTACTGCGG CCATGACCAC TCGCGGGATG ACCGAAAACG ACAGCCGGAC GGTTGCCGGG
CTGATCGACC AGGTTATTTC ATCGGCGAAT TCCGCCGGAG TAGAAGAGAT ATGCCGTACA
GTACGGCATG ATATCAGGGA ACTCTGTTTG GCTTATCCGC TTGAAGGATA CGGCGTAAAC
CCCTGA
 
Protein sequence
MMDTDILRMQ DKEVFEAIAG ETLRQTETLE LIASENFTSR AVMQACGSLM TNKYAEGYPG 
KRYYGGCEFV DIAENLARDR AKKLFGCQYV NVQPHSGSSA NMAVLFSVLK PGDRIMGLDL
SHGGHLTHGS PVNFSGQLFD AHSYGVDRET GCIDMNRVEE LALEVRPKLI ICGASAYSQG
FDFKAFREIA DKVGALLMAD IAHPAGLIAA GLLSDPMPHC HFVTTTTHKT LRGPRGGMIM
MGSDFENPLG ITIKTKTGSR VKMMSEVMDA EVMPGIQGGP LMHIIAGKAV AFGEALQPAF
REYAVQVRKN AAAMAESFAG LGYNIVSGGT KNHLMLLDLR NKEVNGKVAE NLLHEAGITV
NKNMVPFDDK SPFVTSGIRI GTAAMTTRGM TENDSRTVAG LIDQVISSAN SAGVEEICRT
VRHDIRELCL AYPLEGYGVN P