Gene Caul_3047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3047 
SymbolglyA 
ID5900502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3310270 
End bp3311574 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content68% 
IMG OID641563549 
Productserine hydroxymethyltransferase 
Protein accessionYP_001684672 
Protein GI167647009 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.302237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.92083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCC CGGCCAGCAA CATCACCGCC GACAAGAACG CCTTCTTCGG GGCCGACCTG 
GCCGCCGCCG ATCGCGACAT CTTCGATCGC ATCGGCCTGG AGCTGAACCG CCAGCAAAAC
CAGATCGAGC TGATCGCCTC GGAGAACATC GTCTCGCGCG CGGTGCTCGA AGCCCAGGGC
TCGATCCTGA CCAACAAGTA CGCCGAGGGC TATCCGGGCA AGCGCTACTA TGGCGGCTGC
GAATATGTCG ACGAGATCGA GACCATCGCC ATCGAGCGCG CCAAGGCGCT GTTCGGGGCC
GGCTTCGCCA ACGTCCAGCC GCACTCGGGC TCGCAAGCCA ACCAGTCGGT GTTCATGGCC
CTGCTGCAGC CGGGCGACAC CTTCCTGGGC ATGGACCTGG CCGCCGGCGG CCACCTGACC
CACGGCAGCC CCGCCAACCA GTCGGGCAAG TGGTTCAAGC CGGTGTCGTA TACCGTGCGC
CAGCAGGACC AGCTGATCGA CTACGACGCC GTCGAAGAGG TCGCCCAGGC CAGCAAGCCC
AAGCTGATCA TCGCCGGCGG CAGCGCCTAT AGCCGCCAGA TCGATTTCGC CCGCTTCCGC
CAGATCGCCG ACAGCGTCGG CGCCTATCTG ATGGTCGACA TGGCTCACTT CGCGGGCCTG
GTGGCCGGCG GTGTGTTCCC CAGCCCCATC CCGCATGCCC ACGTGGTCAC CACCACCACC
CACAAGACCC TGCGCGGCCC GCGTGGCGGC ATGGTGCTGA CCAATGACGA GGCGATCATC
AAGAAGGTCA ATTCGGCCGT GTTCCCGGGC CTGCAGGGCG GTCCGCTGGA GCATGTGATC
GCCGCCAAGG CCGTGGCCTT CGGCGAGGCG CTGCAGCCGG CCTTCAAGGC CTATGCGCAA
GCGGTGATCG ACAACGCCCG CGCGCTGGCC GAGGCCCTGC AGACCCAGGG CGTCAACATC
GTCTCGGGCG GCACCGACAG CCACCTGATG CTGGTCGACC TGCGGCCCAA GGGCGTGACC
GGCCGCGACG CCGAGCACAG CCTCGAGCGC GCCCACATGA CATGCAACAA GAACGGCGTG
CCGTTCGACA CCGCGTCGTT CGCCGTCACC TCCGGCATCC GCCTGGGCAC GCCGGCCGGC
ACCACCCGCG GGTTCGGCGC GGCCGAGTTC ACCCGGGTCG GCCAGCTGAT CGGCGAGGTC
GTCAACGGCC TGGCCGCCAA CGGCGTCGAC GGCAACGGCG CGGTCGAGGC CAAGGTCCGC
GAGGAAGTCC TGGCCTTGAC GGCGCGGTTC CCGATCTACA ACTAA
 
Protein sequence
MTAPASNITA DKNAFFGADL AAADRDIFDR IGLELNRQQN QIELIASENI VSRAVLEAQG 
SILTNKYAEG YPGKRYYGGC EYVDEIETIA IERAKALFGA GFANVQPHSG SQANQSVFMA
LLQPGDTFLG MDLAAGGHLT HGSPANQSGK WFKPVSYTVR QQDQLIDYDA VEEVAQASKP
KLIIAGGSAY SRQIDFARFR QIADSVGAYL MVDMAHFAGL VAGGVFPSPI PHAHVVTTTT
HKTLRGPRGG MVLTNDEAII KKVNSAVFPG LQGGPLEHVI AAKAVAFGEA LQPAFKAYAQ
AVIDNARALA EALQTQGVNI VSGGTDSHLM LVDLRPKGVT GRDAEHSLER AHMTCNKNGV
PFDTASFAVT SGIRLGTPAG TTRGFGAAEF TRVGQLIGEV VNGLAANGVD GNGAVEAKVR
EEVLALTARF PIYN