Gene Cagg_3468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3468 
SymbolglyA 
ID7269693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4225419 
End bp4226675 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content57% 
IMG OID643568276 
Productserine hydroxymethyltransferase 
Protein accessionYP_002464744 
Protein GI219850311 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.624379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGAAC ATCTGCGCGC AACTGACCCG ATCATTGCCG ATTTGATCGA GCGGGAAGCG 
CAACGCCAAC GGCAAGGACT TGAGCTGATC GCCAGCGAGA ACTATACGAG TCTCGCGGTG
ATGGAGGCCC AGGGGTCGGT ACTGACGAAT AAGTACGCCG AAGGGTTGCC TGGTCGGCGT
TACTACGGTG GCTGTGAGTT TGTCGATGCA ATTGAGCAAT TGGCCATCGA CCGCGCGTGC
CAGTTGTTCG GTACGTCGCA CGCCAATGTT CAGCCGCATA GCGGTGCGCA AGCCAATATT
GCGGTGTTTA CCGCTCTCTT GCAGCCCGGA GATACCATTC TTGGTATGCG GCTTGATCAC
GGTGGTCACC TGACCCACGG CAGCCCGGTG AACTTTTCGG GGAAGTGGTA TAACGTTCAT
TTCTACGGTG TTGATCCCCA AACGGGTCAG ATCGATTACG ACGATTTGGC GGCAAAAGCG
CGTGCTATCC GGCCTAAACT GATTACGTCA GGGGCGAGTG CTTATCCGCG CTTGATCGAT
TTTGCCCGTA TGCGCCAGAT CGCCGATGAG GTTGGTGCTT TGCTGATGGC CGACATTGCC
CATATTGCCG GGTTGGTCGC TACCGGCGAG CATCCATCGC CGGTCGGTCA TGCCCACATT
ATTACGACTA CCACGCACAA GACGCTGCGT GGGCCACGTG GTGGCTTGAT CCTGATGGGT
GAAGAATTTG CCAAACAGAT CAACTCGAGC GTCTTTCCAG GCACGCAGGG TGGCCCCCTG
ATGCACGTCA TTGCCGGTAA AGCAGTAGCG TTTGGCGAGG CCCTCCGCCC TGAGTTTAAG
CAGTACGCGG CTCAGATTCG GCGTAATGCC AAAGCGTTGG CCGAGGGGCT GCACGCTCAA
GGTTTAACGC TGGTGAGTGG TGGGACCGAT AATCACCTGA TGTTGGTTGA CCTGCGGAGT
ACCGGCCTCA CCGGTGCGCA GGCACAGCGT GCGCTTGATA AAGCTGCGAT CACCGTCAAT
AAGAATGCGA TCCCCGATGA CCCCCAGCCA CCGATGAAGA CGAGCGGGAT TCGGATTGGG
ACGCCGGCGG TCACAACACG AGGGATGCGT GAGCGTGAGA TGGCGCAAAT CGCAGCGTGG
ATTGGGGAAG TCCTGATGTA TCCCGATGAT GAGGTACGTT TGGCACGGAT CGCAGCCGAG
GTGGCCGAGA TGTGTCGCCA TTTTCCGGTA CCGGCAGATA TGGTGCAAGT ACGGTAA
 
Protein sequence
MLEHLRATDP IIADLIEREA QRQRQGLELI ASENYTSLAV MEAQGSVLTN KYAEGLPGRR 
YYGGCEFVDA IEQLAIDRAC QLFGTSHANV QPHSGAQANI AVFTALLQPG DTILGMRLDH
GGHLTHGSPV NFSGKWYNVH FYGVDPQTGQ IDYDDLAAKA RAIRPKLITS GASAYPRLID
FARMRQIADE VGALLMADIA HIAGLVATGE HPSPVGHAHI ITTTTHKTLR GPRGGLILMG
EEFAKQINSS VFPGTQGGPL MHVIAGKAVA FGEALRPEFK QYAAQIRRNA KALAEGLHAQ
GLTLVSGGTD NHLMLVDLRS TGLTGAQAQR ALDKAAITVN KNAIPDDPQP PMKTSGIRIG
TPAVTTRGMR EREMAQIAAW IGEVLMYPDD EVRLARIAAE VAEMCRHFPV PADMVQVR