Gene Cagg_3603 details


Gene Information

Locus tag: Cagg_3603
Symbol:
ID: 7269747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4379112
End bp: 4380239
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 59%
IMG OID: 643568411
Product: pyruvate carboxyltransferase
Protein accession: YP_002464877
Protein GI: 219850444
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02146] homocitrate synthase
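
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but the arithmetic agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4379112
END_BP = 4380239
GENE_LENGTH_BP = 1128
PROTEIN_LENGTH_AA = 375


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1128
print(expected_protein_length(GENE_LENGTH_BP))  # 375
```

Both values match the Gene Length and Protein Length fields, which is consistent with the record describing an unspliced coding sequence.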


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.109467
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTGC CTGAGCGCCT GTTTTTTGTC GATACCACCC TGCGCGAAGG CGAACAGTTC 
GCCAGCGCCC GCTTTACGTC CACCCAACGG CTTGCTATCG CAGAAATGCT CGACGCATTC
GGCGTTGAGT ATATCGAACT CACCTCTCCG GCAGCTTCGC CGCAAAGTGC GCGTGATCTC
GCCACCATTG CCCGTCGCGG TCTCCGCGCC CGTATCCTCA CCCATATCCG CTGTCACATG
GCCGATGCAC GCCTCGCCGT CGAACACGGT GCGCAAGGTG CGAATCTGCT CTTCGCTACG
TCCGAACCCC TACGCACGGT GAGCCACGGA CGCAGCCTCG ATGAGATTTT GGCTGAAGCG
CAACAGGTGA TCACTTACCT GCGCGACCAC GACGTCGAGG TGCGCTTTTC GTGTGAAGAT
AGTTTCCGCA CCGACCTTGC CGACTTGATC CGCATTTACC GCGCGGTCGA GACGATGGGC
GTCCAACGGA TCGGTCTTGC CGATACCGTT GGCATCGCTA CGCCGCGTCA AGTCTATGAA
GTGGTTAGCG CTGTGCGTGC TGAAGTCACA TGCGACATCG AATTTCACGG CCACAACGAT
AGTGGCTGCG CAGTCGCCAA TACCTTCTGC GCTTACGAAG CCGGTGCGAC CCACCTCGAT
GTGACGGTAC TTGGGATCGG TGAACGCAAC GGTATTGCCA GTCTAAGCGG GATGATTGCA
CGGATTGCGA GCGTCGATCC GGATCGTGTT CGGCGGTATC GTCTCGATCT GTTGCCTAAG
ATCGACGAGA CGGTAGCAAC CATGCTCGGC ATCGAAATCC CATTCAACCA GTGCATTACC
AGTCCGACCG CTTTTCACCA CAAGGCCGGG ATGCACACGA AAGCCGTGCT GGCCGATCCA
CGCAGCTACG AAGTGCTCGA TCCGAACCTG TTCGGTCGCC AGCGCACCAT TGCGATTGCC
CACCGGTTGG TGGGGTGGCA CGCCGTCGCC GAACGCGCCC GCGAACTGGG TATCACCCTC
AGCGAAGCGC AAGCCCGCGC CGCCGCCGCC CGCATTAAAG CTCTCGGCGA CGAACACGAC
CTTGATGGCG CAATGATCGA TGAGATTCTT TATAGCTACG CCGAATAA
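
The GC content field in the gene information can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length; `gc_content` is a hypothetical helper, and the database's exact rounding is not known.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks used in
    the record) is removed before counting.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Usage on the first two 10-base blocks of the gene sequence.
print(gc_content("ATGTCACTGC CTGAGCGCCT"))
```

Passing the full gene sequence as one string gives a value to compare with the GC content field.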
 
Protein sequence
MSLPERLFFV DTTLREGEQF ASARFTSTQR LAIAEMLDAF GVEYIELTSP AASPQSARDL 
ATIARRGLRA RILTHIRCHM ADARLAVEHG AQGANLLFAT SEPLRTVSHG RSLDEILAEA
QQVITYLRDH DVEVRFSCED SFRTDLADLI RIYRAVETMG VQRIGLADTV GIATPRQVYE
VVSAVRAEVT CDIEFHGHND SGCAVANTFC AYEAGATHLD VTVLGIGERN GIASLSGMIA
RIASVDPDRV RRYRLDLLPK IDETVATMLG IEIPFNQCIT SPTAFHHKAG MHTKAVLADP
RSYEVLDPNL FGRQRTIAIA HRLVGWHAVA ERARELGITL SEAQARAAAA RIKALGDEHD
LDGAMIDEIL YSYAE
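
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is an illustration, not the database's own pipeline: it builds the codon table by hand, relying on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a coding sequence read in frame from its first base.

    Whitespace is ignored; codons containing non-ACGT symbols become "X".
    With to_stop=True, translation ends at the first stop codon.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = (
    "ATGTCACTGC CTGAGCGCCT GTTTTTTGTC "
    "GATACCACCC TGCGCGAAGG CGAACAGTTC"
)
print(translate(first_line))  # MSLPERLFFVDTTLREGEQF
```

The output matches the first 20 residues of the protein sequence. Translating the complete gene sequence the same way gives a string to compare against the full 375-residue entry.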