Gene Cagg_3738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3738 
Symbol 
ID7267811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4553631 
End bp4554917 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content53% 
IMG OID643568545 
Productcitrate synthase I 
Protein accessionYP_002465010 
Protein GI219850577 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.680995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.033422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAA ACTCCCTCAC TGTTATCGAC AACCGTACCG GTAAGACCTA CGAAATTCCG 
ATTGAGCATG GCGCCATCCG GGCAACCGAT TTACGCCAGA TCAAAGTCTC TGACGACGAT
TTTGGCTTGA TGTCGTATGA TCCGGCTTAT CTCAATACTG CTTCGTGCAA GAGTAGCATC
ACCTTCATTG ACGGTGACAA AGGCATTCTC GAGTATCGCG GTTATCCGAT CGAACAGCTT
GCGGAGCAAA GCTCGTATCT CGAAGTAGCC TATCTCTTGC TCTATGGTGA GCTACCATCA
AAAGAGCGAT TGGAGTGGTG GGAATATCGC ATTAGTCGCC ATCTGTTCTT ACACAATAGC
CTCGTCGAGT TGATTCAGGC CTTCCGCTAC GATGCGCATC CGATGGGTAT CTTGATCAGC
TCGGTAGCGG CGATGTCGAC GTTGTACCCC GAAGCGAAAA ACATTCACGA TCCTGCCGTG
CGCGAGAAGC AGATTTGGCG TATTATCGGT CAGATTCCAA CCATCGCTGC GTTTGCCTAT
CGACACCGCA TCGGACGACC GTTTAACTTG CCCGATAGTT CGCTGAGCTA CACGGCCAAT
TTGCTCTACA TGATGGACTA CATGAACCAA CGCGAATATG AAGTTAATCC GGTGTTGGCC
AAGGCGTTAG ATGTGCTCTT CATCTTGCAT GCCGATCACG AGCAGAACTG CTCAACATCG
GTGATGCGTA GTGTCGGTTC GAGCCACGCC GATCCCTACA ACGCGCTGGC AGCAGCAGCG
GCGGCATTGT ATGGGCCGTT GCATGGTGGA GCCAATGAAG CCGTGTTGCG GATGTTGCAG
CAGATTGGCC ATCCCAAGAA TGTGCCGGCA TTTATCGAGC GGGTGAAGAA GGGTGAGACC
CGCCTGATGG GTTTTGGTCA TCGCGTCTAC AAGAACTACG ATCCGCGAGC TAAGATTATT
CGGCGCATTG CCCACGAAGT CTTTGCGGCC ACGGCGGCCA ATCCGTTGCT TGATGTGGCA
ATGGAACTCG AGCGGGTGGC ATTGGAAGAT GAATACTTCA TCTCGCGCAA GCTCTATCCG
AATGTTGACT TCTACAGTGG TTTGATCTAT CAAGCATTGC GCTTCCCCAT CGAGTACTTC
CCCTTCCTGT TTGCCATTCC GCGTGCATCG GGTTGGTTGG CGCAGTGGCT TGAGATGCTC
GACGATCCTG AGCAGAAGAT TACGCGACCG CGGCAGGTGT ATGTTGGCCC GCAGCGGCGT
GATTATGTGC CGATCGATCA GCGCTGA
 
Protein sequence
MTKNSLTVID NRTGKTYEIP IEHGAIRATD LRQIKVSDDD FGLMSYDPAY LNTASCKSSI 
TFIDGDKGIL EYRGYPIEQL AEQSSYLEVA YLLLYGELPS KERLEWWEYR ISRHLFLHNS
LVELIQAFRY DAHPMGILIS SVAAMSTLYP EAKNIHDPAV REKQIWRIIG QIPTIAAFAY
RHRIGRPFNL PDSSLSYTAN LLYMMDYMNQ REYEVNPVLA KALDVLFILH ADHEQNCSTS
VMRSVGSSHA DPYNALAAAA AALYGPLHGG ANEAVLRMLQ QIGHPKNVPA FIERVKKGET
RLMGFGHRVY KNYDPRAKII RRIAHEVFAA TAANPLLDVA MELERVALED EYFISRKLYP
NVDFYSGLIY QALRFPIEYF PFLFAIPRAS GWLAQWLEML DDPEQKITRP RQVYVGPQRR
DYVPIDQR