Gene Caul_2787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2787 
Symbol 
ID5900242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3025391 
End bp3026671 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content63% 
IMG OID641563279 
Productcitrate synthase I 
Protein accessionYP_001684412 
Protein GI167646749 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.420339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.059507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA AAGCCACGCT GACGATCGGC GACAAGAGCT ACGACCTGCC GATCCTCAAG 
GGCAGCACGG GACCCGACGT CGTCGACGTG CGGAAGTTCT ATGGCGACTC CGACCATTTC
ACTTTCGATC CGGCCTTCAC CTCGACCGCA TCCTGCGAAA GCAAGATCAC CTATATCGAT
GGCGACGCCG GCGTTTTGCT GCACCGCGGC TATCCGATCG GCCAACTGGC CGAGCAGTCC
AGCTTCCTGG AAGTCTGCCA CCTGCTGCTG AACGGCGAAC TGCCGACGGC CGACGAATTC
ACCAAGTTCG AACGCAACAT CACCTACCAC ACGATGCTGC ACGCCCAGTT CGACAGCTTC
TTCCAGGGCT TCCGCCGCGA CGCCCACCCG ATGGCGGTGA TGACCGGCGC GGTCGGCGCC
CTGTCGGCCT TCTATTCCGA CAGCCTGAAC GTCGATGATC CCAAGCAGCG CGAGATCAGC
GCTCACCGCC TGATCGCCAA GATGCCGACC ATCGCCGCGC GCGCCTTCCA GTACTCGCAA
GGCCGTCCGT TCGTGACGCC GCGCAACGAG CTGAGCTACT CGGAAAACTT CCTGCGCATG
TGCTTCTCGG TGCCGGCCGA GGATTGGGTC CCCAACCCCA TCCTGACCCG CGCCATGGAT
CGCATCTTCA TCCTGCACGC CGACCACGAG CAGAACGCCT CGACCTCGAC CGTCCGTCTG
GCCGGCTCGT CGGGCGCCCA CCCGTTCGCC TGTATCGCCG CCGGCATCGC CTGCCTCTGG
GGTCCGTCGC ACGGCGGCGC CAACCAGGAA GCCCTGGAGA TGCTGGAAGA GATCGGCACG
GTCGAGAACA TCCCCGCCTA TGTGCAGGGC GTGAAAGACC GCAAGTACAA GCTGATGGGC
TTTGGCCACC GGGTGTACAA GAACTTCGAC CCCCGGGCGA CGGTCATGCA GAAGACCTGC
TACGAGGTTC TGGAGCAGTT GGGCATCGAC GACCCGCTGC TGCAAGTGGC CATGGAGCTG
GAAAAGGTCG CGCTGAGCGA TCCCTACTTC ATCGATCGCA AGCTCTATCC GAACATCGAC
TTCTATTCGG GCATCACCCT GCGCGCGATG GGTTTCCCGA AAGAGATGTT CACGGTGCTG
TTCGCCCTGG CTCGCACCGT CGGCTGGATC AGCCAGTGGA AGGAAATGTT CGAGGACCCC
AACCGCAAGA TCGGCCGCCC GCGCCAGCTC TATACGGGCG CCACGCAACG CGACTACGTG
TCGGTCGACA AGCGCGGCTA A
 
Protein sequence
MTDKATLTIG DKSYDLPILK GSTGPDVVDV RKFYGDSDHF TFDPAFTSTA SCESKITYID 
GDAGVLLHRG YPIGQLAEQS SFLEVCHLLL NGELPTADEF TKFERNITYH TMLHAQFDSF
FQGFRRDAHP MAVMTGAVGA LSAFYSDSLN VDDPKQREIS AHRLIAKMPT IAARAFQYSQ
GRPFVTPRNE LSYSENFLRM CFSVPAEDWV PNPILTRAMD RIFILHADHE QNASTSTVRL
AGSSGAHPFA CIAAGIACLW GPSHGGANQE ALEMLEEIGT VENIPAYVQG VKDRKYKLMG
FGHRVYKNFD PRATVMQKTC YEVLEQLGID DPLLQVAMEL EKVALSDPYF IDRKLYPNID
FYSGITLRAM GFPKEMFTVL FALARTVGWI SQWKEMFEDP NRKIGRPRQL YTGATQRDYV
SVDKRG