Gene Caul_4949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4949 
Symbol 
ID5902411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5347471 
End bp5348619 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content76% 
IMG OID641565469 
Productcitrate synthase 
Protein accessionYP_001686567 
Protein GI167648904 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.537298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.392134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACT GGATCGACGC GGAACGGGCG ATGGCGACGC TGGGGGTGCG GGCCCAGACG 
CTGTACGCCT ATGTCAGCCG GGGCCGGGTG GCCGCCGCCG CCCATCCGGA CGATCCACGC
CGCAGCCTCT ACCGCGCCTC CGATATCGCG GCCCTGGCGG CCAAGAAGGC CCGGGGGCGG
CGCGCCGCCG ACGTGGCGGC CGAGGCCATC GCCTGGGGCG AGCCGGTCTT GCCGTCGGCG
ATCACCACGG TGGTGGACGG ACGGCTCTAT TATCGCGGCC AGGACGCCGT CGATCTGGCC
CGCATGCACA GCCTGGAGCA GGTGGCGCGG CTGCTGCGCG GCGGCCATGG CGCGCCGCTG
ACCGGTCCGG AGCCGAAGAA ACCCAAGAAG GGCGAGACGC CGCGCGCCCG CCTGTTCTCG
ACCCTGGCCG TGCGGGCCGG GACCGACCCG CCGGCCCGGG GCCGGGCGCC CCTGGCAATG
GCCATGGAGG CCGCCGGCCT GCTGGAAGCC GCCGCCGACG CCGTGGCCGG GTCAATCGGC
CAGGGTCCGA TCCATCGCCG GCTGGCCGAC GCCTGGAGCC TGGACGCTTC GGGCGCCGAC
CTGGTCCGCC GCGCTTTGGT GCTGCTGGCC GACCACGAGC TGAACGCCTC CACCTTCGCG
GTGCGGGTGG CGGCCTCGAC CGGCGCCTCG CTGGCGGCTG CGTCGCTGGC CGGTCTGGCG
GCGCTGTCAG GCCCCCTGCA CGGCGGCATG GCGGCCCGGG TGGAGATGTT CGTCGACGAG
GCCGAGCGGC GCGGCCCGAC CCGCGCGGTG GCCGAGCGCC TGGCCCAGGG CTCGGCCATG
CCAGGCTTTG GCCACCCGCT CTATCCCGAC GGCGACCCCC GCGCGGCGGC CCTGCTGGAG
GCGTTCAAGG TTCCGGAGGG GCTGGCGCAA TTGCGCGCCG AGACAGAGGC CGCCACGGGC
CTGCGCCCCA ACATCGACTT CGCCCTGGTG GCCATGGCCC GCGCCCGCGC CCTCCCCGCC
GACGCGCCCT TCAGCCTGTT CGCGGTGGGC CGGATGGCGG GCTGGCACGC CCACGCGATC
GAGCAATTGC AGACGGGGCA GCTGATCCGG CCGAGGGCGC GGTATGTGGG CGTGCGGCCG
GGCGCTTAG
 
Protein sequence
MADWIDAERA MATLGVRAQT LYAYVSRGRV AAAAHPDDPR RSLYRASDIA ALAAKKARGR 
RAADVAAEAI AWGEPVLPSA ITTVVDGRLY YRGQDAVDLA RMHSLEQVAR LLRGGHGAPL
TGPEPKKPKK GETPRARLFS TLAVRAGTDP PARGRAPLAM AMEAAGLLEA AADAVAGSIG
QGPIHRRLAD AWSLDASGAD LVRRALVLLA DHELNASTFA VRVAASTGAS LAAASLAGLA
ALSGPLHGGM AARVEMFVDE AERRGPTRAV AERLAQGSAM PGFGHPLYPD GDPRAAALLE
AFKVPEGLAQ LRAETEAATG LRPNIDFALV AMARARALPA DAPFSLFAVG RMAGWHAHAI
EQLQTGQLIR PRARYVGVRP GA