Gene Caul_0483 details


Gene Information

Locus tag: Caul_0483
Symbol:
ID: 5897938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 523584
End bp: 524753
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 67%
IMG OID: 641560966
Product: DNA binding domain-containing protein
Protein accession: YP_001682115
Protein GI: 167644452
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01764] DNA binding domain, excisionase family
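
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 523584
END_BP = 524753
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the gene minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 524753 - 523584 + 1 gives 1170 bp, and 1170 / 3 - 1 gives 389 aa, which agrees with the listed values.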


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGACG ACACGTTTTT CATGTCTGCA GACGAGGCCG CCCAGACGCT GGGCGTCAGC 
TTGACGACGC TCTATGCCTA TGTGAGCCGC AAAGGCATCC GGTCCGAAAA ACAGCCCGGG
TCGAAGAGCC GTCGGTACTG GCGGGCGGAC GTGGAGATCG CGCGCACGCG TGGCAAGCCG
GCCGATCCAT CGGCGGGCCT GATCAGATCG ACCGCCGTCA CGCTGATGAC TGGGCAGGGG
CCGTTCTATC GCGGCGTCTC GGCGATCGCG CTGTCGCAGA CGGCGAGCAT CGAAGAGGTT
GCAAGCCTCC TATGGGACGC GCCCGACGCC TTTGCCGATA CGACACCAAC CTTGCCGTCG
GACTATGACG ACGCCGCGCG GGTCATGTCG TCCCTGCCCA GGACGGCGCG CGCCATCGCC
CTGTTTGGCT TGATGGAGCA GGCCAATCCT CGCGCGCACG ACCTTTCTCC CAGTGGATAC
GCGCGCACCG GGGCGGGCGT CATTCGATGC TTCGCCGCGA TCGTCGGCGG CGGCTCCCCG
ACAGAGGTCG CGCCCATCCA TGAGAGTCTG GCGCGAAGTC TCAATGCGCC GCCGGGCTAC
GCGGACGCCA TTCGAAGCTG CCTGGTGCTT GCCGCCGATC ACGAACTGGA TCCGACGACG
TACGCCGTCC GCGCCGCGGC CAACACGGGC GTGACACCCT ATGGCGCGGC CATGGCGGGC
TTGATCGCCG GGCGCGGCCG CCGGCTGAAG CTTGCGCGCG CCGAGCGTGT CGCCCGGTTC
GTCGACGAGC TGATGACGGG CGATCCGAGA GAGGCCGTCG TCTCGCGCTT TCGGATCGGC
GAAGCGCTCC CTGGTTTCGG CGGTGAGCTC TATGCCGATA CCGATCCCCG GGCCCAGGCC
CTGATCTCGG CGCTTCGGGC CAGCGTTCCT GGCGAGCTCA TGGAGCGTCT GGACAAGGTT
ATTGAGGTCG CGGCGGACCT GACCGGCGCC GGCCCGGATT TCATTCTGCC AACGGTGTTT
CTGGGCAGGC TACTGGGCTT GAAGGGTGAG GAACTGGCGG TCTCGACCGT GGGAAGAATG
GTCGGGTGGA TCGCCCACGC CATGGAGCAA TACCAGGACA ACGACCTCTT CCGGCCGCGC
GCGGCCTATG CTGGCAAGTT ACCGAACTGA
 
Protein sequence
MNDDTFFMSA DEAAQTLGVS LTTLYAYVSR KGIRSEKQPG SKSRRYWRAD VEIARTRGKP 
ADPSAGLIRS TAVTLMTGQG PFYRGVSAIA LSQTASIEEV ASLLWDAPDA FADTTPTLPS
DYDDAARVMS SLPRTARAIA LFGLMEQANP RAHDLSPSGY ARTGAGVIRC FAAIVGGGSP
TEVAPIHESL ARSLNAPPGY ADAIRSCLVL AADHELDPTT YAVRAAANTG VTPYGAAMAG
LIAGRGRRLK LARAERVARF VDELMTGDPR EAVVSRFRIG EALPGFGGEL YADTDPRAQA
LISALRASVP GELMERLDKV IEVAADLTGA GPDFILPTVF LGRLLGLKGE ELAVSTVGRM
VGWIAHAMEQ YQDNDLFRPR AAYAGKLPN
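
The gene and protein sequences above can be cross-checked with a small sketch that translates DNA and computes GC content. It assumes the sequence is read in frame from its first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, chosen for illustration only.

```python
# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    first_blocks = "ATGAATGACG ACACGTTTTT CATGTCTGCA"
    print(translate(first_blocks))  # MNDDTFFMSA
```

The first 30 bases translate to MNDDTFFMSA, matching the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.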