Gene Caul_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0149 
Symbol 
ID5897861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp165940 
End bp167196 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content70% 
IMG OID641560634 
ProductROK family protein 
Protein accessionYP_001681785 
Protein GI167644122 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.421428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGGGA AGGCCCTTGT CGAAGATCAT CGGGCGCCGT CCTTGGAAAC CTCGCGGAAG 
TCGAACCGGA AGCGCACCTC GGGCGCGAGC CTGTCGGGCG CCAATCTGGA GCGGGTCGGC
GACCACAATC AGCGGGTCAT CCTGCAGGCC ATCCGGCTGG GCGCCCCGAT CACCCGCGTG
GCCTTGGCGA AGATCAGCGG CCTGACGCCG CCCGCCGTCG CCAACATCAC TAAGCGGCTG
CTGGACGACG GGCTGATCCT CGAGGCCGGA CGGGTGCAGG GCGCGCGCGG CCAGCCGGCC
ATGAACCTGA CGATCAATCC CGACGGCTGC CTGTCGATCG GCGTCAATAT CGACCGCGAC
CACATCACCG TCGTCATGCT CGACCTGCTG GGCGCGGTGC GGGCCCGGGC CAGCCAGGAG
ATCGAGTTCC CCCTGCCCGC CGACGTGGCC CGGTTCTGCA AGACCCAGAT CCGCAAGATG
CTGGCGGCCT GGAAGGGCGA TCCGCCGCGC CTGTCGGGGA TCGGCGTGGC CCTGCCCGAC
GACCTGGGCC GGGTGGACCT GCCGCACCGG CCAGGCAACT ACGACGTCTG GAGCTCGGCC
GATGTCGGCA AGCTGCTGGC CGACATCCTG CCCCTGCCGG TGTTCCTCGA GAACGACGCC
GCCGCCGCCG CCCTCGGCGA GCTGCAGTTC GGCCATGGCC TGCGCAAGCC TAGCTTCTTC
TATGTCCTGG TCTCGTCGGG CCTGGGCGGC GGCATGGTGG TCGAGGGCGA CTATTTCCGC
GGAGCCCAGG GCCGTAGCGG CGAGATCGGC TTCCTGCCCG TCCGCTCGCC CAAGACCAAG
GCCCGGTCGC TGCAGGAGGT GGTGTCGCTC AGCGCCCTCT ACGCCCATCT GGAGGCGGGC
GGGATCACGG TCGATCGGCC AGACCAGCTG ACCGCGCTGA CCGCCAAGGG CCAGGCCCTG
GTCGCCGACT GGATCGCCCT GTCCGCCAAG CTGCTGGTCC AGCCGTTCGT GGCGATCAGC
TGCCTGTTCA ATCCCGAGGC CATCTATATC GGCGGACGCC TGCCCACCAA CCTGATCGAC
AGCCTCGTCG CGGCGGTCAA CGACCGGCTG GCGCGGGTCG AGGACGTGCC CGCCCTGGCC
CGCGTCGAAC GCGCCGCCAC CTCGGCCGAC GGACCGGCGG TCGGGGCGGC CCTGCTGCCG
TTCATGGCCC AGCTCCTGCC CTCGCGAGCG GCGCTGATGA AGACCGGCAG GGCGTGA
 
Protein sequence
MLGKALVEDH RAPSLETSRK SNRKRTSGAS LSGANLERVG DHNQRVILQA IRLGAPITRV 
ALAKISGLTP PAVANITKRL LDDGLILEAG RVQGARGQPA MNLTINPDGC LSIGVNIDRD
HITVVMLDLL GAVRARASQE IEFPLPADVA RFCKTQIRKM LAAWKGDPPR LSGIGVALPD
DLGRVDLPHR PGNYDVWSSA DVGKLLADIL PLPVFLENDA AAAALGELQF GHGLRKPSFF
YVLVSSGLGG GMVVEGDYFR GAQGRSGEIG FLPVRSPKTK ARSLQEVVSL SALYAHLEAG
GITVDRPDQL TALTAKGQAL VADWIALSAK LLVQPFVAIS CLFNPEAIYI GGRLPTNLID
SLVAAVNDRL ARVEDVPALA RVERAATSAD GPAVGAALLP FMAQLLPSRA ALMKTGRA