Gene Caul_4789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4789 
Symbol 
ID5902251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5175565 
End bp5177109 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content69% 
IMG OID641565309 
Productprotein of unknown function DUF853 NPT hydrolase putative 
Protein accessionYP_001686407 
Protein GI167648744 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.851867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGG TGATCGGCGA AGGCGAAATC CTCATCGGAC TTAGCGAGGG GGGGCTCGGC 
GAAGGACCCG TGACCCAGCG GCTCGACCGA TCCAATCGGC ACGGCGTGGT GGCCGGCGCG
ACGGGCACGG GCAAGACCGT CACCCTGCAG GTGATGGCCC AGGCCTTTTC CGACGCCGGC
GTGCCGGTGT TTGCCGCCGA CGTGAAGGGC GACCTGTCGG GCATCGCCGC CATCGGGACG
CCCAACGACA AGATGCTGGC CCGCGCGGCG TCCATGGATC TGACCCTGAC TCCGGCCGCG
CCGCCCGTCG TGTTCTGGGA CCTGTTCGGC CAGAAGGGCC ATCCGATCCG CGCCACCATC
TCGGAGATGG GGCCGGTGCT GCTGTCGCGA CTGCTGGAAC TGAACGACGT GCAGGAGGGC
GTGCTGACCG TGGTCTTCCA CGTCGCCGAC AAGGACGGCC TGCTGCTGCT GGACCTGAAG
GACCTGCAGG CGGCCCTGAA ATACGTCGCC GACCACGCCG CCGAGATCGG CACCCAGTAC
GGCAATGTCT CGCCGGCCAC GGTGGGCGCG ATCCAGCGCA AGCTGTTGAC CCTGCAAAGC
CAGGGCGCCG AAAACTTCTT CGGCGAGCCG GCCCTGAAGC TGACCGACAT CATGCGCACC
GATGTGGCGG GGCGCGGCTA CGTCAACCTG CTGGCCGCCG ATAAGCTGAT CCAGTCGCCC
AAGCTCTATT CGACCTTCCT GCTATGGCTG CTGTCGGAGC TGTTCGAGGA GCTTCCCGAG
GTCGGCGACC CCGACAAGCC CAAGCTGGTG TTCTTCTTCG ACGAGGCCCA CCTGCTGTTC
AACGACGCGC CCAAGCCGTT GCTGGAGAAG ATCGAGCAAG TCGTGCGGCT GATCCGCTCC
AAGGGGGTGG GCATCTATTT CGTCACCCAG AATCCGGCCG ACATTCCCGA CGCGGTGCTC
GGCCAACTGG GCGCCCGCGT GCAGCACGCG CTGCGCGCCT ACACCCCCGC CGACCAGAAG
GGGTTGAAGG CCGCCGCCCA GTCTTTCCGG GTCAATCCGG CCTTCGACAC CGCCGAGACC
ATCCAGGCCC TGGGCGTGGG CGAGGCCCTG ATCTCCACCC TCGACGCCAA GGGCGCGCCC
TGCGTGGTGC AAAAGACCCT GATCCGTCCA CCCGCCTCGC GCCTGGGTCC GCTGACGCCC
GAGGAGCGCG TCGCCCTGAT CGCCAGAAGC CCGGTCGCCG GGCTCTATGA CCAGACGCTC
GATCGCGCCT CCGCCTACGA GATCCTGCAG GGCCGGGCCG CCCAGGCCCA GCAGCAAGCC
GACACGGTCG CCGCCGCGGC CGAGGCCCAG CGCCAACAGG CCGCCGCCGA AAAGGTCCGT
GAACGTGAAG AGGCGGCGGA GGCCCGCGCC GCGCCGCGAC CGCGCGCCTC CAGCCGTCAG
TCCATGGGCG AGGCCTTCGC CACCTCGCTG CTGCGCACGA TCGCCAACCA GGCGGGGCGA
GAGATCATGC GCGGCCTGAT GGGCGGCATG AGTCGCCGAA GGTAG
 
Protein sequence
MTLVIGEGEI LIGLSEGGLG EGPVTQRLDR SNRHGVVAGA TGTGKTVTLQ VMAQAFSDAG 
VPVFAADVKG DLSGIAAIGT PNDKMLARAA SMDLTLTPAA PPVVFWDLFG QKGHPIRATI
SEMGPVLLSR LLELNDVQEG VLTVVFHVAD KDGLLLLDLK DLQAALKYVA DHAAEIGTQY
GNVSPATVGA IQRKLLTLQS QGAENFFGEP ALKLTDIMRT DVAGRGYVNL LAADKLIQSP
KLYSTFLLWL LSELFEELPE VGDPDKPKLV FFFDEAHLLF NDAPKPLLEK IEQVVRLIRS
KGVGIYFVTQ NPADIPDAVL GQLGARVQHA LRAYTPADQK GLKAAAQSFR VNPAFDTAET
IQALGVGEAL ISTLDAKGAP CVVQKTLIRP PASRLGPLTP EERVALIARS PVAGLYDQTL
DRASAYEILQ GRAAQAQQQA DTVAAAAEAQ RQQAAAEKVR EREEAAEARA APRPRASSRQ
SMGEAFATSL LRTIANQAGR EIMRGLMGGM SRRR