Gene Caul_5229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5229 
Symbol 
ID5897322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp152742 
End bp153998 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content64% 
IMG OID641555332 
ProductAAA ATPase 
Protein accessionYP_001676663 
Protein GI167621878 
COG category[R] General function prediction only 
COG ID[COG1373] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.303429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGCGA CCGTGATCGA GCGTGAGGCC TTGTCCAGGA CCGTGGCGGC CTTGGATCGA 
TGGCCCTGCG TCGGCCTCTA CGGGCCACGC AGTGTCGGCA AGAGCAGTCT AGCCGATCAG
ATCGCCAAGG ATCGCGGCGA TGAGATCGTT CGTCTGCACT TTGGCCGCCA GGCCGATCGC
GATCGTCTTA GGGATCAAGC TGCGTTCTTC GCCGCGCATG GCGACAAGCT TGTCGTCATC
GACGAGATCC ATCTGATGCC CGAGGCCTTT AGGATCGTGC GCGCACGTCT CGACGATTGG
GCGCGGCCCA GGCCGGGCGC TGGGCAGTTC CTGTTCATGG GCTCGGAGTC CCACCAAGTG
CGCGCCATGG CGGCCGAGGC GCTGGGCGGG CATTCGATGG CGATCGATCT GACGACCATC
CAGCCCCACG AATTGCCAAC TGCCTCGCCG ATCACCTTGA TGGAGACCTT TGACGCCCTC
CACTTTTCGG TCCCGGAGGT CGAGCCGACC GCAAGCGCGA ACCAAGCCCT TTCGATGGAT
CAGCTCTGGT CGCGCGGCGG CCTGCCCAAC AGCCTATTGG CCAGCGACGA GGCCGAGAGC
TACGTCTGGC GGCGCAACTA CCTTGACCAA ATTTTCGCCC TGGCGCCCAG TGGCCCCAAC
GCCATAGACG GCGAGCTGCG CCACTGCCTG GAAATGATCG CCACCGAGCA GGGCGGCCAG
ACGCCGCTGA CTACCTCTCC CAAAACCTTT CGCGCCGCAC TGGAGCGTCT CAAACGTATG
GGACTGGTGC GTGAGCTTCG ACCATGGTCG GGCAACGCAA AGCTGAAGCT GACCAAGAAT
CCCAAGCTCT ATATCCGCGA CTCCGGTCTG TTTCACGTTC TGCGGGGCTG CCGCACCCGC
GCCGATCTCG ACAATGCCGA CGACCGCTTA CTCGGCGGCA GTTGGGAGGG CTTCTGCATC
GAGGCGATCG CCGCGCGTCT GGGCGAGCGC GCCGACTTGT TCTTCTATCG CATCGAGGCC
AGCGACGAGC TTGATCTCGT CATTGAGTTC ACGCTGGGTG AGCGCTGGGT CGTCGAGATC
AAGTCCAACC CGATGGCGAC CATCGGCGCA GGCTTTTGGT CTGCAAGCGC CGCGCTCGAT
CCCAAACGAA AGGTGATCGT TCACCAGGGC GACGCGGCCG TCACCAACAA GAGTGGCCTG
GAAGCCTTGC CGCTGAGGAT GTTTCTCGAC CAGCTCGGCG CGAAGGCGCC CGATTGA
 
Protein sequence
MVATVIEREA LSRTVAALDR WPCVGLYGPR SVGKSSLADQ IAKDRGDEIV RLHFGRQADR 
DRLRDQAAFF AAHGDKLVVI DEIHLMPEAF RIVRARLDDW ARPRPGAGQF LFMGSESHQV
RAMAAEALGG HSMAIDLTTI QPHELPTASP ITLMETFDAL HFSVPEVEPT ASANQALSMD
QLWSRGGLPN SLLASDEAES YVWRRNYLDQ IFALAPSGPN AIDGELRHCL EMIATEQGGQ
TPLTTSPKTF RAALERLKRM GLVRELRPWS GNAKLKLTKN PKLYIRDSGL FHVLRGCRTR
ADLDNADDRL LGGSWEGFCI EAIAARLGER ADLFFYRIEA SDELDLVIEF TLGERWVVEI
KSNPMATIGA GFWSASAALD PKRKVIVHQG DAAVTNKSGL EALPLRMFLD QLGAKAPD