Gene Caul_5329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5329 
Symbol 
ID5897157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp37901 
End bp39124 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content65% 
IMG OID641550621 
Producthypothetical protein 
Protein accessionYP_001672107 
Protein GI167621599 
COG category[R] General function prediction only 
COG ID[COG1373] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.54087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCCTC GCCATGCCAT CGACCGGATC CGCGAAGCCC TGAGCGACAC CCGGGTGGTG 
CTGCTGGCGG GGCCTCGCCA AGCGGGAAAG ACGACCTTGG CCCGCTCGCT CGCCGAAGCC
GGCCGCACCT ATCTGACCCT GGACGACGCG ACCACTCTCT CGGCTGCCAA GGCCGACCCC
GCCGGCCTGG TGCGCGGCCT AGACAAAGCT GTCATCGATG AAGTGCAGCG GGCACCCGAT
CTGCTTCTAG CGATCAAGGA CAGCGTTGAT CGGGACACCC GTCCGGGTCG CTTCCTCCTG
ACCGGCTCAG CCAATCTAAT GACCCTGCCG CGCGTGGCCG ACTCCCTGGC AGGCCGCATG
GAAACCATCC GCCTGATGCC TCTGGCGCAG TCGGAGATCC TGGGACAGCC GGCGTCACGG
TTTCTCGCAT CCCTGTTCGC GGGCCAGGCA CCGCCGCCAG GCCCGCCTCG CCTAGGCGCG
GACCTGATCG ACCTGGTCCT GGCGGGCGGC TATCCCGAGG CCCTCGCGCG TAAGACCTGG
GCTCGCCGAC AGGACTGGTA CGCTAATTAT ATCGAGGCCG TGGTCGGGCG CGACGTGCGC
GACATCGCCA ATATCGACCA ACTTGACCGT ATGCCGCGCC TGCTACGCGC CCTGGCCGAG
CACTCGGGAC AGCTGATTAA TCACGCCGGC GTCGGCGCCA GTCTCGATCT CAACCATGTG
ACGACGCAAA AATACACCGG CGTCTTTGAG CAGCTGTTCC TCGTGCGCAC CCTGCCGCCT
TGGCACAACA ACGCCCTCAA ACGGCTGACT AAGAAGCCCA AACTGCACTT CCTCGACTCA
GGCTTGCTCG CGGCGCTGAG GGGCCTGACC CCCGAACGCG TAGCCGCGGA CCGATCAAAT
TTCGGTGCGG TGCTCGAGAC CTTCGTCTTC GCCGAAGTGC TCAAACTGAC TGGGTGGAGC
GACGAGCGCT TCTCGCTAAG CCATTTTAGA GACAAGGAGC AGGACGAGGT CGATATCGTC
CTAGAAGATC GACAGGGCAA GATCGTCGGC TTGGAGGTCA AGGGATCAGC GACGGTGCGC
AGCGAGGATT TCGCGGGCCT GCGCAAACTG GCGCAGGCTG TGGGTGATCG CTTCGCGTTC
GGGGCGGTAC TGTACGACTA TGAACAGGTC GTGCCGTTCG GCGAGCGCCT GGCCGCCGCG
CCATTGTCCA GCCTTTGGGG TTAG
 
Protein sequence
MYPRHAIDRI REALSDTRVV LLAGPRQAGK TTLARSLAEA GRTYLTLDDA TTLSAAKADP 
AGLVRGLDKA VIDEVQRAPD LLLAIKDSVD RDTRPGRFLL TGSANLMTLP RVADSLAGRM
ETIRLMPLAQ SEILGQPASR FLASLFAGQA PPPGPPRLGA DLIDLVLAGG YPEALARKTW
ARRQDWYANY IEAVVGRDVR DIANIDQLDR MPRLLRALAE HSGQLINHAG VGASLDLNHV
TTQKYTGVFE QLFLVRTLPP WHNNALKRLT KKPKLHFLDS GLLAALRGLT PERVAADRSN
FGAVLETFVF AEVLKLTGWS DERFSLSHFR DKEQDEVDIV LEDRQGKIVG LEVKGSATVR
SEDFAGLRKL AQAVGDRFAF GAVLYDYEQV VPFGERLAAA PLSSLWG