Gene Caul_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0007 
Symbol 
ID5897719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp9533 
End bp10543 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content60% 
IMG OID641560490 
Productputative DNA-binding protein 
Protein accessionYP_001681643 
Protein GI167643980 
COG category[R] General function prediction only 
COG ID[COG3943] Virulence protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.341702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAG GCGAGATCGT TCTCTACAAT ACCGAAGACG GGCTCGCGCG CGTGCAACTG 
CGGGCGGCCG ATGGGACGGT TTGGCTGACC CAGGCGCAGA TGGCCGATCT GTTCCAGACG
TCGCGGCCCA ATATCACCCA GCATCTCATC GCGATCTTCG AATCGGGCGA ATGTCTGGAG
GCAGCAGTAT GTAAGCAAGA CTTACTAACT GCCGCTGATG GCAAACGATA TCAGACAACG
CTCTATCGCC TCGAAGCCAT CCTCGCTATC GGCTACCGGG TGCGATCAGC GCGCGGCGTG
CAGTTCCGAC AATGGGCGAC AACCCGTCTG CGGGACTATC TGGTCAAGGG CTTCGTGCTC
GACGAGCAGC GCCTGCGTGA GCCAGAACCC TTCGATTATT TCGACGAACT GCTTGAGAAG
ATCCGCGACA TCCGGGCCTC GGAGAAGCGG TTCTACCAGA AGGTCCGCGA TGTCTACGCA
ATGGCCGCCG ACTACGATGG GCGATCCGAA GCCGCCCAGA TCTTCTTCGC CACTGTTCAG
AACAAGATGC TCCACGCTGT CTCGGGCAAG ACGGCGGGAG AGCTGATCGT AGCCCGCGCT
GATCCTATCG CGCCAAATAT GGGGCTGCGA AGCTGGAAGG GCGGCAAGGT TCGCAAAGGC
GACGTCGATA CCGCCAAGAA TTATCTAAGC GCTGACGAGG TCAGCGATCT TAATCGCATC
GTGACGATGT TTCTGGACTT CGCCGAGGAT CAGGCTCGCC GTCGCAAGCC GATGACCATG
GCCGAGTGGG CGGCGCGCCT GGACGCCTTC TTGGCCTTCA ATGATCGTGA CGTCCTGGCC
AGCGCCGGTC GGGTCACCGC GGACGAGGCC AAACGTGTCG CGCACAAGCG GTTCGAGGTA
TTCGACACCG GACGACGCGC GGCGGAATCG CACGCGGCCG ATGCCGAACA TGTCGAGGAA
ATGCTTCGGC TAACCGAAGA GCTCAAGCCC GCCTCGAAGA AACGCTTCTA G
 
Protein sequence
MSEGEIVLYN TEDGLARVQL RAADGTVWLT QAQMADLFQT SRPNITQHLI AIFESGECLE 
AAVCKQDLLT AADGKRYQTT LYRLEAILAI GYRVRSARGV QFRQWATTRL RDYLVKGFVL
DEQRLREPEP FDYFDELLEK IRDIRASEKR FYQKVRDVYA MAADYDGRSE AAQIFFATVQ
NKMLHAVSGK TAGELIVARA DPIAPNMGLR SWKGGKVRKG DVDTAKNYLS ADEVSDLNRI
VTMFLDFAED QARRRKPMTM AEWAARLDAF LAFNDRDVLA SAGRVTADEA KRVAHKRFEV
FDTGRRAAES HAADAEHVEE MLRLTEELKP ASKKRF