Gene Caul_0465 details


Gene Information

Locus tag: Caul_0465
Symbol:
ID: 5897920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 506383
End bp: 507642
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 70%
IMG OID: 641560948
Product: HipA domain-containing protein
Protein accession: YP_001682097
Protein GI: 167644434
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID:
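
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked against each other with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(506383, 507642)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1260 bp) and Protein Length (419 aa).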


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGG CGCCGGGCGA GCCTCTCGCC ATCAACCTCA TCTTTGACGA AAGCCAACCG 
CCCATGCCAA CGGCGCGCCT AGCCATGGCC AAGGGGTTGG CCCAGTTGGA ATGGTCGCCC
CAGATCCTGA CCAGCAAGCT GCCGGTCTCG GGCTTGAACT ATCCGCCGGA GCCGGGCCTC
CATGCCGCCC GTCGCCGCGA CTTCGAGGGC TTGCACGGCT TCGTCGCCGA CAGCCTGCCC
GACGCCTGGG GAAGCCTTGT CGCCCGGCGG CGGCTGGCCA AGCTGGGCGT GCGCATCGAG
GATCTCGGCC CCCTTGACCG GCTGGCCCTG GTCGGACGGC ACGGCCGTGG CGCCATGGCC
TTTCTTCCCG ACACGGCGCC GCCGCCCGAG GTCGAGACCC TGGACCTGGA CGCCCTGGCC
GCCGAGGCCT TGGCCGTCCT GGCCGGCGAC GAGAGCGCGC TGGCGGCCAC CCTGGCCACC
TTGGCCAACG GATCGGGCGG GGCGCGCCCG AAGATCCACG TGGGCTTTGA CCCAAACGGC
GCGATCTCCG TGGCCGAGGG CGAGGCCGCG CCAGGCCATA CCGCCTGGAT CGTCAAGTTC
GCCGCCCCCA ACGATCAGCC GGATATCGGG CCTATCGAGG CGGCCTATGC CGCGATGGCC
AAGGCGGCGG GCCTGGATGT ATCCGAGCAC CGGCTGATTC CCGCCAAGTC CGGTCCAGGA
TACTTCGCCA CCCGGCGGTT CGACCGGCCC CAGCCGGGAC GCAGGCTTCA CATGCTTTCC
TTGGGGGGCG CGATCGAGGC GCCGTGGATG CAGCCCTCCT CCTATGACCT CTTCCTGCGG
GCCACCCTGG CCATCACGCG GCATGCCGGC GACCTGGCCG CGGCTTTCCG GCGCATGGTC
TTCAACATCC TGGCGAGCAA TCGCGACGAT CATGTCCGCC AGCACAGCTA CCTGATGGAC
CCGACAGGGG GGTGGCGCCT GGCGCCGGCC TACGATCTGA CCTACTCGGC CGGTCCCGGC
GGTGAACATT ATCTCGACGT CGAGGGCGAG GGGCGCCGCC CGACCCGGGC TCACGTCAGG
GCGCTTGGCA AGCGCCACGG CTACGACAAG GCGACTGTGG ATCGGGTCAT CGAGGAGGTC
GCCGCCGCTC TGGCGGGGTG GCCGGGCTTC GCCGACGAGG CGGGCGTCAC CAGGCTTTCC
AAGACCGACA TCGCCGCGGC CCACGCCGAC GTCGCCGGAT CCTTCTTCGC CGTGCCCTGA
 
Protein sequence
MKLAPGEPLA INLIFDESQP PMPTARLAMA KGLAQLEWSP QILTSKLPVS GLNYPPEPGL 
HAARRRDFEG LHGFVADSLP DAWGSLVARR RLAKLGVRIE DLGPLDRLAL VGRHGRGAMA
FLPDTAPPPE VETLDLDALA AEALAVLAGD ESALAATLAT LANGSGGARP KIHVGFDPNG
AISVAEGEAA PGHTAWIVKF AAPNDQPDIG PIEAAYAAMA KAAGLDVSEH RLIPAKSGPG
YFATRRFDRP QPGRRLHMLS LGGAIEAPWM QPSSYDLFLR ATLAITRHAG DLAAAFRRMV
FNILASNRDD HVRQHSYLMD PTGGWRLAPA YDLTYSAGPG GEHYLDVEGE GRRPTRAHVR
ALGKRHGYDK ATVDRVIEEV AAALAGWPGF ADEAGVTRLS KTDIAAAHAD VAGSFFAVP
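
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11 (which are the same as the standard genetic code), reads the sequence in frame from its first base, and stops at the first stop codon. The `translate` helper is hypothetical and is not part of any database tooling; it ignores the spaces that group the sequence into 10-base blocks.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-separating whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAGCTGG CGCCGGGCGA GCCTCTCGCC"))
```

Applied to the first three 10-base blocks of the gene sequence, this yields the first ten residues of the listed protein sequence (MKLAPGEPLA).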