Gene Caul_4143 details


Gene Information

Locus tag: Caul_4143
Symbol:
ID: 5901605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4495611
End bp: 4496672
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 72%
IMG OID: 641564664
Product: hypothetical protein
Protein accession: YP_001685765
Protein GI: 167648102
COG category: [S] Function unknown
COG ID: [COG4246] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
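
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Length in aa of the product, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(4495611, 4496672)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Both results agree with the Gene Length and Protein Length values listed in the record.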


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.980511
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.981417
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCGGAC TGCGCGGCTT TCTCGTCCTG GGCCTGGCCT CGCTGGCCCT GGGCCAGTGC 
GCCAAGGCCC CGCCGCAACC GGCCGTGCTG CCGGTGGCCC CGGTCAAGGT CGGGCCGGAG
ATCGGCCTGG TCACGACGCC TGTGCCGCTG AGTTCGGCCA ACCCGCCGCC CGTCGCCCTG
GGCCGCTTCG TCTATGCCGG CGGCGTCGCG ATCAGCAGCC CCGACACCAC CCGCCTGCAC
GGCCTGTCGG ACCTGAAGTT CGGCCCCGAC GGCGCCCTGG TCTCCGTCAC CGACGACGGC
GACCTGTTCG AGGCCCGGTT GAAGCTGGAC GACACCGGCC GCCTGGTCGG CCTGACCGAC
GGCAAGCTCT CGCCGCTCAA GGGCCTGGAC GGCCAGCCGC TGCAGGGTAA GGTGCAGTCC
GACGCCGAGG GCCTGGCGTT CCTGGCCAAT GGCGACCGGC TGGTCAGCTT CGAGCGCGAT
CACCGCATCT GGCTCTATCT GCGCCAGAGC GACGGAACCT ACGGCCTGCC GCGCGCCGTC
AACAAGCCGG CCACCACCTT CCCCGACAAC GAGGGCATGG AGGCCCTGAC CGCCTATCCG
ATCGCCGGGC CGGACGCCTA TCTGGTGGGC GGCGAGGAGG GCGAGGTGTG GCTGTGCAAG
GTCTCGGCGC CGTGTGCGAG CGTGACGCCG CAGTCGCCGC CCGACTTCAC CTGGGGCCTG
ACCAGCTTCG CCGCCTTCGA GGGCCAGGCG GTGGCCGCCC TCTATCGCAG TTTCGATCCG
GTTCGCGGCT GGCGCGGCCA GGTGCGGTTC GTCGTCGACC CTCGCGCCCC CGCCGCCAAG
CAGGTGGTGG CCGCGACGCT GAACCTGGAC GGGGCGACCA CCCGCGACAA TTTCGAGGGG
ATCGCCCTGT CGCGCAGTCC GTCCGGCGCG ACGCGGCTCT ACATCCTGTC GGATGACAAC
GACACCAGCT TCGAGCGGAC CCTGCTGATG GCCTTCGACT GGACCGCTCC GCCGCCCCCG
CCGCCGGCTC CGGTGAAGAA GGCTCCGGCG AGGAGACGGT GA
 
Protein sequence
MIGLRGFLVL GLASLALGQC AKAPPQPAVL PVAPVKVGPE IGLVTTPVPL SSANPPPVAL 
GRFVYAGGVA ISSPDTTRLH GLSDLKFGPD GALVSVTDDG DLFEARLKLD DTGRLVGLTD
GKLSPLKGLD GQPLQGKVQS DAEGLAFLAN GDRLVSFERD HRIWLYLRQS DGTYGLPRAV
NKPATTFPDN EGMEALTAYP IAGPDAYLVG GEEGEVWLCK VSAPCASVTP QSPPDFTWGL
TSFAAFEGQA VAALYRSFDP VRGWRGQVRF VVDPRAPAAK QVVAATLNLD GATTRDNFEG
IALSRSPSGA TRLYILSDDN DTSFERTLLM AFDWTAPPPP PPAPVKKAPA RRR
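
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch with hypothetical helper names, not part of the record itself: it strips the whitespace and computes the GC fraction, which can then be compared with the GC content field once the full gene sequence is pasted in.

```python
def clean_sequence(blocked):
    """Join a sequence printed in whitespace-separated blocks across lines."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Only the first line of the gene sequence is used here; paste all lines
# to check the full 1062 bp gene.
first_line = "GTGATCGGAC TGCGCGGCTT TCTCGTCCTG GGCCTGGCCT CGCTGGCCCT GGGCCAGTGC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this line only
```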