Gene Caul_4817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4817 
Symbol 
ID5902279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5215699 
End bp5216973 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content64% 
IMG OID641565337 
Producthypothetical protein 
Protein accessionYP_001686435 
Protein GI167648772 
COG category[S] Function unknown 
COG ID[COG5338] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03016] uncharacterized protein, PEP-CTERM system associated 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGCA ATACGCTTTG GTTCGTCGGA ACATCGGTTT GCTGTCTGGT CGTCGCGGCT 
TCAGCGCAGG CGCAACAGGT CGGCGAGCAG TTCCGCCGTG ATCATAATGT CAGCGTTCGC
GATCGTCCGC GCCCTGAGCT AGAGCCTCTC CGCATTCGGG CCGGCGCCTT CACCCTCGCG
CCCAAGGCCA CCGCGTCCGT GACTACCGAC GACAATATCT ACGCGGCCAA CGCCAACGAG
ACGGACGACG TGATCGGCAC GCTGATTGGC GAGATCAACG CCACGACCAA CTGGTCGCGG
CACGACGTCA CCGCCAACGT GAAGCTGCAG CACGACGACT ATCAGGACCA TTCGGAGGAG
AATTCGACCA CCTACGGCGC CAACGTGGCC GGTCGACTCG ACGTCATGCG CGATTTCGCC
CTCATCGGCG GCGCGCGTTT CGAGCACGAT GTCGAGGCGC GCACCGCGTC TGGCGTCGCC
TCCACCCTCC TGGCCAAGCC GGTGCGCTAC GACCTGAGCG GTTTCGACCT CGGGGGCGCC
CGGGCGTTCA ACCGCCTCCG GGTGACGGCG GCCTACGCCT ATCGCGACAT CAATTACGAC
GACGCGCGCG ACCAGAACAA TACGGTGGTC GACCAGGACT ATCGCGACCA CTCCACCGAG
CGCGCCTCGC TGCGCGGCGA CTATGCGGTG AGCCCCAACT CCGCGTTCTT CGTGAAGGCG
GAGGGCAATC GCCGGCGCTA TGACAGCGCT TCGTCCGGCG GCTTCAATCG CGACTCGGAC
GGCTACCAGA TCACGACGGG GGCCGACATC GACCTCAGCG GCCTCGTGCG CGGTCAGGTG
CAGGTCGGCT ATCTCTCGCA GGACTTCGAA GATCCCACGA CGTCCGACAT CTCGGGCTTC
GCCGCCTCCG GCGAGGTCGA ATGGTTTCCG ACCCAGCTCA CCACCGTCAC CTTCCGCGCC
GCGCGCGAAG TGCAGGACAC CGGCCTGATC ACCAGCCCCG CCGCCCTGAG CACCACCGGC
GGCGTGCAGG TTGATCACGA ACTCCTCCGC AACCTGCTGC TGACTGGGCG ATATGAGTAC
AGCACCAGCG ATTATCAGCG GATCGATCGC AAAGATGACC GCTCCGCGGC CATCTTTGGC
GTTAACTATC TCGCGAACCG ACACTTTAAC GTTCAGCTGT TCTATTCGTA TCTGAAGCAG
TCATCGAGCG GCGCGCAGCC AGGTGTTGAC TACACCGTGA ACCGGCTGTC GGTCGCGCTG
GTAGCCAAGT ACTGA
 
Protein sequence
MVRNTLWFVG TSVCCLVVAA SAQAQQVGEQ FRRDHNVSVR DRPRPELEPL RIRAGAFTLA 
PKATASVTTD DNIYAANANE TDDVIGTLIG EINATTNWSR HDVTANVKLQ HDDYQDHSEE
NSTTYGANVA GRLDVMRDFA LIGGARFEHD VEARTASGVA STLLAKPVRY DLSGFDLGGA
RAFNRLRVTA AYAYRDINYD DARDQNNTVV DQDYRDHSTE RASLRGDYAV SPNSAFFVKA
EGNRRRYDSA SSGGFNRDSD GYQITTGADI DLSGLVRGQV QVGYLSQDFE DPTTSDISGF
AASGEVEWFP TQLTTVTFRA AREVQDTGLI TSPAALSTTG GVQVDHELLR NLLLTGRYEY
STSDYQRIDR KDDRSAAIFG VNYLANRHFN VQLFYSYLKQ SSSGAQPGVD YTVNRLSVAL
VAKY