Gene Caul_3804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3804 
Symbol 
ID5901266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4124317 
End bp4125441 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content60% 
IMG OID641564326 
ProductTIR protein 
Protein accessionYP_001685428 
Protein GI167647765 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.213149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0124229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGGAA TGGACTTGAG GCGAGTTCTA ACGGTCGGGG TCACTTACAC CGGCGAGAAG 
ATTGAAGGCG TCGAGATTGA TAATCTCGGG CTGTGCCAGC CCGGCGTGAA TCAGGAGGCG
GCTGCCTTTC CGCTGTACGA ATACGACACC ATCATCATCA ATCCGAAGAG CTTCACGCAT
TTCCTGTTCG GGCAGGCGGG CGAGTTTTCG AACGAGCCCT ACGAGCTGGG AAAGCTCAAA
GGCCAGAACG AGCACTACGA TTTCGATTCC GCCTTCTATG CCGATGATCG GCAGAAGGAG
ATGGAAGCGG CGCTCGCGGC GGGCGCCACC GTGGTCTGGT GCTTGTCGGA TCCAAAACGG
GTGAACTTCT TCGGCTATCG CGAAACCCAT TTGGGCTACG CAGCGCCGAA GGTGGCCGCG
CTCGTGAAGC GTTCGACGCT TTTGGAGAAG AAGGGGCGGA AGATGGGCGC GGTCGATCCG
GACAGCCCAT TCATGCGCTA TTTCGACGTG CTATCCCGCA CGGGCTGGAC GTTTTGCCTC
TCAGACCCGG CCGACGGGAT TACGTCGATT GCCTCGACGC CGGAGGGTTA CAGCCTTGGC
GGGCGCGTGG TGCTCGGCAC GACGGTCGGA TGGCTGCTGA CGCCACCAAC GTCGCAAGAC
GCCGAGAACC AACTGGTGAT CGATAGCCTG GCGCTTGAGA AGGCCGATCC GGCGCATGAA
AAATATCATG GCATCTTCCT GAGCCACACG GGCATCGACA AGCCTTTCGT GCGCCGCCTG
CGCGATGATC TCCTTGCCCA CGGCGTGCCG CGGGTCTGGC TGGATGAAGC CGAAATCGAC
ATCGGCGATT CGCTCATCGC CAAGATCGAT GAGGGCATGA AGCTCAGTCG TTACATCGCT
GTCGTGCTGT CGACCAAATC GATCGACGCG CCTTGGGTGA AGAAAGAGCT CGACGTAGCG
ATGAACCGGG AGATCGCTAG CGGCCAAGTC GTCGTGCTGC CGCTGCTCTA TGAGGCCTGC
GAACTGCCGG AATTCCTGAA GGGAAAGCTG TACGCCGACT TTTCTAAGCC GGAGGACTAT
GAGGCGGTGC TGGCGAAGCT TCTCCGGCGG CTGCGGATCG CCTGA
 
Protein sequence
MRGMDLRRVL TVGVTYTGEK IEGVEIDNLG LCQPGVNQEA AAFPLYEYDT IIINPKSFTH 
FLFGQAGEFS NEPYELGKLK GQNEHYDFDS AFYADDRQKE MEAALAAGAT VVWCLSDPKR
VNFFGYRETH LGYAAPKVAA LVKRSTLLEK KGRKMGAVDP DSPFMRYFDV LSRTGWTFCL
SDPADGITSI ASTPEGYSLG GRVVLGTTVG WLLTPPTSQD AENQLVIDSL ALEKADPAHE
KYHGIFLSHT GIDKPFVRRL RDDLLAHGVP RVWLDEAEID IGDSLIAKID EGMKLSRYIA
VVLSTKSIDA PWVKKELDVA MNREIASGQV VVLPLLYEAC ELPEFLKGKL YADFSKPEDY
EAVLAKLLRR LRIA