Gene Caul_5449 details


Gene Information

Locus tag: Caul_5449
Symbol:
ID: 5897131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010333
Strand:
Start bp: 162668
End bp: 165115
Gene Length: 2448 bp
Protein Length: 815 aa
Translation table: 11
GC content: 64%
IMG OID: 641550736
Product: type III restriction protein res subunit
Protein accession: YP_001672222
Protein GI: 167621714
COG category: [S] Function unknown
COG ID: [COG4951] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
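
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 162668
end_bp = 165115

# Assumption: both coordinates are inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 2448
print(protein_length_aa)   # 815
```

Both results match the Gene Length and Protein Length fields of the record.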


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.788934
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATC GGGAAGATCG ACAACGGCGG CTCCAGGAAC GTCTTCGCCA GTTGGAGCAA 
GAGCGGGCGG CGATCGAGGA CGAACTCGCG GGAATGGTCA TGGCCGCCGC TCGCGAGACT
TCGCGCCCCC CGGCAATGGC GTTGCAACAG CCGCGGCAGG ATCAAGCCTT TGACAATCGC
GCCAAGGTTG AACTTTTTCG AAGCCTGTTT CGGGGGCGAA GCGACGTATT CCCGCTGCGT
TGGGAAAACC TGAAGACAGG TAAGAGCGGC TACGCGCCGG CCTGCGCCAA CGAGTGGAAG
CGGGGTCTGT GCGAGAAGCC GCGGATCAAG TGCTCTGTGT GCGCAAATCA GGCTTTCATT
GAAGTCAGCG ACCAGGTGAT CACCCACCAC CTGAGGGGGC AAGGCCCGGG CGGCGCCGCG
TTCGTCGCGG GCGTCTACCC GGTTCTACCG GACGACACCT GTTGGTTCTT GGCGGCCGAC
TTTGATGAGG CGGAATGGCG ACGGGATGTG AAAGCCTTCG CCGAAACCTG CCGCGCCTGG
GATGTGCCTG TCGCCATTGA ACGATCACGC TCCGGCAACG GCGCCCATGC GTGGATCTTC
TTCAGCGAGC CGATTTCGGC CTCGCTGGCC CGACGCTTGG GATCGGCTCT GATCACCGAG
ACCTTGGACC GGACGCCCGA CATCGGGTTT GCGTCCTATG ATCGCTTGTT TCCCAGCCAG
GATACCGTCC CAAGCGGCGG CTTTGGCAAC CTCATTGCCC TGCCGTTGCA GGGCTTGGCG
CGCCAGGCCG GCAATAGCGT GTTCCTGGAC GATGACCTCG ATCCCTACGA CGACCAATGG
AGGTGCCTGG CTGGCGTCCG GCGTCTCAAA CGCGACACCC TGGAGGCCCT GGTTGATGCC
GCCAGCGCAG CCGGTCGGAT TCTTGGGGTC AGGATCCCTG TCGATGACGA TGATGAGGAG
CCGTGGCTGG CCCCGCCCTC GCGCCGGCGA ACGCCGCCGG CGATTGCCGG GCCCCGGCCG
AGCAATCTCA CGATGGTCGT CGCAGACCAG CTCTATATTC CGCGCAGTGG TCTGCCGTCT
GGCCTGGTCG CGCGCCTGAT ACGGCTGGCG GCGTTTCAAA ATCCTGAGTT CTACGCCGCC
CAGGCGATGC GGTTTGCGAC CCACGACAAG CCACGGATCG TATCGTGCGC GGAGCTGACC
GTGAACCACA TCGGTCTGCC GCGGGGCTGC TTTGATGTGG CGATGGACCT GTTCGCGTCG
CTGGGCGTCG CGGTGGAAAT CGAGGATCAG CGCCGTCGTG GCGCGGCGAT CAACATTTCA
TTCAGTGGCG TATTGCGACC GGACCAGGAG CTGGCGGTCG ATGCGCTGCT GCCGCACGAC
ATCGGCGTGC TCGCCGCGAC GACGGCTTTC GGGAAGACCG TGGTGGCGGC GCGGATGATC
GCGGAGCGCG GGGTCAACGT GCTCGTTCTG GTCCATCGTC GCCAGCTGAT GGACCAGTGG
GTGGAGCGCC TCGGCGCGTT TCTCAACACC GCGCCAGGGA TGATCGGCAA AATCGGTGGC
GGCAAACGCA AGCCCTCGGG CCTCATCGAC ATCGCCCTTA TCCAGAGCCT GGTCAGAAAA
GGGGAGGTGG ACGATATCGT GGGCGACTAT GGCCACCTCA TCGTCGATGA ATGTCACCAT
CTTTCGGCCG TTAGCTTCGA GCAGGTCGCC AGGCGGACAA AGGCCCGCTA CGTCCTTGGG
CTATCGGCGA CGGTGACCCG GAAAGACGGC CACCATCCGA TCATCTTCAT GCAGTGCGGG
CCGGTTCGAA AACGTGTCGA TGCGCGCGCC GAGGCGGCGA GACGTCCGTT CGATCATCAC
GTCCGGATTC GGCAGACGGC GTTTCGGCTG CCAGACAGCG AGGCGAACGC GGCGGCGGTT
CCGATCCAGG ACGTCTATCG GGCGCTCGCC GGCGACGAGG GCCGCAACGA ACTGATCTTC
AATGACGTGT TGGCTGCGTT GGAAGCTGGG CGCTCACCGG TCGTCATCAC TGAGCGGACG
GATCATCTGG AAGCGCTGGC CGATCGGCTT TCGCGCTTCG CCAAGAACGT TATCGTTTTA
CGTGGCAGCC AAAGCGAGCG GAAACGGCGA GAGGCGATGG AGCGCCTCGC GGCGATTCCG
GAGCAGGACG AGCGGGTGAT CGTGGCGACC GGTCGCTACC TCGGGGAGGG CTTTGATGAC
CAGCGCCTGG ACACGCTCTT CCTCACCATG CCGATCGCAT GGAGGGGAAC CTTGGCGCAG
TATGCCGGTC GTCTTCACCG GCTTCATGAC CCCAAGCGGG AAGTGGTCAT CTATGACTAC
GTCGACCGCG ACGTCCCGGT GCTCGCCCGT ATGGCGGCCA GACGCGCCAC AGGGTATTCG
GGGATCGGCT ATACGACCGT CCAGAGGCCT GGATTGTTCG ACCGATAG
 
Protein sequence
MSDREDRQRR LQERLRQLEQ ERAAIEDELA GMVMAAARET SRPPAMALQQ PRQDQAFDNR 
AKVELFRSLF RGRSDVFPLR WENLKTGKSG YAPACANEWK RGLCEKPRIK CSVCANQAFI
EVSDQVITHH LRGQGPGGAA FVAGVYPVLP DDTCWFLAAD FDEAEWRRDV KAFAETCRAW
DVPVAIERSR SGNGAHAWIF FSEPISASLA RRLGSALITE TLDRTPDIGF ASYDRLFPSQ
DTVPSGGFGN LIALPLQGLA RQAGNSVFLD DDLDPYDDQW RCLAGVRRLK RDTLEALVDA
ASAAGRILGV RIPVDDDDEE PWLAPPSRRR TPPAIAGPRP SNLTMVVADQ LYIPRSGLPS
GLVARLIRLA AFQNPEFYAA QAMRFATHDK PRIVSCAELT VNHIGLPRGC FDVAMDLFAS
LGVAVEIEDQ RRRGAAINIS FSGVLRPDQE LAVDALLPHD IGVLAATTAF GKTVVAARMI
AERGVNVLVL VHRRQLMDQW VERLGAFLNT APGMIGKIGG GKRKPSGLID IALIQSLVRK
GEVDDIVGDY GHLIVDECHH LSAVSFEQVA RRTKARYVLG LSATVTRKDG HHPIIFMQCG
PVRKRVDARA EAARRPFDHH VRIRQTAFRL PDSEANAAAV PIQDVYRALA GDEGRNELIF
NDVLAALEAG RSPVVITERT DHLEALADRL SRFAKNVIVL RGSQSERKRR EAMERLAAIP
EQDERVIVAT GRYLGEGFDD QRLDTLFLTM PIAWRGTLAQ YAGRLHRLHD PKREVVIYDY
VDRDVPVLAR MAARRATGYS GIGYTTVQRP GLFDR
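
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the gene sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First row of the gene sequence, copied from the record.
first_row = "ATGTCTGATC GGGAAGATCG ACAACGGCGG CTCCAGGAAC GTCTTCGCCA GTTGGAGCAA"
dna = clean_sequence(first_row)

print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence through the same two functions should give a value close to the GC content reported in the record, although the record does not state how that figure was rounded.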