Gene Caul_4631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4631 
Symbol 
ID5902093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5009634 
End bp5011343 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content71% 
IMG OID641565150 
Producthypothetical protein 
Protein accessionYP_001686249 
Protein GI167648586 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.457948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATC AAGTCCATTA CGAGCTGTTC GTGCGCCGCA AGCCCGGCGC GCAATGGACG 
CTCGACATGG CGACGGAGGT CCGCACCCAC GCCCTGCAGA CCGCCGAGGA GGCGCTGGAG
CAGGGGCGAG CGATCGCCGT GCGGGTCAGC AAGGAAACCC TCGACAACGA AACCCGCGAA
TACAAGTCGA TCTCGATCTT CACCAAGGGC CAGGTCGACG GCGGCAAGGC CAAGAAGGTG
CAGGAAGACC TGGACCCGCT GTGCGTCCAG CCGTCCGACC TCTACACCGC CCACGCCCGC
GACCGCATCG GCCGGCTGCT GGAAGGCTGG CTGGCCCGGC ACAACGCCAC CCCGTTCGAG
CTGCTGCACC GTCCCGACCT GGTCGAGAAG CTGGAAGCCT CCGGCACCGA CCTGCAGCAC
GCCCTGCAGA AGATCGCCAT CCCCGAGGCC GAAGCCCGGG GCATGTCGGT CCACGAACTG
ATCCGCACGT TCCAGGGCCT GGTCGAGCGC ACCGTCGCCA ACCTGATGAA GGCCTTCAAG
AAGGGCGCCC TGCCCGATCT CGACACGGAA GGCTTCGCCC GAGCCGCCGA GCGCCTGTCC
ACCGATCCCG ACCGCGCCTT CCTGCTGGGG GCCGGCGTCG CCGCCTCGAT CGCGCCCGGC
AAGAACTGGT CCGAGAAGAT CGCCCGCCTG GTCGATCTGG CCGACGCCGC CCCGACCGAA
CCCAAGGCCC GGGCCGCCGC CCTGGCCGCC ATCGAGACGC CCCTGGCCGA GATCATCGGC
TCCAAGGCCG GCATGGCCGA CCTGCTGGGC GCAGGCGACG CCGACCTCGG CACCACCCTG
GCGGCCATGA CCCGCCTCGC CGGCGGGGCC CAGGTCGAGG GCCTGATCCG CGTGGAGGCC
GGCGTGCGGC ACTGCATGCC CGAGCTGTCC GGCACGGCCA AGCGGCTGAG CGAGTGGCTG
AGCGGCGAGG ACTTCCCCGC CGTCCGCGCC TCGATCGCCC ACCGGGTGCT CAAGGAACTG
AACGGGGTGC GCCGCCTCAA GCCCTCCGAC GCCGAGGCCG AGATCGAACA CCTGCGCGCC
CTGGCCATGA GCCTGACGGC CGCCGCCGGC CGCATTCTTC CCGCGGAAGA CATCACCAGC
GCCTTCACCA CCCGCTCCAA GACCCTGCTG AACGGCGAGT TCATCGAAGC CCTGCTCGGT
CGCGACCGCT CGTCGCGCGA GGAGATCCAG ATGCTGATCC GCCTGGCCGA GAACGTCATG
GGCGCGGTCA ACAAGCGCAT GGCCGCCCGC TGGCTGTCGG CCAACGTCCT GGCCCTGCGC
TTCGAGCGCG AACTGCGCCA GGGTCCCGAA TCGCCGGCCG CCAAGCTGGC CGCGCTGGCC
ACCCTGCAAA AGTCCCTGGT CCGCTCGGGC CTGGTGGTCG AGGACTACCA GCCCCTGTGC
GCCCGGCTGG GCGAGGTGGG CGGCATGATC GAGGCCGACG CCCGCCTGAT CGCCATGCTG
GTCCGCGCCC CCGCGCCCCT GCCCCAGAAA CTGTCCCTGC TGATCAAGCT GGCCATGGGC
GACGCCGGCC CGACCGGACC GGTCGCCGAC AAGGCCAAGC TCGAGGCGCT GAAACTGGCC
CGCGCGCCCG AAGCCCGCGA ACAGCTGGCG GGCTCGCCGG AGACCATGGA CCTGCTCAAG
GGCATGGTTC AGCAGAAGGC GGCGGCTTAG
 
Protein sequence
MSDQVHYELF VRRKPGAQWT LDMATEVRTH ALQTAEEALE QGRAIAVRVS KETLDNETRE 
YKSISIFTKG QVDGGKAKKV QEDLDPLCVQ PSDLYTAHAR DRIGRLLEGW LARHNATPFE
LLHRPDLVEK LEASGTDLQH ALQKIAIPEA EARGMSVHEL IRTFQGLVER TVANLMKAFK
KGALPDLDTE GFARAAERLS TDPDRAFLLG AGVAASIAPG KNWSEKIARL VDLADAAPTE
PKARAAALAA IETPLAEIIG SKAGMADLLG AGDADLGTTL AAMTRLAGGA QVEGLIRVEA
GVRHCMPELS GTAKRLSEWL SGEDFPAVRA SIAHRVLKEL NGVRRLKPSD AEAEIEHLRA
LAMSLTAAAG RILPAEDITS AFTTRSKTLL NGEFIEALLG RDRSSREEIQ MLIRLAENVM
GAVNKRMAAR WLSANVLALR FERELRQGPE SPAAKLAALA TLQKSLVRSG LVVEDYQPLC
ARLGEVGGMI EADARLIAML VRAPAPLPQK LSLLIKLAMG DAGPTGPVAD KAKLEALKLA
RAPEAREQLA GSPETMDLLK GMVQQKAAA