Gene Caul_3499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3499 
Symbol 
ID5900954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3776192 
End bp3778693 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content57% 
IMG OID641564005 
Producthypothetical protein 
Protein accessionYP_001685124 
Protein GI167647461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTTG AATTTGTAGG CAGGAACCTC CGTGTTCGTT GGAGTGAGAA CCCACTAAAC 
GCCGGGATTG GCGTTCGCGG ATACCAGCTT GAATTTGACG GTACTGACGG GACGATCACG
CACTTCGTTG ACGCCTCGAA ATTCCAGTCT GAAATCGGCG GCGTTCGCAA CTACGCGGAC
ACGCTGCTCT TCGAAGACCT CCTATCGGTC GGCGTTACGC GCTCTGTCGT GGTCCGCCTT
AAGAGCCAAT CGTCAAGCGG CAAGTTCTCC GCGCCGGTAA CGATCACCGC TCAGAACCCC
GCACCAGAGC TAAGCGCTCC AACGATCACC CCATCATTCA GCGGCTTTGA CGTTCGCATC
ACTGCGCCAA ACGCGCGCGA TCTCGCGGGC TACATCATCG CGGTTGGAAC GTCGTCTTCG
TTTGCACATA CCAACCCCGT CAATTGGGTC CACAACGGAC CAGACACGTC GGTTGTCGTC
GCTCTTCCTG ACACCACCAT TAGATATGTC CGCGTCGCCG CGTATGACGT ATTTGGCACC
GATGCTCTGA TCTGGACGAC CGCTCAATCG GTCATGAAGA TGACAAGCGA TCTCTCAGAG
ATCGTATCCG GTGTCGATGA GCTTCACGAC CAAGTTGCGG TCCTGAATGC CGACGCGATC
ATCAACGCGC AACTACTCGT CGATGCAGCC GAACAGAACA TCAAGACGCA ATTGAACCTC
GACGATCAGG TGGATTACTG GATCGACTTG GGACACCTAG AAGGCATTCC CATCGGCAGC
ATTGTCGAGG AAACGCGCAC CAAGACAGAC GAGCTAGTCG AGGTTCAAAA CCGCTATCTG
GTGAAGACTG CCGATGGCCT GGGAGTGAGC ATCGACCTCA CGAAGGTCAT GGTCGGTCCA
ACAGAGTCGC TATCGCAAAG GCTGAACACC ATCGCGGCCG ATACAGGCGC GGATGTCAGC
GCGGCAATTG AAGACCTCAA CCAAGCCCTG ACAACCAAAA TCAACGCTGA AGCGACAGCA
AGGCAGACTC TTCAAACGGA CTTTGAAGGG AATCTCGCGA GCGCTGTTTC GACGACCAAG
GCCTATAGCG ACAACAAGCT CACGACGACG CTGAACAGCT ACGCGACCCT AAGTCTCGTG
AACGGGAACA AGAGCGCGGC AGATAGCTCG ATCTCGACAC TGACGACGAA CCTATCCGCC
GAAGTGACCG CGCGCATTCA GTTGGCAACG ACCGTTGGAA ACAACAAGTC ATCTGCCGAT
AGCTCGATCT CAACCCTGAC CACGAACCTA GCGGCTGAAG TCACAGCCCG AACCAATCTC
GCAACGACGG TCGGCACGAA CAAGTCGAAC GCAGACGCGC AACTGCTCGT CCTAACGACT
GCAAAGGACG CACAAGCATC GCAGCTTTCC ACCCTTCAAA CGACGGTCGC GGGTCACACG
GCAACCATCG CAACAAATGA CACCGCATTC ACGACGGAGA AGGCCGCACA GGCGACCCGT
AACAGCGTGA TCGATGCGAA GTTCAACGGC ACAACGAGCA GCACGATCTA CACCGCTGCT
CAAGCGGCAG CGACACAGGC ATCGGCTGTT GCGACCACCC TGAACCAAAT GGGCGTGACG
ATCGGTCAAG GCAGCGCCTG GGCCATCGAC AGCAACAAGG TCTCGGTGTC CGCGACGGAA
AGCCTAGCGA CGCGCCTAAC GAGCATCAAT TCTGAGATGG GTACGAAAGC GACACCGAGC
TATGTCAGTG CTCAGATCAG CACGGCTATC AGCACGGCGA CGGGTCCTGG CAGTTCGATT
GCGACCTCGC TGTCGAACTT GTCCTCCACC GTTGGCGGCC AAACAGCGTC GATTACGACC
CTGCAACAGG TCCAGAACGG CAACAGTGCT CTCTACGGAT GGAGCCTTAA TAGCGGCGGT
ATTGCGGTCG GTATGAAGGC CCTGAACAAC GGGTCGGCCG GTACGAATGC GATCATCTTT
TCAACCGACA ACTTCTACGT CAACACGCCC GGCGGCAACT TGCCCCTGCT GGCGATCAGC
AACGGCAGGA TGGTGTTCAC GGGTAACGTG GACATCAACG GCAACCTGAT TGTCAGCGGT
TCGATCACAA CGAACGGCAT CGCAATCGGT GCGGTTTCAA GCACGGTCGC GACTTCAGGT
AACTACAACG GCGGCTTTGG TAACAGCGGG AACACCGCTC AAGTCGCAAC GCTCACGTTG
GTTTCAACCG GCAAGCCAAT CCTGATTTCG GGCATGTATA GCGGCATGTT GGTTTCGGGT
CCGTCATGGA TCAACGCTAC CGGCATCATC ACTCGCAACG GCACGACGAT TCTCGAAAGC
GCCGCTTACG CGCCTCGTAG TGGTCGATAC ACGCTACCAT TCCAGATCGT CGATAATCCC
GGCCCTGGGA CATGGACTTA CAATATCCAC GACACCGTAG GTACGGGCGG TTACAACGCT
TTCTACTTCT ACGCTCTGTC GGCAACGGAG CTAAAAGTAT GA
 
Protein sequence
MIVEFVGRNL RVRWSENPLN AGIGVRGYQL EFDGTDGTIT HFVDASKFQS EIGGVRNYAD 
TLLFEDLLSV GVTRSVVVRL KSQSSSGKFS APVTITAQNP APELSAPTIT PSFSGFDVRI
TAPNARDLAG YIIAVGTSSS FAHTNPVNWV HNGPDTSVVV ALPDTTIRYV RVAAYDVFGT
DALIWTTAQS VMKMTSDLSE IVSGVDELHD QVAVLNADAI INAQLLVDAA EQNIKTQLNL
DDQVDYWIDL GHLEGIPIGS IVEETRTKTD ELVEVQNRYL VKTADGLGVS IDLTKVMVGP
TESLSQRLNT IAADTGADVS AAIEDLNQAL TTKINAEATA RQTLQTDFEG NLASAVSTTK
AYSDNKLTTT LNSYATLSLV NGNKSAADSS ISTLTTNLSA EVTARIQLAT TVGNNKSSAD
SSISTLTTNL AAEVTARTNL ATTVGTNKSN ADAQLLVLTT AKDAQASQLS TLQTTVAGHT
ATIATNDTAF TTEKAAQATR NSVIDAKFNG TTSSTIYTAA QAAATQASAV ATTLNQMGVT
IGQGSAWAID SNKVSVSATE SLATRLTSIN SEMGTKATPS YVSAQISTAI STATGPGSSI
ATSLSNLSST VGGQTASITT LQQVQNGNSA LYGWSLNSGG IAVGMKALNN GSAGTNAIIF
STDNFYVNTP GGNLPLLAIS NGRMVFTGNV DINGNLIVSG SITTNGIAIG AVSSTVATSG
NYNGGFGNSG NTAQVATLTL VSTGKPILIS GMYSGMLVSG PSWINATGII TRNGTTILES
AAYAPRSGRY TLPFQIVDNP GPGTWTYNIH DTVGTGGYNA FYFYALSATE LKV