Gene Caul_5230 details


Gene Information

Locus tag: Caul_5230
Symbol:
ID: 5897323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 154022
End bp: 158029
Gene Length: 4008 bp
Protein Length: 1335 aa
Translation table: 11
GC content: 67%
IMG OID: 641555333
Product: hypothetical protein
Protein accession: YP_001676664
Protein GI: 167621879
COG category:
COG ID:
TIGRFAM ID:
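
The start, end, and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(154022, 158029)
print(length, protein_length_aa(length))  # prints: 4008 1335
```

Both values agree with the Gene Length and Protein Length fields of this record.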


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.211402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACT ACATCCCAAG AACCCTGACC TACACCGACG CCAGCGGCCA GGAGCATTGG 
CTGGGCGAGG ATCAGATCGG CCAGGCCACC GCGCCGACCG TTGTCCTGGG CGAGCCAGGC
ATGGGCAAGA CGGCGTTGAT GGCCAAGCTT GGCGACCGGC CGGGGTTTCA CTACGTCACC
GCGCGCGGCC TGCTGAGACG CACGCCCGAC CTCGCCGCGG ACGAGGTGCT CGTCATCGAC
GCCCTCGACG AAGTGGCCGC CGCCCGCGAC GAAGACCCGG TGCACCAGGT CCTGGCGCGT
TTGGTCGCCT TGGGCTCTCC GCGTTTCGTG CTGTCCTGCC GGGCCGCGGA CTGGCGCGGA
GCGATCGCGC GCCATGATAT CACAGAAGAT TACCGTCAGC CGCCGCTGGA GCTGCGGCTG
ACGCCGCTGA CGCGTCAGAA CGCGCTGGAT TATTTGACGG CGACCTTGGG CGAGGCCAAG
GCCAAGGCGG TCATCGACGA GCTGGAAAGG CGCTCGTTAG AGAGCCTCTA CGGCAATCCG
TTGATGCTGA TCCTGGTCGC GCGTCAGGCC CAGACAGGCC CCCTGCCCCC GACCCGGGCC
GAGCTGCTCG ACCAATCCTG CCGGTTGTTG GGCGCAGAAC AGAACGCGCT TCACGGCCGC
TCCACACTGG CCAACCTCAG TCCAGAGGTC GCCCTGTCGG CGGCTGGCGC GGCGTTCGCC
GCCTTGCTGC TGACAGGCTC CGAAGCGGTC ACGCGTGACC CGCCTAGCAC CCGGCTGCCG
AGCGATTTGC ATTTGGCCGA GCTAGCCGAG CTGCCGGACG CCGGCGCGCT CGACACGGTA
TTGCGAAGTA ATCTCTTTCG GCCGCTTAGC GGCGAGGCCG ACCGGCACGT TCCCCTCCAT
CGCGCCGTTG CCGAATTTCT CGGGGCGCGC TGGCTGGCGG CTCGCATCGA CGCGGTGTCG
CTGCGCCGTG TCTTGGGGTT GATGACGCTA AACGGCGGTG TCCCGGCCTC CCTGCGTGGC
CTGCACGCCT GGCTCGCCCA CTTCAGCACT CAGGCGGCTG CAGCGGTCAT TGAAGTCGAT
CCTTATGGCG TGCTGCGCTA TGGCGACGCG GACCACCTCA GCCCCGCCAA CGGACGGCGT
CTGCTGGACG CGCTGGGGCG ACTGGCCTCG CAGAACCCTT GGTTCAGATC GGGAGACTGG
GCGCGCCACT CGGCGGGCGG CCTGGTCCAG CCGGCCTTGC TCCCCGATAT CCAGCGCCTT
TTACGCGCCC CAGACACAAG CTTTCAGCTC CGCTCGATCC TGCTAGACGC GGTGCAAGGC
TCGTCGATCG CCCAGACACT GGCGCCGGAT CTATGGGTGA TGGTCGGCCC CCAATCGGGT
CATCGCTTTC ACTTCGCCGA ACGCTCAGCG GCCGCTGACG TGCTAATCGG CATGCGCGAT
CCCAACCACG ACTGGCCTCG GCTGGTCGAG GACCTACACG GCTTGCCGGA CGAAGACGAC
CGGCGTCTGG CGATCGAGCT GATAACCGAT GTCGGCCCTG AACAGTTCGA CGCTTCCTTG
GTCGTCCGCG CCGTTTTGGC GTTCTTGGGG CTCTTGCCTG ACGCCGCCCA GCCGTTGGAG
CCGATCGACA CCATTGGACC CCTGCACTTG CTGGCGCTCA AGATGTCGGC CGATCAAAGC
CAGGCGGTGT TGGACGGGCT GGTCGGCCGC GTTCCGTCCA AAAAAGGCTT GGAATGGGAG
GTGCGCTACG CGCTGACCGA CTTTGTCGAC CTGCTGATCG CTCGCCGGCT GGAAGCCGGA
ACGCCGGAGC CCTTGAGTCT CCTGGCCTGG CTGCGGCTGG CCAGGTCGCG AGAAAGCCGT
TCAGGCGACT GGCAAAAGCC CATCAATCAG TTCCTGCGCG ACAACAACAA CGTCCGCCGC
GCGATCCAGC ATCACGTTTT CCTGGTCGAA ACCGATCATG AGCACGTTTG GGGCCGTTTC
TGGCGGTTGA GCGAGATCGA CGCGGGGTTT CGCCCTACAC CTGACGATGT CGCCGCGCTA
CTCGCCTCGC CAGCGCTGAG CGATCGATCC AGGCCCGAGG TTCGCGAGGC CTGGCGCGAC
ATTGTTCGGC TTGCCGCCCG GCCGGAGGGC GTACCAGATT CTGTCTTGGC GGCGGCCGAG
CGGTACGCGC AGGGCGATGG CGAACTCGAA AGCTACTTGC GCGCGCTCGT AGATCCGCCC
GTTCCGGAGT GGCAGCTAAA AGATGAGGCT CGCAAGAGAG CCGACTCCGA GGCGCGCGAG
ATCCGGTGGG CAACCCATCG TACGGACTTT GCGGCGAATA TCGAAAAGGT CCGGGTCGGC
GAGTTGGGCT GGATCGTGCC GCCGGCCCGC GCCTATTTCG GGCGTTTCAA CGACATGGAC
GATGCGCTGT CGCCGCCAGA CCGTATCGGC GCTTGGCTGG GCGAGGCGCT GGTCGAGCCG
GTGCTGTCGG GGCTCGAGGC GGTCTTGCAC CGCCAGGACC TGCCCGGCGC CCAGACCATC
GGGGAAAGCT ACGCGCAGTC CCGGCGCTGG AACGTGATCG AGCCCATTCT GGCGGCGGTT
TGCGAGCGCG CACGACTGGA TAAGGGCCTT GGCGATCTTC CAGACGATGT CCTGCTCTCG
GCTCGCCTTG GTCTGGTCTA CGAGCATGTC GAGGAAAAGT TCGGCGCGCC TCAGGTCGAG
GCGCTTTTGG ACACGGCCCT GCGGTCGCGC CCGGGACTGT ATGAGCGCTA CGCCAGGATG
CTGGTCGAGC CGCAGCTTGA GGCCCGATCG AGCCACGTGC GCGGGCTCTA TCAGCTCACC
CGACCCAAGG GCGCCGATCC GATCATCACC CAGCTGGCTA TCGAATGGCT GGCCAAGTTT
GCGCATCAGG ACCTTTCAGC CGAACATGAG CTCGTCGGGT ATCTGGTCCG GGTCGGCGCT
TGGGATGCGC TACGACAAGT CACGCAGGCG CGAAGGGCTG CCGGCTTCGA GGATGACGAA
CAGCGGCTGG CTTGGTTGGC CACCGCCTCG CTGGTCGATT TTCAGGCGGC GCGGACGGAC
TTGGACGTCG CGGCCAAAAC CCACCCCAGC CTCCTTTGGC TGATCCGCAA CCGAGGCTGG
CCTGATCGTT CGGCGGACTG GTCGCCGCAG TATGGCGAGC CAGCCCGCCT CCGGTGGATC
GTCGAAACGT TCCGCGAGCG CTGGCCCTCC GTAAGCCATC CGACCGGATC CTCGACCGGC
GATCAGAACG ACTGGGACGC CACGGAATTT CTATCCTCGG TCATCCGGCG ACTGGCGGCC
GACACCTCCG ACGCCGCGGT CGCCGCGCTC GTGGCGCTGC GCGACGCGCC GGTCGATGGC
TACACGACGC TGCTCAAGAA CGCGTCCGTC GAGCAATGGC AGGCCAAGCT CGAGGCCGAA
TTCCAGCCGG CCAATCTGGA TCAGGTTTTC AATGTGGTTC GGGTTTTGCC GCCGCGCACC
ACGTCCGATC TTCAGGCTAT CACCCTGGAT GTTCTGGACC GCTATCAGGC CCGGCTGCGC
GGCGACGACG TGGGACGGGT CAAGCTCTTC TACGGCCCGG ACGGACCGCT TGACGAAAAC
ACCTGCCGCG ATCGGTTGGC CATGCAGCTG GCCGATCAGA TGCCGTTCGG CCTCGAGCTG
GTGCCCGAGC GCCAGATGCC CGAGGGGCGG CGCGCGGATC TCGTCGTGGC CCTAGGCGGG
CTGCAGCTGC CGATCGAGGC CAAGGGACAA TGGAACCGGC AGCTCTGGAC GGCCGCCGAC
CAGCAGTTGG ACGCCTTTTA CGGCAAGGAC TGGCGCGCCC AGGGTTTCGG CATCTACTTG
GTGTTCTGGT TCGGCGCGGC CATGCCCCGG CCGCGCCATC TTCAGGCTCC CCCCGCGCCG
ATGACCCCGC CCGCCACCGC CAAGGCTCTC AAGGTGGCTC TCGTCGAGCA GATCCCCGAA
GCCCGGCGCG GCGCGATCGA GGTCGTGGTC TTGGACGTGA GCCGTTAG
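
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be normalized before any computation. As a minimal sketch, assuming the sequence has been pasted into a string, the GC fraction could be computed as follows; `normalize_sequence` and `gc_fraction` are hypothetical helper names.

```python
def normalize_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = normalize_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above: 11 of 20 bases are G or C.
print(gc_fraction("ATGGCCGACT ACATCCCAAG"))  # prints: 0.55
```

Applied to the full 4008 bp sequence, the result would be compared against the reported GC content of 67%.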
 
Protein sequence
MADYIPRTLT YTDASGQEHW LGEDQIGQAT APTVVLGEPG MGKTALMAKL GDRPGFHYVT 
ARGLLRRTPD LAADEVLVID ALDEVAAARD EDPVHQVLAR LVALGSPRFV LSCRAADWRG
AIARHDITED YRQPPLELRL TPLTRQNALD YLTATLGEAK AKAVIDELER RSLESLYGNP
LMLILVARQA QTGPLPPTRA ELLDQSCRLL GAEQNALHGR STLANLSPEV ALSAAGAAFA
ALLLTGSEAV TRDPPSTRLP SDLHLAELAE LPDAGALDTV LRSNLFRPLS GEADRHVPLH
RAVAEFLGAR WLAARIDAVS LRRVLGLMTL NGGVPASLRG LHAWLAHFST QAAAAVIEVD
PYGVLRYGDA DHLSPANGRR LLDALGRLAS QNPWFRSGDW ARHSAGGLVQ PALLPDIQRL
LRAPDTSFQL RSILLDAVQG SSIAQTLAPD LWVMVGPQSG HRFHFAERSA AADVLIGMRD
PNHDWPRLVE DLHGLPDEDD RRLAIELITD VGPEQFDASL VVRAVLAFLG LLPDAAQPLE
PIDTIGPLHL LALKMSADQS QAVLDGLVGR VPSKKGLEWE VRYALTDFVD LLIARRLEAG
TPEPLSLLAW LRLARSRESR SGDWQKPINQ FLRDNNNVRR AIQHHVFLVE TDHEHVWGRF
WRLSEIDAGF RPTPDDVAAL LASPALSDRS RPEVREAWRD IVRLAARPEG VPDSVLAAAE
RYAQGDGELE SYLRALVDPP VPEWQLKDEA RKRADSEARE IRWATHRTDF AANIEKVRVG
ELGWIVPPAR AYFGRFNDMD DALSPPDRIG AWLGEALVEP VLSGLEAVLH RQDLPGAQTI
GESYAQSRRW NVIEPILAAV CERARLDKGL GDLPDDVLLS ARLGLVYEHV EEKFGAPQVE
ALLDTALRSR PGLYERYARM LVEPQLEARS SHVRGLYQLT RPKGADPIIT QLAIEWLAKF
AHQDLSAEHE LVGYLVRVGA WDALRQVTQA RRAAGFEDDE QRLAWLATAS LVDFQAARTD
LDVAAKTHPS LLWLIRNRGW PDRSADWSPQ YGEPARLRWI VETFRERWPS VSHPTGSSTG
DQNDWDATEF LSSVIRRLAA DTSDAAVAAL VALRDAPVDG YTTLLKNASV EQWQAKLEAE
FQPANLDQVF NVVRVLPPRT TSDLQAITLD VLDRYQARLR GDDVGRVKLF YGPDGPLDEN
TCRDRLAMQL ADQMPFGLEL VPERQMPEGR RADLVVALGG LQLPIEAKGQ WNRQLWTAAD
QQLDAFYGKD WRAQGFGIYL VFWFGAAMPR PRHLQAPPAP MTPPATAKAL KVALVEQIPE
ARRGAIEVVV LDVSR
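
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the method used by the database: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG). The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS, dropping one terminal stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Raises KeyError on any character other than A, C, G, or T.
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First three blocks of the gene sequence give the first protein block.
print(translate_cds("ATGGCCGACT ACATCCCAAG AACCCTGACC"))  # prints: MADYIPRTLT
```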