Gene Caul_2243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2243 
Symbol 
ID5899698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2437597 
End bp2440599 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content69% 
IMG OID641562734 
ProductTonB-dependent receptor 
Protein accessionYP_001683868 
Protein GI167646205 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.212076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCAG TTCTTGGGGC GCTTTTGATC GGCGCCAGCG TCGCGATCCT GGCGGCAGCC 
GCCACGCCAA CCAGCGCGAC GGCGGCAGAG CCGACCATAG CGCTCAATCT GCCCGCCGGC
CCGATGCAGA AGTCTCTGGT CGCGCTCGCG ACCCAGGCCG ACGTCAAGAT CCTGTTCGAG
ATGGATCTCG TCGCGGGGCT GACCGCGCCG GCGCTTCAGG GTCAGTTCAC GCCGCGCCAA
GCGGTCGAAA GGCTGCTCGC TGGAAGCGGC GTCGCCGTCG ACCAAGTCCG ACCGGGCGTT
TTGGTTCTGC GACCCGCGCG CTTGGGCGCG AGCGCAGAGG CTGCGGCTTT TCCGGGCGGC
GCGCTCGGCG GGGAACCGGC GCAAGCCGAC GAAACCCTGC TGTCGGAGGT CGTGGTCGGC
AGCCATATCC GTGGCGCTCA GGGCGCCTCG CCCATCGTCA CTTTCGACCG GAACGCTATT
GATCAAGGCG GCTACGCGAC GCTCGCCGAC GCCTTGACGG CCCTGCCCCA GGCCTTCGGT
GGCAGCGTGT CGGACGACAC CGGCGCGACG GGCGCCGACA CCACGGGCGT CAACACCGCC
CGTGCGACCG CCGTCAATCT GCGGGGCCTT GGCGCGGACT CGACGCTGGT GCTGGTGGAC
GGCCGGCGCA TGGCCGGCGC GGGACTGAAG GGCGACTTCG CCGACGTCTC CAGCCTGCCG
CTGGTCGCGG TGGAACGCGT GGAGGTGCTG CTCGACGGAG CCTCGGCCCT CTATGGCTCT
GACGCCGTCG GCGGTGTCGT CAACATCGTC ATGCGCAAGG ACTACGAAGG CGCCGAGACG
CGGCTGACCG CCGGGGGGTC CACGCGCGGC GACCTGCGCC AGGTGTCGAT CGCCCAGACC
TTCGGGACCC GCTGGGCAAG CGGCCATGCG TTGATCTCCT ACGAGCACCA GGACCGCGAG
GCGTTGGCGG GACGCCGGCG CTGGTACGCC GGCCAAACGG ACCTTCGACC CTGGGGTGGA
ACCGACCAGC GGCGGTACTA CGCCAAACCG GGGACCGTGG TGTCGTTCGA CCCGGTCAGC
GGCGCCCTGG CGCCGGCCTA TGCGATCCCG AACAGCGCCC CGGGAACCGT GCTGCGCGCC
AGCGACTTCA CCGCGGGGCA AAATCTCGAG AATTGGCGGG CCGGATACGA TGTCCTCCCG
GCCCAACGTC GCGACAGCGT CTTTCTCGCC GCGAGCCAGG ATCTTGGCGC GCAGGTCACG
GTCTCCGGCG ACCTGCGCTA CTCCGACCGG CGCTTTCACG CCACGGGCTT GGCGTCTGAT
AGCCTGATCT TCGTCACCCC CGATAACCCC TGGTACGCCT CGCCGACGAA CGCTCCTTCT
GAGATCGTCG CCTACTCCTT TCTCGACGAA TTGGGCGGCG TGCGCAGCCG CGGCTCGGTG
CGCAGCCTCG CCGCCTCGGT CGGGCTTGAG GCACGCCTTC CACACGACTG GCGGCTGACG
ACCTATGTCG CGCACGCCGA GGACCTGTCG CACACCCGCG GCGACAACGT CGTCAATCCC
ACCCTGTTGG ACGAGGCCTT GGGGGCCACG CCCGACGATC CGGCCACGGC CTTCAGCGCC
GCGCGCGACG GCTATTTCAA TCCCTTCATC GGCCAGGGCG CCAACAGCAG GACCGTGCTC
GACTTCATCA GATCCGGCTA CGAGACGCGC CGCACGCTGG GGGAGACCGA CAGTTTCAGT
CTGCAAGCGG ACGGCGCTCT GGCCACCCTG CCCGGCGGAC CTTTGCAGGC GGCCGTCGGC
GTGCAATTTC GCCGCGAGCG TCTGGACACC GGCGGCACGA GCTTCGTCGG CGGAACCGCG
CCGCGCGCCG GCTTTTCGCG AAAAGGCGAG CGCACCGTCA GCGCGGGTTT TGTCGAGCTG
CGGGTTCCGC TGGTCGGCGA TGCAAACCGC CGGGCCGGCA TCGAGCGCCT GGAGCTTTCG
GCCGCCGGGC GGATCGAGTC CTACGATGAC GTGGGGACCA GCACCGTTCC GAAGTTCGGC
TTGGTGTGGA AGCCGATTGG CGACCTCACG GTTCGAGGCA CCTACGGCCG GGCCTTTCGC
GCGCCCTCGC TGGGGGAACT AAACGACAGG TTCCTGATCA CCCCGGTGTT CCTGACGCGT
GGCGCTGACA CCGTGCTCAG CCTGCTGCTG TTCGGGGGCA ATCCCGAGCT CAAACCCGAG
ACCGCCAAGA CTTGGACGGC GGGCTTTGAC TGGACGCCAC AGGCCCTGCC CGGCCTGAAG
GTGTCGGCGT CAACCTTCGA AACTCGCTTC AAGGACCGTA TCGGCCAACC GGCCAATGAT
AATCTTGGCA TCGTGCTGAC GGCTGACGAG TTCGCCGCCT TTCGCCGGTT CGTCGATCCG
GCGGGCAACG CCAGCGATTT GGCGCTCGTG CAAGGGCTGA TCGACGACCC GGCTTCGCGC
GCCAAGGGCC TCTTTCCGGC CGTCGCCTAC GGCGCGATCG CCGACGCGCG CTATGTCAAC
ACCGCCGCCC TCACGGTGCG GGGTGTCGAC CTGTCAGCGC GCTACGGCCT GAGCCTCCAC
GGCGATCCTC TTGATCTGGA CGCCAGTCTG ACCTGGCTGA CGGATTTCAA GCGGCAGACA
ACCGCCGCGG CGCGGCCCGT CGATCTCGCC GGGCAGACCG GATCCCCAGC CGATCTACGC
CTTCGCCTCA CCGCGACCTG GACCCATGGC CCGCTGGCGG CCACCGGCAC GGTAAACCGG
GTGGGCGATC TTCAAGCCGA AACCGGCGAA CGCGTGGCGT CCTGGACGAC GGTCGACGCC
CAGGTCCGCT GGACCGCGGC TGCAAACAGC CGGCTGGAAG GCCTGACCGC CGCGCTGAGC
GTCACCAACC TCTTCGACCG CGATCCGCCG TTCTACAACT CCCCTCTAGG CCTGGGCTAC
GACCCGGCCA ACGCGGACCC GGGCGGCCGT CGAGTGAGCC TTCAGCTCAC CAAGGCCTGG
TAG
 
Protein sequence
MKPVLGALLI GASVAILAAA ATPTSATAAE PTIALNLPAG PMQKSLVALA TQADVKILFE 
MDLVAGLTAP ALQGQFTPRQ AVERLLAGSG VAVDQVRPGV LVLRPARLGA SAEAAAFPGG
ALGGEPAQAD ETLLSEVVVG SHIRGAQGAS PIVTFDRNAI DQGGYATLAD ALTALPQAFG
GSVSDDTGAT GADTTGVNTA RATAVNLRGL GADSTLVLVD GRRMAGAGLK GDFADVSSLP
LVAVERVEVL LDGASALYGS DAVGGVVNIV MRKDYEGAET RLTAGGSTRG DLRQVSIAQT
FGTRWASGHA LISYEHQDRE ALAGRRRWYA GQTDLRPWGG TDQRRYYAKP GTVVSFDPVS
GALAPAYAIP NSAPGTVLRA SDFTAGQNLE NWRAGYDVLP AQRRDSVFLA ASQDLGAQVT
VSGDLRYSDR RFHATGLASD SLIFVTPDNP WYASPTNAPS EIVAYSFLDE LGGVRSRGSV
RSLAASVGLE ARLPHDWRLT TYVAHAEDLS HTRGDNVVNP TLLDEALGAT PDDPATAFSA
ARDGYFNPFI GQGANSRTVL DFIRSGYETR RTLGETDSFS LQADGALATL PGGPLQAAVG
VQFRRERLDT GGTSFVGGTA PRAGFSRKGE RTVSAGFVEL RVPLVGDANR RAGIERLELS
AAGRIESYDD VGTSTVPKFG LVWKPIGDLT VRGTYGRAFR APSLGELNDR FLITPVFLTR
GADTVLSLLL FGGNPELKPE TAKTWTAGFD WTPQALPGLK VSASTFETRF KDRIGQPAND
NLGIVLTADE FAAFRRFVDP AGNASDLALV QGLIDDPASR AKGLFPAVAY GAIADARYVN
TAALTVRGVD LSARYGLSLH GDPLDLDASL TWLTDFKRQT TAAARPVDLA GQTGSPADLR
LRLTATWTHG PLAATGTVNR VGDLQAETGE RVASWTTVDA QVRWTAAANS RLEGLTAALS
VTNLFDRDPP FYNSPLGLGY DPANADPGGR RVSLQLTKAW