Gene Caul_0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0143 
Symbol 
ID5897855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp157133 
End bp159727 
Gene Length2595 bp 
Protein Length864 aa 
Translation table11 
GC content68% 
IMG OID641560628 
ProductTonB-dependent receptor plug 
Protein accessionYP_001681779 
Protein GI167644116 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.556848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.758309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGAT CGTTCGACCG ACGCCTGGAT CGCCGTTTGA TTGGGGGCTC CACCGCAGTC 
GCGGCCCTGA TGCTCCTGGC CGCCCCCGTG GCCGTCCAGG CCCAGCAAGC CTTTCAGCAG
ACCAGGGGGG CCACCTACAG CTTCGACATT CCGTCCCAGG ACCTGAGCGG CGCCCTGCGC
GCCTTCGGCC AGGCCTCGGG CCAACAGCTG GCCTTCGACG AGGCCAGCGT GCGCGGCAAG
CGCGCCCCGG CCCTTCAGGG CTCCTACACC GCCGAAGCCG CGCTCAAGCG TCTGCTGAGC
GGCGCCGGCC TCAGCGCCCA GCCGACCGCC AGCGGCGTCT ACGCCATCCG CGCCAAGACC
CTGCTGGTCG CCACCAGCGC CCAGGCCGAA GCGCGGCCCG CCGCCGCCGC CGAACCCGAT
CCCGAAACGC CCTCGCAGGT CGAGGAAGTG ATCGTGGTCG GCACCCCCGG CGGCAAAGGG
ATCGACAAGC TGTCGGCCAG CTTCGCGGTC ACCACCGTCA ACGCCGACGA CATCACCAAG
GCCTCGCCCA AGAGCACGGC CGAGCTGCTG ACCCTGGTGC CCGGCGTCTG GGTCGAGACC
TCGGGCGGCG TGGCCGGGGC CAACGTCTTC GTGCGCGGCT TCCCCGCCAC GGGCGACGCC
GAGTTCCTGA CCATCCAGCT GCAGGGCTCG CCGATCTATC CGCCCTCGAC CCTGTCGTTC
CTCGAGAACA GCTCGATCTT CCGGGTCGAC GAGACCATCT CGCGGATGGA GGCCCTGCGC
GGCGGCCCCA ACCCGATCTT CTCCAACGGC CAGCCGGGCC TGACCACCAA CTTCCAGCTC
AAGGAAGGCG GCGAGGAGAC TCATGGCCTG GTCAAGATCT CGACCTCCGA CTACGACCTG
CGCCGCGTCG ACGGCCTGCT GAGCGGCAAG CTGGCCGACG ACCTCTATTT CATGATCGGC
GGCTACGCGA CCACCTCGCC GGGCGTGCGC GACACCCAGT TCGACAGCGA GGTCGGCCAG
CAATTCACGG TCAACATCAC CAAGCAGATG GAGCACGGCA AGCTCAACCT CTACGCCCGC
TACACCGACG ATCACGGGGC CTGGTACCTG CCGTTCGCCA TCAACGTGCC TGGCGTCGAC
AAGGGCGAAT ACACCCAGCT GGGCAACGCC AGCCGCTTCC ACACCCTGCA GATCAACGCC
GCCGGGACGA CCCGCAATTT CGACCTGGCC GACGGCCGGG GTTTCAAGGG CTATGTGGCC
GGCGGAAGCT TCGAGCACGA GTTCGGGGAC GACTGGACGG TGCGCGACCG CTTCAGCGTC
ACCGACGGCG ACGCCAACAC CTATGGCCTG GTGGGCGACG GCGCGGCGGT GACCGTGGCC
GCGCTCAACG CCGTGGTCCC GGGCGCGATC AAGACCCGCG GCGGGGCGAC CCTGGCCTCG
ACCGACTACG TGCAGAACTG GGGCGCCTGG ATCGTCGAGA AGAAGATCAA GGCGGTCACC
AATGACCTGT CGATCGCCAA GACGGTGGGC GCGCATCAGC TGTCGGCCGG CTATTACGCC
TCCAGCTTCA AGTCCGACGA CTTCTGGACG ATCGGCAATG TCGAGCCGAT GCAGGTCAAG
GCCAACGGCG ATTATCTGGC CGCCCCGGTC ACCTGCGCCG ACCTGGCCAC GGCCGGCAGC
ACCTCCAGCT GCTTCAAGTT CGGCCTGACC TCGGCCGGCG ACGGACGGGT GGACGCCCTG
TACCTGGCCG ATTCCTGGCA GGTGATCGAT CCGCTGCGCA TCGACCTGGG CGTGCGCCGC
GAGCGCTTCC GGACCGACTA CGTGGTCGAT GACGGCCCCG GCTTCCCCGA CGGCCTGGCC
CTGCGCACCA CCGACTACAG CAACAGCAAG ACCAGCTACA CGGTCGGCGT GAACTACGAC
ATCAACGCCG TCTCGGGCGT GTTCGGCCGC TATTCGCGCG GCTACAAGTT CCCCAGCTTC
GACAACTTCC GCGAAGGCCT GACCGACACG CTCAGCGTCG ACCAGTATGA GCTGGGCTAC
AAGCTGCGCA GCGGGCCGTT CGAGCTTTAC GCCACAGGCT TCGCCAACGA ATTCAAGGGC
GCCAAGTTCG CCGACGTCGG CGGCATCCAG GAGAGCAACA GCAACAAGGC CCACGGCGTC
GAGCTGGACG GCCGCTGGCG TTCGGACTTC GGCCTGTCCC TGGCCCTGAA CGGCACCTGG
CAGAAGACCG AGATCACCAA GTCGAGCATT CCGGCCAACA AGGGCAACAG CGCCCAGCGC
CAGCCCGACT GGATGATCCG CTTCACCCCC AGCTATGACG TGACCCTGGG TTCGGTGGAA
TCGACCTTCT ACGGCACGGT CAGCGCGGTG GACGATCGCT TCGCCGACAA CGCCAACACC
CAGGTGCTGA AGGGCTACAC CAAGATCGAC CTGGGCGCGA TCTTCGACGT GGCGGACTTC
ACCGTCCAGG TCTCGGCCGA CAACCTGACC GACAGCCACG GCCTGACCGA GGGTGATCCG
CGCTCGACCA GCGGGGCCAA CGCCCGCCCG ATCCTGGGCC GGTCGTTCAA GGTGAGTGTC
GGCTACAACT TCTGA
 
Protein sequence
MRRSFDRRLD RRLIGGSTAV AALMLLAAPV AVQAQQAFQQ TRGATYSFDI PSQDLSGALR 
AFGQASGQQL AFDEASVRGK RAPALQGSYT AEAALKRLLS GAGLSAQPTA SGVYAIRAKT
LLVATSAQAE ARPAAAAEPD PETPSQVEEV IVVGTPGGKG IDKLSASFAV TTVNADDITK
ASPKSTAELL TLVPGVWVET SGGVAGANVF VRGFPATGDA EFLTIQLQGS PIYPPSTLSF
LENSSIFRVD ETISRMEALR GGPNPIFSNG QPGLTTNFQL KEGGEETHGL VKISTSDYDL
RRVDGLLSGK LADDLYFMIG GYATTSPGVR DTQFDSEVGQ QFTVNITKQM EHGKLNLYAR
YTDDHGAWYL PFAINVPGVD KGEYTQLGNA SRFHTLQINA AGTTRNFDLA DGRGFKGYVA
GGSFEHEFGD DWTVRDRFSV TDGDANTYGL VGDGAAVTVA ALNAVVPGAI KTRGGATLAS
TDYVQNWGAW IVEKKIKAVT NDLSIAKTVG AHQLSAGYYA SSFKSDDFWT IGNVEPMQVK
ANGDYLAAPV TCADLATAGS TSSCFKFGLT SAGDGRVDAL YLADSWQVID PLRIDLGVRR
ERFRTDYVVD DGPGFPDGLA LRTTDYSNSK TSYTVGVNYD INAVSGVFGR YSRGYKFPSF
DNFREGLTDT LSVDQYELGY KLRSGPFELY ATGFANEFKG AKFADVGGIQ ESNSNKAHGV
ELDGRWRSDF GLSLALNGTW QKTEITKSSI PANKGNSAQR QPDWMIRFTP SYDVTLGSVE
STFYGTVSAV DDRFADNANT QVLKGYTKID LGAIFDVADF TVQVSADNLT DSHGLTEGDP
RSTSGANARP ILGRSFKVSV GYNF