Gene Caul_1136 details


Gene Information

Locus tag: Caul_1136
Symbol:
ID: 5898591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1204250
End bp: 1206547
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 67%
IMG OID: 641561618
Product: TonB-dependent receptor
Protein accession: YP_001682764
Protein GI: 167645101
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
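
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 1204250
end_bp = 1206547

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2298
print(protein_length)  # 765
```

Under these assumptions the results agree with the Gene Length (2298 bp) and Protein Length (765 aa) fields.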


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.438205
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000214411
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCAGAC ACAGGAACTA TCGCCTGCTG GCCGGCGGAT TGCTGAGCGT CAGCGCCGCC 
GCCCTGCAGG GGATCACGCC CGCTCAGGCT CAGGCTCAGG CCCAGACTCA GGCCCAGCCA
GCGAGCGTTC AGATCGAGGA AGTCGTGGTC ACCGCCCGCC GTCGCGAGGA ACGGTTGCAG
GACGCGCCGG TGGCCGTAAC CGCCCTATCC GGCGACGCCC TGCAGGCCCG CGGGGTCGAG
AGCGTCGACC AGATCGCCAG GTTCGCCCCC AGCATCCGCT TCGACGGCGC CGCGGCGCTG
AGCGGCGGCA ACTACAACGC CACCGTCTTT ATCCGCGGCG TCGGCCAGAA CGACTTCGCG
ATCTTCAGCG ACCCCGGCGT GGGCGTCTAT GTCGACGGCG TCTATTACGC CCGCTCGATC
GGCGGCACGA TGGACGCCTT CGACGTCAGC CGCATCGAGG TCCTGCGCGG CCCGCAAGGA
ACGCTGTTTG GCAAGAACAC CATCGGCGGC GCGGTGGTGA TCTCCACCGC CCAGCCCGGC
GATGCGTTCG GCGGCCAGAT CGAGGGCACG GTCGGCAGTC TGAACCGGCG CGACCTCAAG
GGCCATGTCG ACATCCCGCT GAGCGACAAG GCGGCCGTGC GCCTGTCGGC CGCCCGCCTG
ACGCGCGACG GTTACGGCAA GCGACTGCTG ACCGGCGAGG ACCTGGGCGA CCGCAACGCC
ACGGCGGCGC GGGCGCAACT GCGCTGGGAA GCCTCCGACG CGGTGATCGT CGAGCTGTCG
GCTGACTACA CCCGAGCCCG CGAACACTCG GCCCCTCAGA AGCTGCTGGT GATCGGGGCC
GTCCCCGGCT TTGCCGTCGG GCCGTTCATG GGCAACTTCA ACACCTATAT CGCCCCTGGC
CTGGGGATCA CCGCGCCCAA CGGCGCCAAG ACCCTCAACA CCTCGTTTCT GACCGACGAC
CCGTACACGA CCTACGGCAC CGGCCCCAAT GTCAACGACC TGGACCTCTG GGGAACCTCG
GCGACCGTGA ACTGGGACCT GGGGGGCGTC ACCTTCAAGA GCATCAGCGC CGTTCGCGGC
CTGAAGGCCA CGTTCGCCCG CGACGGCGAC AACACGCCCT TCACCTTCCG CGAGACCTTC
AACCACGACG TCCAGGCCCA ATACAGCCAG GAGTTCCAGT TCAGCGGCCA GTCGTTCGAC
GACAGGCTGA CCTGGGTGAC CGGCGCCTAT GTATTCCAGG AGCGCGGCAC CGATTCCGGC
TACGCCAAGC TGGCCCTGGG CCTGGCCCCA GGCGCCTCGC CGCCGCCCTA CAGCCCGTCG
GCGGCGGTCT ATACCAAGGT GACCAGCACC ACCTACGCCC TGTTCGCCCA GGGCAGCTAC
AGCCTGACCG ACCGGCTCAG CGCCACGGTC GGGGCGCGGG TCAATCGCGA TGACAAGGAC
TATGTCCTCG ACCACCGCCG CATCCGTGAC GGCGGGATCA TCGCCCAGCT CAGCCGGGGC
GGCTCGTGGG ACTCGTTCAC CCCCAAGCTG GGTCTGGAAT TCAAGGCCAC GCCCGACGTG
CTGCTCTATG TCTCGGCAGG CAAGGGCTTC AAGAGCGGCG GCTTCAACGC CCGACCGCTC
AACGACGCCA CGGAAGTCAC CGAATACCAG CCCGAGACCC TGCTGACCTA CGAACTGGGC
GCGAAGACCG CCTGGTTCAA TCGTCGGCTG ATCCTCAATC TGGCCGGCTA TTTCAGCGAC
TACCAGGACA TCCAGGTCAC GGTGAACCAG ACGCCGCGCA ACTTCGTGGC CAACGCCGCG
GCGGGCGAGG TCAAGGGCGT CGAACTGGAG CTTCAGGCCC GGCCGACGGC CAACTGGAGC
TTCAATCTGG GCGCCGGCTA CATGGACGCC AAGTACACCA AGGTGGGTTC GGGCCTGGCC
GCCGGCCAAG TGCTGCCGAT CACCCTGGCC ACCCACTTCG TCAAGGCCCC GGAATGGACG
GCCAACGGCG GGATCGAATA TTCCCGCGAA CTGTCGGTGG GCGGTCGCCT GCTCGCCCGC
GTCGACTGGA CCCACTACAG TACGGTCTAT AACGACGTGG CCAATGATCC GGACCTGACC
CAGCCGGCCT ATGACCTGTT CGGCGCCAGG ATCGCCTATA CCTCGGCCAA TCAGCTGTGG
CAGGCGGCGC TGTTCGGCTC CAACCTGTCG GACGAGCGCT ACAAGGTCTC GGGCAACGCC
TCCAGCGGCT TTGGCCTGAA GGAGGCCAGC TACGGCCGGC CGCGTGAATG GGGCTTGAGC
GTCAACCGAA AATTCTAG
 
Protein sequence
MSRHRNYRLL AGGLLSVSAA ALQGITPAQA QAQAQTQAQP ASVQIEEVVV TARRREERLQ 
DAPVAVTALS GDALQARGVE SVDQIARFAP SIRFDGAAAL SGGNYNATVF IRGVGQNDFA
IFSDPGVGVY VDGVYYARSI GGTMDAFDVS RIEVLRGPQG TLFGKNTIGG AVVISTAQPG
DAFGGQIEGT VGSLNRRDLK GHVDIPLSDK AAVRLSAARL TRDGYGKRLL TGEDLGDRNA
TAARAQLRWE ASDAVIVELS ADYTRAREHS APQKLLVIGA VPGFAVGPFM GNFNTYIAPG
LGITAPNGAK TLNTSFLTDD PYTTYGTGPN VNDLDLWGTS ATVNWDLGGV TFKSISAVRG
LKATFARDGD NTPFTFRETF NHDVQAQYSQ EFQFSGQSFD DRLTWVTGAY VFQERGTDSG
YAKLALGLAP GASPPPYSPS AAVYTKVTST TYALFAQGSY SLTDRLSATV GARVNRDDKD
YVLDHRRIRD GGIIAQLSRG GSWDSFTPKL GLEFKATPDV LLYVSAGKGF KSGGFNARPL
NDATEVTEYQ PETLLTYELG AKTAWFNRRL ILNLAGYFSD YQDIQVTVNQ TPRNFVANAA
AGEVKGVELE LQARPTANWS FNLGAGYMDA KYTKVGSGLA AGQVLPITLA THFVKAPEWT
ANGGIEYSRE LSVGGRLLAR VDWTHYSTVY NDVANDPDLT QPAYDLFGAR IAYTSANQLW
QAALFGSNLS DERYKVSGNA SSGFGLKEAS YGRPREWGLS VNRKF
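
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. The helper names `translate` and `gc_fraction` are hypothetical and are not part of the source database. The codon table follows the usual NCBI T, C, A, G ordering, with the amino-acid assignments of translation table 11. Only the first and last lines of the gene sequence are used here, copied from the listing above.

```python
from itertools import product

# Codons enumerated in the T, C, A, G order used by NCBI genetic-code tables.
BASES = "TCAG"
# Amino-acid assignments for translation table 11; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    cleaned = "".join(dna.split()).upper()
    return sum(base in "GC" for base in cleaned) / len(cleaned)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGTCCAGAC ACAGGAACTA TCGCCTGCTG GCCGGCGGAT TGCTGAGCGT CAGCGCCGCC"
last_line = "GTCAACCGAA AATTCTAG"

print(translate(first_line))  # MSRHRNYRLLAGGLLSVSAA
print(translate(last_line))   # VNRKF
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.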