Gene Caul_1862 details


Gene Information

Locus tag: Caul_1862
Symbol:
ID: 5899317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1996051
End bp: 1998894
Gene Length: 2844 bp
Protein Length: 947 aa
Translation table: 11
GC content: 69%
IMG OID: 641562352
Product: TonB-dependent receptor plug
Protein accession: YP_001683489
Protein GI: 167645826
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01782] TonB-dependent receptor
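
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1996051
end_bp = 1998894

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in triplets; assume the last codon is a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (2844 bp) and Protein Length (947 aa) fields.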


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.248067
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACGCC ACCTCTGTCG CGCCACGCTG CTGGCGCTCA TCTCGGTCGG CTCGGTCTCG 
ACCGCCGGCC GCGCCCAAGG CCGGACAGAA TTGGGACCCG CCCTCACCCG GCTCGCCGTC
GAACGCAATG TCCAGATCCT TTTCCAGCCG CATCTTGTCG AAGGCCTGGT CGCCAATCCG
GTTCGGCGCG GGACCAGCCT GGACCAGGCC ATGACCATGA TCATTGGCCG CCAGGGCCTC
AGGATCCGCA AGGTGCGCGC GGGGATCTAT GCGGTCGAGC CAGAAATCAG GAGCATTTCT
CCACCGGCTT CCCTCTCCAA GGAAGACAGC CCGAGCGTTG TCGCCCCGCT GATCGTCACC
GCCGTCTACG CCGCCAGCCT CGAGCGGACC CTTGCCCTCA AGCGGGACGC GACCCACGGC
CTGGACGCGG TCAGCGCCGA GGACATCGCC CGCCTGCCCG CCGCCAACGC CGCCGAGGCC
CTGCAACTGG CGCCGGGCGT GAGCCTGGAG CGCCATCGCG GCGTGGGCCT CTATGTCAGC
GTCCGGGGGC TTGGGCCGCA GTTCCAGAAC GTCCTGCTGA ACGGCCGGTC GATCGCCATC
AACGACCTGG TCGAGAACGG GGGGTTTCGC GGCCGGCAGT TCCGCTTCGA GGTCCTGCCC
TCCGACGTCA TCGACCGCAT CGAGGTCATC AAGACCACCA CCGCCGACAT GGACGAAGGG
GCGCTGGGCG GCAATATCGA CGTGCGGACC TTCAAGCCGC TGGAGCGCGG CCCCCGCGCG
GTGCTGTCGG CCCGGGCCTC GCAGGGCCAG GCCGGCAAGC CGGACCCCGC AGTGTCGGGG
GTCTGGAGCT GGGTCTCCCC CGACGGTCAT CTGGGCCTGC TGGCCGCGGG CATGGCCGAG
CGCCGCCAGA TCCGTAACGA CCGCCTCTAC CAGACGGGCT GGAACCTCGA CCGCTTCACC
AATGTCCTGC CCGCCGGCCT GTACACGCCA ACGCGCACGC GGCCGACCAT CGAGCTGGAA
GACCGCCGTC TGATGTCGGG CGACTTCGCC CTGCAGTGGC GGCCCTCGCC CGACTGGCGG
ACCGATATCG ACCTGCTGGT GACACGCCTG GACGCCCACT ACGACGAATT CGGCCTGGAC
ATCTATCCGG ACGACACCAC CTTCGCCCAC CCCGCCTTCG TGGCTGGCAG CCAGAGGGTG
GTCGGCGACA CCGTTCAGGC CGGCCAGATC GACAACGTGC GCTGGATGGC GTCGCGCGAG
ACGAGCCTCA ATCGCCACGA CTTGGCGGCC TTCGGCGTCC GGCAAAGCTG GACGCCGGGC
GCGTGGGCGC TGGACATCGA CTACGCCTAC TCCCGAGCCC GCAGCTATCA TCCCGACGGC
CAGGGCACTG TGCGGGCGCG CGCCGCCTTC TTCGCCCCGT TGATTTACGA CTTCGGCGGC
GGCCTTCACA GCGCGCCGAC GCTGAAGACC ACGATCGACT ACACCGACCC CGCGCGGTTC
GTCGGCCAGG CGTTCGACTA CACCTGGAAG GACTCGCGCG ACACCGACGA GGCGCTGAAA
GCCGACCTCG CCCGGTCGTT GGGGGCTGGC AAGCTCAGCC TTGGGGTCGA GGGTCATCGG
CGGGTGCGCG ACTATCGCCG GCGCGACTGG ATTCTCAACA CGGTCGTCGG CGCGCCCCTG
ACTAGCCTGG GCGGTGAGTA TTACGGCCAA ACTCCCGTCT CGGACTATCT GGCCGGCACG
CGGGGCGAGC TGCCGCGCCA CTGGGTGGCC CTGGACGCCC GCGCCTTCTA CGAGCAACTG
TTCACGGAAG AGATCGCAGC CTTGCCCCCG ACGGTGTCTG ATCGGCGCAA TTCGTTCGTG
GTCGAGGAGA AGATCCTCTC GGCCTATGCG CGCGGCGACT TCTCGGCCCG CTGGTTCGGC
CTGCCGGTCG ATGGCGACGT GGGCGTTCGC TACGCGAGCA CGCGGCAGAT CTCGACGGGC
GTGCTGTCCA GCGGCGCCGA ACCGATCCCC GCCCAGTGGC GCAAGGCCTA TGGCAACTGG
CTGCCCAGCG CCAATTTGCG CGTCACCCTG ACGCCGGACC TGCTGCTGCG GCTGGCGGCC
TCGCGGGTGG TCAACCGCCC CAACGTCGTC GACAACGCCC CGCGCATCAC CCTGGCCCGC
GACACGCCGA CCGCCAACGG CGGCAACCCG GACCTCGACC CGTTCCTGGC CACCCAGTTG
GACGCCTCGC TGGAGTGGTA CTTCCCGTCC GGCGGCGCCC TGACCGGCGC GGTGTTCGAC
CGGCGGCTCG ACAACTACAT CACTGCCCAG AACACTTTCA TCCAGGTTCC CGGGCGCGGC
GAAATCCTGC TGTCGACCAA CGTCAACGGC GGCGACGCCC GCATCCAGGG TCTGGAGCTG
GCCTACAGCC GAACCTTCAA AAGCCTGCCC GCGCCGCTGA ATGGCCTGGG CATGCAGGGA
TCGCTGACCC TGGTGCGCAG CCAGGCCAAC TATTTCGCCG GCGACCGGGT GATCCGCAAC
GCCCTGCTGG GCCTGTCACG CACCAACTAC AGCCTGCTGG CCTTCTACGA GCGCGGCCGC
GCCTCCGTGC GACTGGGCTA CAATTGGCGC GGCGCATACC TGACCACGAT CGGTAGCTCG
ATCACCGCCC CGGCCACCAC GGCGGCCTTC GGGTCGTTGG ACGGCGCGGC GTCCTGGCGG
GTCAATCGGC GGGCGACGAT CACCTTCGAG GGCGTGAACC TGGCCGACGC GCGGCGCTTC
GTCTATGGCG AGAGCCGCGA CCAGCCGATG GAAATTCATC ACTGGGGCCG ATACCTGTCC
ACGAGGCTGC GATGGGCGTT CTGA
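
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs whitespace removed before any computation. As a minimal sketch, the hypothetical helper `gc_fraction` below strips the whitespace and reports the share of G and C bases; it is shown on the first row only, and pasting in the full sequence would give the figure to compare with the GC content field.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and newlines."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First row of the gene sequence, copied as printed in this record.
first_row = "ATGCCACGCC ACCTCTGTCG CGCCACGCTG CTGGCGCTCA TCTCGGTCGG CTCGGTCTCG"
print(f"GC fraction of the first row: {gc_fraction(first_row):.0%}")
```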
 
Protein sequence
MPRHLCRATL LALISVGSVS TAGRAQGRTE LGPALTRLAV ERNVQILFQP HLVEGLVANP 
VRRGTSLDQA MTMIIGRQGL RIRKVRAGIY AVEPEIRSIS PPASLSKEDS PSVVAPLIVT
AVYAASLERT LALKRDATHG LDAVSAEDIA RLPAANAAEA LQLAPGVSLE RHRGVGLYVS
VRGLGPQFQN VLLNGRSIAI NDLVENGGFR GRQFRFEVLP SDVIDRIEVI KTTTADMDEG
ALGGNIDVRT FKPLERGPRA VLSARASQGQ AGKPDPAVSG VWSWVSPDGH LGLLAAGMAE
RRQIRNDRLY QTGWNLDRFT NVLPAGLYTP TRTRPTIELE DRRLMSGDFA LQWRPSPDWR
TDIDLLVTRL DAHYDEFGLD IYPDDTTFAH PAFVAGSQRV VGDTVQAGQI DNVRWMASRE
TSLNRHDLAA FGVRQSWTPG AWALDIDYAY SRARSYHPDG QGTVRARAAF FAPLIYDFGG
GLHSAPTLKT TIDYTDPARF VGQAFDYTWK DSRDTDEALK ADLARSLGAG KLSLGVEGHR
RVRDYRRRDW ILNTVVGAPL TSLGGEYYGQ TPVSDYLAGT RGELPRHWVA LDARAFYEQL
FTEEIAALPP TVSDRRNSFV VEEKILSAYA RGDFSARWFG LPVDGDVGVR YASTRQISTG
VLSSGAEPIP AQWRKAYGNW LPSANLRVTL TPDLLLRLAA SRVVNRPNVV DNAPRITLAR
DTPTANGGNP DLDPFLATQL DASLEWYFPS GGALTGAVFD RRLDNYITAQ NTFIQVPGRG
EILLSTNVNG GDARIQGLEL AYSRTFKSLP APLNGLGMQG SLTLVRSQAN YFAGDRVIRN
ALLGLSRTNY SLLAFYERGR ASVRLGYNWR GAYLTTIGSS ITAPATTAAF GSLDGAASWR
VNRRATITFE GVNLADARRF VYGESRDQPM EIHHWGRYLS TRLRWAF
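
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative, and only the first and last rows of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G). The amino acid
# assignments below are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
# Every full row holds 60 bases, so each row starts in frame.
first_row = "ATGCCACGCC ACCTCTGTCG CGCCACGCTG CTGGCGCTCA TCTCGGTCGG CTCGGTCTCG"
last_row = "ACGAGGCTGC GATGGGCGTT CTGA"

print(translate(first_row))  # MPRHLCRATLLALISVGSVS
print(translate(last_row))   # TRLRWAF
```

Both outputs match the start and end of the protein sequence listed above, and the final TGA codon accounts for the one-codon difference between the gene length in codons and the protein length.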