Gene Caul_4751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4751 
Symbol 
ID5902213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5136028 
End bp5138415 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content66% 
IMG OID641565270 
ProductTonB-dependent receptor 
Protein accessionYP_001686369 
Protein GI167648706 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0405819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTTG AGATCCATCG CCCGCGCGCG CGTCGCGCGC TGCTGGCCGC GACTTCGCTG 
GTCGCCGTCG TCGCCTTGCC CATGTTGGCC AGCGCCCAGG ATCGGTCGGA CACCGTCGAG
GAGCTGATCG TCACCGCCAC CAAGCGCGAC GCGACCATTC TGGATGTGCC GTTCTCGATC
AACGCCCAGA CCGAGGCCGA CATTCAGAAG TCTGGCGCGG TCACCCTCGA GGACCTGTCG
CGCAACGTCG CGGGCCTGAC GATCCAGAAC CTCGGCCCCG GCCAGAGCCA AGTGTCGGTG
CGCGGCGTCT CGGCCGGCCA GGTGGTCCGC GACCAGCCGG GCGTGAAGGA GCAGGTCGGG
GTCTATCTCG ACGAGTCGGT GATCTCGCTG TCGCTGTTCA CCCCCGACAT CGACCTGTTC
GACCTCAACC GGGTCGAGAC CTTGCGCGGC CCGCAGGGCA CGCTGTTCGG CTCCGGTTCG
GTGGGCGGCA CGATCCGCTA CATCACCAAC CAGCCCGTGA TCGGCGACTA TAAGGGTACT
GTCGAGGCCA ATCTCAACAC CCTGAAGGGC GGCGACGTGG GCGGCTACGT CAAGGGCGCG
GTCAATATCC CCGTCTCGGA CAAGGTCGCG TTGCGGGCGG TCGGCTATGA CACCGAATAC
GGCGGCTTCG TCGACGCCCT GGGCGAGGGC GGGACCAAGA AGAACAACGT CAACGACGGC
TATCGTCGCG GCGGCCGGCT GTCGTTGCTG TTCAAGCCGA CCGACGACAT CAAGATCACC
CCGCGCGTCG TCTATCAGAA GATCCACGCC GGCGGCTTCA ACCGCCAGGA AGCCTTCAAC
CTGTTCGCCA ACCCCTACAC CACCACCCGG CCGGCGATCA CCCTGGGCGA GCGCCAGCAG
TACCTGCTGC TCGACGAGAG CTTCGACGAC AAGACCTTCC TGGCCGACCT GACCGCCGCG
TTCGCCTTCG ACGGCGTCGA ACTGACCTCG GTGACCAGCT ATATCGACCG CAAGATCGAC
GTGAACCGCG ACGCCAGCGC CCTGACCGGC AGCGTCTCGG TGGACCTGGG CTTCCCGGCC
GCCGCCGTCA CCTTGCCCTC GAAACTGGTC GACACCACGG ACCTGGAACA GTTCACCCAG
GAAGTGCGCC TGGGCTCGCG AACCGACAGC CCGTTCCAGT GGGTGGTTGG CGCGTTCTAT
TCCAAGGTCG ACCGGGTCTA TAACCAGCGC CTGCCGACGC CGGGCTACGA CGCCTATACC
GACGCCACGC TGGGGGCCGG AACCTCGGCG CAAGTGGCCA ACGGCTTCCC GGCCAATTCG
CCCTACAACG CGTCCCTGCC CTACAACATC AAGCAGAAGG CGGTGTTCGG CGAAGCCAGC
TACGAGATCG ACAAGCTGAC CGTGACGGCC GGCGGCCGCT ATTACGACTT CAAGGAAAAC
CGCCGCTTCA CGTCGGGCGG CCTGTTCGCC AATGGCGACG ACCAGACCGA CAAGACCTCG
TCGGATGGCT TCACCCCGCG CCTGCTGGTC AGCTATAAGG CCAATCCGGG CCTGACCTTC
AACGCCCAGG CGTCCAAGGG TTTCCGATTG GGCGGGGTCA ATGATCCGCT GAACATCCCC
CTCTGCACCC CGCAGGACGC GGCGATCTTC GGCGGCTTCC AGTCCTATGA CGACGAGACG
CTGTGGAACT ACGAGGGCGG GGTGAAGTCG CGGTTCGGCG GCGTCACCTT CAACGGCGCC
GTGTTCTATA CCGACATCAA GAATCTGCAG ACGACGCTCG ACGCCGGCTC GTGTTCGTCG
CGCGTGGTGT TCAACGTGCC CAAGGCCCAC ACCAAGGGCA TCGAGGGCGA GCTCACGGCC
CACCCGGCGC CGGGCCTCCA ACTCGGCGTT TCGGGGAGCC TGCTGGAGGC CGAATTCGAC
TCCACGGTGA GGGATGGCGC GGGCGCGGTG ATCGGCGGGA TCCGCGAGGG CAACCGCCTG
CCCTCGGTGC CCAAGTTCCA AATCTCGATC AACGCCACCT ACACGCGGAC CCTGACGGCT
GCCATGGACG GCTATGTCAC CGCGTCGTTC CAGCACGTCG GCAACCGCTA CACCCAGGCC
AGCGACCAGG AGAATAATCC GCGCGCCTTC GTCTCCGGCT TGCCGTTCGG CGGGGCGACG
GGAACCCAGG CCACGGTTCT AGACCTGCAA CTGCCCAGCT ACGACCTGGT CAATCTCAGC
GCCGGCCTGC AGATGGACAG CGGCCTGGAC GTGATCGCCT ACGTCAACAA CGTGTTCGAC
GAGAACCCGC TGCTGTCGTT CGACCGCGAA CGCGGCGGCC GGGCGCGCCT GGGCTATGCG
ATCGGCCAGC CGCGCGTCAT CGGCCTGACG GTGCGGCAGT CGTTCTAG
 
Protein sequence
MRLEIHRPRA RRALLAATSL VAVVALPMLA SAQDRSDTVE ELIVTATKRD ATILDVPFSI 
NAQTEADIQK SGAVTLEDLS RNVAGLTIQN LGPGQSQVSV RGVSAGQVVR DQPGVKEQVG
VYLDESVISL SLFTPDIDLF DLNRVETLRG PQGTLFGSGS VGGTIRYITN QPVIGDYKGT
VEANLNTLKG GDVGGYVKGA VNIPVSDKVA LRAVGYDTEY GGFVDALGEG GTKKNNVNDG
YRRGGRLSLL FKPTDDIKIT PRVVYQKIHA GGFNRQEAFN LFANPYTTTR PAITLGERQQ
YLLLDESFDD KTFLADLTAA FAFDGVELTS VTSYIDRKID VNRDASALTG SVSVDLGFPA
AAVTLPSKLV DTTDLEQFTQ EVRLGSRTDS PFQWVVGAFY SKVDRVYNQR LPTPGYDAYT
DATLGAGTSA QVANGFPANS PYNASLPYNI KQKAVFGEAS YEIDKLTVTA GGRYYDFKEN
RRFTSGGLFA NGDDQTDKTS SDGFTPRLLV SYKANPGLTF NAQASKGFRL GGVNDPLNIP
LCTPQDAAIF GGFQSYDDET LWNYEGGVKS RFGGVTFNGA VFYTDIKNLQ TTLDAGSCSS
RVVFNVPKAH TKGIEGELTA HPAPGLQLGV SGSLLEAEFD STVRDGAGAV IGGIREGNRL
PSVPKFQISI NATYTRTLTA AMDGYVTASF QHVGNRYTQA SDQENNPRAF VSGLPFGGAT
GTQATVLDLQ LPSYDLVNLS AGLQMDSGLD VIAYVNNVFD ENPLLSFDRE RGGRARLGYA
IGQPRVIGLT VRQSF