Gene Caul_1843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1843 
Symbol 
ID5899298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1960245 
End bp1963496 
Gene Length3252 bp 
Protein Length1083 aa 
Translation table11 
GC content64% 
IMG OID641562333 
ProductTonB-dependent receptor 
Protein accessionYP_001683470 
Protein GI167645807 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000198025 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000327731 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGACTATG TCCAAGGCGC GTCCGCGCGT GCACGTTCAA CTTACAATTC AAGACTCAGG 
TACGGTGTTT CGGCCCTCGC GCTCGGCGCC GTGCTGATCG GAGCGCCGGC CCTGGCTCAG
ACCAAGCCGA CCGATGACGC GCAGACGGTC GACGAGGTTA TCGTCACCAG CATCCGCCAG
AGCCTGAAGA GCTCGCAGCA GCTCAAGCAG AGCTCCGAGA TCATCGGCGA CTCGATCACC
GCCGAGGACA TCGGCGCCCT GCCGGACCGT TCGGTCACCG AGGCGCTGCA ACGCATCCCG
GGCGTGGCCA TCAACCGCTT CGCCGCGGGC GTCGATCCCG ACCACTTCTC GGCCGAAGGC
AGCGGCGTCG TCGTGCGCGG CCTGAACTTC GTGCGTTCCG AGCTGAACGG CCGCGACACC
TTCTCGGCCA ACAACGGCCG GGCCCTGAGC TTCGCGGACG TGCCGTCGGA GCTGATGGGC
GGCGTCGACG TGTTCAAGAG CCCCTCAGCC GACATGATCG AAGGCGGCAT CTCCGGCACC
GTCAACCTGC GCACCCGTCT GCCGTTCGAC AGCAAGAAGC GTCTGTTGTC GCTGTCGGCT
GAAGAGAGCT ACGGCGATTT CGTCAAGAAG TGGGCGCCCA CCTATTCGGC GCTCTACAGC
GACCAGTGGG ATACCGAGGC CGGTACGTTC GGCCTGCTGC TCAGCGCCGT CGACTCCAAG
CTGTGGACCC GTTCGGACGG CACCCAGGTG TCGAACTTCG GTTGCCGCAC CAACTTCACC
AGCGCCCAGA CCGCCAATCC GCAGGCCGTC ACCTGCCCGC AAGGCGGCAA GGGCGTGTGG
TTCCCGCGCG GCGCGGCCTT CCGCAGCACC GAGACCGAGC GCGAGCGCAT CGGTTACGCG
GCGGCCGGCC AATGGCGCAG CAACGACGAC ACCATGCTGG CCACCTTCCA GTACCTGCGT
TCGGAATCGC AGCAGTCCTG GACCGAGCAC GCGATGGAAA TCGCCACCGA CAACGTCCTG
GCGGCGGGCG ATTCGCGTCC GATCGACGGC ACCACCTTCG GCGTTGACAG CAACGGCATC
TTCACCAACG GGATCATCAC CGGTCCCCAG GGTTGGAGAG ACGACCAGAA CAGCGCCGAC
CCGCGCACGC CCAGCTTCGG CCTGCAGAGC AACAACATCT CTCGCAGCGT CGAGCAGAAG
TATGTGACCT CGGACTACGG CTTCAACTTC AAGTGGACGC CGACTGATCG CATCGGGGTG
GCCTTCGACT ACCAGCACGT CGACTCGACG GTCGACAACC TGGACGTCGG CATCTGGGGT
TCGAGCTTCC AGAACCTGGA TCTGAAGCTC AACGGCTCCG ACATGCCGGT GTTCAGCTTC
ATCCCGCCGG CCAGCGGCGC GTCGATCCCG CAATGCTCAC CGCCCAGCGG CAGTTGCTCG
ACCTACCTCC GCGCGCCGTA CGACCACTTC CAGAACCCGC ATAACAGCTT CTGGCGTTCG
GCGATGGACC ACATCGAGCA GAGCGAAGGC AAGGAAGACG CCGCCAAGAT CGATGTCGAC
TACCGTTTCG CGGACGACAG CAGCTGGCTG GATTCGGCCC GCGTGGGCGT GCGCTGGGCC
GAGCGCGACC AGACCACCCG CTTCTCGACC TATAACTGGG GCGTGCTGAG CGAAATCTGG
GGCGGCGGCG GTCCGGTGTG GTTCGACGAT CCGGTCAACG GCAATCCGGC TACGGCAGGC
GGCGAGACCA GCGTGGCGCG CACCGAACTC TATCCGTTCA CCGACTTCAT GCGCGGTCAG
GTCCCAGCTC CGACCGGCCT GGACGCCCGT CCGTTCTACC TCGGCAACAC CGCCACGGAC
TATGCGGGTC TCCAGGCGTT CGCCCTGAAG ATCGGCGACG AGTGGCGCCC GCGCGTCGCG
GCGGGCTCAA CCTGCCCGCA GAACTGGGTC CCTCTGGCCC AGCGCTGCAA TACGGTCGCC
GGAACGCCTT TCCTGCCGGG TGAAATCAAC CCGATCAACG AGAAGACCAA GTCGGCCTAC
GCCATGCTGC GCTTCAAGCA TGAGTTCGAC GGCGACGTTA AGGTCACGGG CAATATCGGC
CTGCGCTACA CCAGCACCAC CCGGGATGCG ACAGGCTTCC TGACCTTCCC GAACACGGTC
CCGGCGACCG ATGCCTCGTG CGATCTCTCC TTCACGAACT GGCAGGCCCA GCCGGATCCC
AAGGATCCCT TCGTGCCCTC GGCGTTCTGC GCCCTTTCGC CCACCGCTCG CCAGAGCGTG
CGCAACTTCA ACAACGGCGC CACGGTCGCG CAATCCGCCC ACGCCAAGTT CACCTACTGG
CTGCCCAGCG TGAACCTGAA GGTGGCCTTG AGCGACGGCT GGCAACTGCG CTTCGCCGCC
TCCAAGACGA TCACCCCGCC GGAAGTCGGG CTGACGCGCA ACTATTACGA CGTCAAGCTC
GACACTAACT CGACGGGCAT CATCAACGGC GTGGTCGGCG GCAACACCAC GGTCGGCAAC
CCGTATCTGA AGCCCACGCA GTCGATCAAT ATCGACGGCT CGGCGGAATG GTACTTCGCG
CCGGTCGGCT CGGTGACCCT CGCTCTGTTC TGGAAAGAAC TGACCGATGT GGCCACCAAC
ACCACCGCGC GGATCCCGTT CACCAACAAC GGTTCGACCT TCGCCGTCGC GGTGACCACG
CCCGGGAATT CGGACGTCAA GGCCCACGTC AAGGGCTTCG AGATCGCCTA CCAGCAGTTC
TACGACTTCC TGCCCAAGCC GTTCGATGGG TTCGGCATCA ACGCCAACTA CGCCTATATC
GACAGCAAGG GCGTGCCGCA AAGCACGCTG TCGGCCACCG ACCCCGACGT GGGCGCTGGC
CGGGTCTCGA CCGTCGACAC TGGCCTCTTG CCGCTGCAAG GCCTGTCCAA GCACAACGTC
AACTTCGCCG CCATCTATGA GAAGGGTCCG ATCTCGGCCC GTCTGGCCTA CAATTGGCGT
TCGGACTTTC TGTTGACTGT CCGTGACGTG ATCGTGCCCT TCGCACCGAT CATGAACGAG
GCTACCGGCC AGTTGGACGG ATCGCTGTTC TACACGATCA ATCCGAAGGT GAAGATCGGC
GTGCAGGGCG TGAACCTGAC CAACGAGGTC ATCAAGACCA CCCAAGTGTT GAACAACGAC
CTGCTCAAGG CCGGGCGGTC GTGGTTCATG AGCGACCGTC GCTACACCTT CGTGTTGCGC
GCCAGCTTCT AG
 
Protein sequence
MDYVQGASAR ARSTYNSRLR YGVSALALGA VLIGAPALAQ TKPTDDAQTV DEVIVTSIRQ 
SLKSSQQLKQ SSEIIGDSIT AEDIGALPDR SVTEALQRIP GVAINRFAAG VDPDHFSAEG
SGVVVRGLNF VRSELNGRDT FSANNGRALS FADVPSELMG GVDVFKSPSA DMIEGGISGT
VNLRTRLPFD SKKRLLSLSA EESYGDFVKK WAPTYSALYS DQWDTEAGTF GLLLSAVDSK
LWTRSDGTQV SNFGCRTNFT SAQTANPQAV TCPQGGKGVW FPRGAAFRST ETERERIGYA
AAGQWRSNDD TMLATFQYLR SESQQSWTEH AMEIATDNVL AAGDSRPIDG TTFGVDSNGI
FTNGIITGPQ GWRDDQNSAD PRTPSFGLQS NNISRSVEQK YVTSDYGFNF KWTPTDRIGV
AFDYQHVDST VDNLDVGIWG SSFQNLDLKL NGSDMPVFSF IPPASGASIP QCSPPSGSCS
TYLRAPYDHF QNPHNSFWRS AMDHIEQSEG KEDAAKIDVD YRFADDSSWL DSARVGVRWA
ERDQTTRFST YNWGVLSEIW GGGGPVWFDD PVNGNPATAG GETSVARTEL YPFTDFMRGQ
VPAPTGLDAR PFYLGNTATD YAGLQAFALK IGDEWRPRVA AGSTCPQNWV PLAQRCNTVA
GTPFLPGEIN PINEKTKSAY AMLRFKHEFD GDVKVTGNIG LRYTSTTRDA TGFLTFPNTV
PATDASCDLS FTNWQAQPDP KDPFVPSAFC ALSPTARQSV RNFNNGATVA QSAHAKFTYW
LPSVNLKVAL SDGWQLRFAA SKTITPPEVG LTRNYYDVKL DTNSTGIING VVGGNTTVGN
PYLKPTQSIN IDGSAEWYFA PVGSVTLALF WKELTDVATN TTARIPFTNN GSTFAVAVTT
PGNSDVKAHV KGFEIAYQQF YDFLPKPFDG FGINANYAYI DSKGVPQSTL SATDPDVGAG
RVSTVDTGLL PLQGLSKHNV NFAAIYEKGP ISARLAYNWR SDFLLTVRDV IVPFAPIMNE
ATGQLDGSLF YTINPKVKIG VQGVNLTNEV IKTTQVLNND LLKAGRSWFM SDRRYTFVLR
ASF