Gene Caul_2142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2142 
Symbol 
ID5899597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2317924 
End bp2321028 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content64% 
IMG OID641562632 
ProductTonB-dependent receptor 
Protein accessionYP_001683768 
Protein GI167646105 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000142845 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0159239 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCAATC CACGAAACAG GTCCGCCTTG GCGCGGGGCA TGACGCTGGC GCTGCTGGCG 
GGCGCCTCGT CCACGGCCTT GGCCACGGGC GCCTGGGCCC AGACCGCGCC GGACGCGGGG
TCCCAGGTCG AGGAAGTCGT CGTCACCGGC ATCCGCGGGT CGCAGATGCG CTCGGTCGAC
GTCAAGCGGC GCGAGGCGTC GATCGTCGAC GCGATCTCGT CGGAAGACAT CGGCAAGCTG
CCCGACGTCA CCATCGCCGA CTCGCTGCAG CGCATTCCCG GCATCCAGAT CAAACGCGAC
GCCGGCGAGG GCGCGACGGT CAATATCCGC GGCCTGGCCC AGGTCATCAC CCTGCTGAAC
GGCGAGCAAT ATCTCAGCGC CGGCAACATG GGTTCGGCCC AGCCGAACCT GCTGGACGTG
CCCTCGCAGC TGATGAACCA GGTGCTGGTC TACAAGTCGA CCGATCCGAA GAACGCGCTG
TCGGGCATCA CCGGCACGAT CGACCTGCGC ACCCGCCGTC CGTTCCAGAT GGAGCAGGGC
CTGCAGATCG CCGGCGGCGT CGAAGGTCAG CGCGGCGAGC GCACCAAGGA CAACGATTAT
GTGATCAACG GCCTGGCCAG CTGGCGCAAT GAACGCGTCG GCCTGCTGGT CTCGGCCGCC
GCCAGCAAGG CGAACCTCGG CAACAACTCG TCCGGCGTCG TCGGCCTGTC GGGCAATAAC
GACTGGGGCG GCTCGGCCGC CAACAACTTC ATCTCGCCGC ATGGCTTCGA AAGCTTCCAC
CGCGAAGTCG AGCGCAAGCG TCTTGGCGTC AATGTGGCCT TCGAAGCCGA CCTGGGCGAG
GGTTTCACCC TGGTCGCCGA GGGCTTCTAC GCCAAGCTCG ACGAATATAA CCGCGCCGCC
GGCATCAATA TCTCCAACCG TTGGGACGGC GGCGCCTTCG GCGTCTGGAC CACCCCGACG
GTGTTCGAGA ACACCGGCCA GACCAGCGCC GGCAACGGCC GCCCGTGGGT GGCGGTCGAT
GAGTACGACA TCAACGCCTG GTGGGTGAAC AGCTTCGCGG TGAACCGCAC CACCAAGTCG
ACGACCAAGA ACTACAACCT CGAACTGAAG TACGACAACG GCGGCGCGTT CACGAGCGAA
GTGCGGGCCA TCCGCGCCAA CGGCAACCGT CTCAGCATGA ACGGCCAGGC CCAGGGCGAC
CTGAGCAACT GGCAGTACGC GCCCGGCCGC TTCAACCTGT TCCGGGATCC GGGCGACCGT
ACGCGCGGTC CGTTCTATCC CGCTTCGATC TGCGCCCAGT ATCCGACCTC GCAACGCAGC
AACGCCGTCG TCGGCAGCGC CGGCGGCTGC TACCTCAACC CCAACCCGCA AGGCTACGGC
GCCAACCCGC AGCTGCATTA CAATATCGGG GGCAACAAGG CGATCTGGAG CGGCTTCGAC
AATCCGCTGG CCGGCGGCCT GGGGGCGGGC AAGACCCTCA AGGACTACAT GGCCAACAAG
GACAGCTACG CGATCGCGGC GTTCTCCTCG GAAGGCAATA ACGAAATCGA CTCGGACATG
AACGTGTTCC GGGCCGAGGG TCACTACAAG TTCGACAACA AGTTCCTGGG CTTCATCACC
AAGATTGACG CCGGTGTCCG CCAGAGCGAC CGCACGGTCA ATGTCGAGGC CTTCCACCTG
TTCTCGCCGT TCTACGGCGG CACGCCGGGC GCGGTGCAGG CCAACGGCAC GCCGGTGCCG
GCCGCGGGCT GCTTCGCCCA GTGGAAAGCC ATCGACGTGG TCATGAACCA GGACCAGTGT
CAGGCTGGCG AGTTCGTGCC CAACCCCCTG GGCGGCGCTC CGGTGTTCCA GGGCTACACG
GTCAATCGTC CGACCAAGAT CGACGCCTAT AACAACGTGA TCTTCGAGAC CAACCTGGGC
AGCATCACCC AGGGCATCCC CGGCTTCTGG GTCGTCGATC CGCATGACTT CGACAACGTG
GCCGCCTTCC AGACCAAGGT GTTCGGCGCG GCGGTCCGCT ACCAGATCCC CGGCGCGACC
TACGACGTGA CCTTGAAGGA ACAAAGCGCC TACGGCGCGG CCAACTTCGA GATCGGCAAG
CTGTCGGGCA ACGTTGGCCT GCACGTCATC CAGAGCCACT TCCTGGTGAA GCAGAACCTG
ACGGGCGCCA CCCAGAACTA CGGCGACACC AACCTCGATG TCGGCGACAC GGTCACCAAC
CGCAAATACA CCGACTGGCT GCCGTCCTTG AACGCCGTCT ACGACTTCGA CGACCACCTG
AAGTTCCGCT TCGCCTATTC CAAGACCATG CAACCCTTGG ATCTGGGCAA CTACGGCGGC
GGCCTCAGCA TCTCGACCGC CGACTGCGGC ACCAAGCCCG GCCGTTGCGT GACGGGCGCC
AGCTCGTCGG GCAACCCCTA CCTGGATCCG TGGCGCTCGA CGAACTTCGA CGGCGCGATC
GAATACTACT TCGGCGGCGC CTCGATGGTG AACCTGTCGG CCTTCAAGCT GAAGATCGAC
AGCTTCGTCA CCGGCGGCAC CACCACCGGC ACGTTCAAGG ACGAGGACGG CACACCGCGC
ACGGTCAATG TCAGCCAGCC CATCCAGGGC GACGGCGGTT CGGTGAAGGG CGTCGAGGTC
GGCACGAAGC TGGCGTTCAG CGACCTGCTG ACGGATGGCG GCTTCTTCTC GAACTTCGGT
GTCGACGCCA ACGCCACCTA TTCGCCCAGC TCGGAGTCGC GCCTTGGTCT GGACGGCAAG
AAGCTGCCGT TCACCGACAA CTCCAAGTAC CAATACAATC TGATCGGCTG GTACCAGGAC
GACAAGTGGC AGGCCCGCGT GGCCTACAAT TATCGCACTG ACCGCCTGTC TAGCGTCTCG
GGAAGCTCGG GTAACAACCT GCCGATCTAT CAGGACGCCA CGGGCTTCGT GGACGTGAAC
CTCAGCTACA ACGTGCGCGA CAACATCGTC GTCTACGCGA ACGGCTCGAA TGTCACCGGC
GAGATCGAGA ATTACTACGT GAAGTTCGCG GACGGTAAGA CTCAGTACGC CAACCAGAAC
CAGTTCGAAC CCCGCTACAC CATCGGCATC CGCGCCAAGT GGTGA
 
Protein sequence
MSNPRNRSAL ARGMTLALLA GASSTALATG AWAQTAPDAG SQVEEVVVTG IRGSQMRSVD 
VKRREASIVD AISSEDIGKL PDVTIADSLQ RIPGIQIKRD AGEGATVNIR GLAQVITLLN
GEQYLSAGNM GSAQPNLLDV PSQLMNQVLV YKSTDPKNAL SGITGTIDLR TRRPFQMEQG
LQIAGGVEGQ RGERTKDNDY VINGLASWRN ERVGLLVSAA ASKANLGNNS SGVVGLSGNN
DWGGSAANNF ISPHGFESFH REVERKRLGV NVAFEADLGE GFTLVAEGFY AKLDEYNRAA
GINISNRWDG GAFGVWTTPT VFENTGQTSA GNGRPWVAVD EYDINAWWVN SFAVNRTTKS
TTKNYNLELK YDNGGAFTSE VRAIRANGNR LSMNGQAQGD LSNWQYAPGR FNLFRDPGDR
TRGPFYPASI CAQYPTSQRS NAVVGSAGGC YLNPNPQGYG ANPQLHYNIG GNKAIWSGFD
NPLAGGLGAG KTLKDYMANK DSYAIAAFSS EGNNEIDSDM NVFRAEGHYK FDNKFLGFIT
KIDAGVRQSD RTVNVEAFHL FSPFYGGTPG AVQANGTPVP AAGCFAQWKA IDVVMNQDQC
QAGEFVPNPL GGAPVFQGYT VNRPTKIDAY NNVIFETNLG SITQGIPGFW VVDPHDFDNV
AAFQTKVFGA AVRYQIPGAT YDVTLKEQSA YGAANFEIGK LSGNVGLHVI QSHFLVKQNL
TGATQNYGDT NLDVGDTVTN RKYTDWLPSL NAVYDFDDHL KFRFAYSKTM QPLDLGNYGG
GLSISTADCG TKPGRCVTGA SSSGNPYLDP WRSTNFDGAI EYYFGGASMV NLSAFKLKID
SFVTGGTTTG TFKDEDGTPR TVNVSQPIQG DGGSVKGVEV GTKLAFSDLL TDGGFFSNFG
VDANATYSPS SESRLGLDGK KLPFTDNSKY QYNLIGWYQD DKWQARVAYN YRTDRLSSVS
GSSGNNLPIY QDATGFVDVN LSYNVRDNIV VYANGSNVTG EIENYYVKFA DGKTQYANQN
QFEPRYTIGI RAKW