Gene Caul_1850 details


Gene Information

Locus tag: Caul_1850
Symbol:
ID: 5899305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1970548
End bp: 1973655
Gene Length: 3108 bp
Protein Length: 1035 aa
Translation table: 11
GC content: 64%
IMG OID: 641562340
Product: TonB-dependent receptor
Protein accession: YP_001683477
Protein GI: 167645814
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID: [TIGR01782] TonB-dependent receptor
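
The start, end, and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1970548
END_BP = 1973655
GENE_LENGTH_BP = 3108
PROTEIN_LENGTH_AA = 1035


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1973655 - 1970548 + 1 gives the listed 3108 bp, and 3108 / 3 - 1 gives the listed 1035 aa.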


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.922391
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.182399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGAC ACCGCCGTTT AAAGACATCT GCGTCCATGC TGGCGCTTTC GACGGCGATC 
TTCGCCGCCG GGCAGGTGCA AGCTCAGACG AACAACGACA GCCAGGCCGT CGAGGAGGTT
GTCGTCACCG GCATCCGGGC GTCCCTGGAG CGGTCGATCG AAATCAAGCG CACGAACAGC
GGCGTCGTCG ACGCCATCTC GGCCGAAGAC ATCGGCAAGT TCCCGGACAC CAATCTGGCT
GAATCGCTCC AGCGCATCAC CGGCGTGTCG ATCGACCGCA CCAACGGCGA AGGTTCGCAG
GTCACGGTCC GGGGCTTTGG CGGCGGCTTC AACCTGGTGA CGCTGAACGG CCGCACCATG
CCCACGGCCA ACGTCTCCAC CGTCGGCGGC GACCAATCGG CCGACTTCCA GACCGGCACC
AGCCGTTCGT TCGACTTCTC GAACCTGGCT TCGGAAGGCG TCTCCACCCT GGAGGTCTAC
AAGACCGGCC GCGCGGGTAT CCCGTCGGGC GGTATCGGCG CGACGATCAA CGTCAAGACC
CGCCGCCCGT TCGACGCCCG CGAGCATGGC TTGAGCGGCA GCATCGGCGC CAAGGCCGTC
TACGACACCA GCATGAAGGA GCGTCTCGAG GACGCCTCGA AGGTCACGCC GGAAGTCTCG
GGCCTGCTGA ACTGGCTCGA TAGCGAGGAA AAGTTCGGCG TCACGCTGTT CGGCAGCTAT
CAGAAGCGCA ATTTCACCAG CCGCAGCGCC ACCTCCAACG ACTGGAACAT CCGGACCTAT
TCGGACTTCC TCAATCCGGC GAACGGCTTC GTGCGTAATG GCGGCGCCAC CCAGATCACC
AACGCGCCTG CCAACGGCAG CACGCTGGTG GCGATCCCCA ACGACAGTCG CTACCACTTC
TCGCAAGGCA ATCGCGAGCG GGTTAACGCC CAGGCGACAT TCCAATGGCG CCCGACCGAG
AACGTCGTGA TCACCGCAGA CGGCCTGTAC GCCCAGAACA AGTCGTTCGA ACGCCGTAAC
GACCAGACCA ACTGGTTCAA CCGCCCGTTC GACAAGGTGA CCTTCGACAA GAACCCCACC
GTCGCCACGG CCGTCTTCCT GCAGGAAACC CTCAGCGGGA CGAAGGACAC GGGCTTCGAG
CAGCAGTATC GCGCCAATGA AGACAAGCTG CAGTCGTTCG GCCTGAACGG CGTCTGGGAC
GTCACCGACC GCTTCAAGGT CTCGTTGGAC GGCCACGTGT CCAAGGCCGA GAGCAACCCC
GACGCGCCAA ATGGCACCAG CTCGACCTCG GTCAGCATTG GCGCACCGAT CATCTCGTCG
CACTCGGTGG ACTACAGCGG CAAGATTCCG GTCCAGGCCG TGACGATCAA CGACGCCCTG
CCGCGCGGCA ACGGCAACGG CAAGCTCGAC ATCGGCGACC TGGGCAGCCA GGTGGCCCGC
ACCTCGGCCC AGAGCCAAGA CCAGAAGATC AAGGAATTCC GCGCCGACGC CTCCTATGAG
CTGGATGACA ACGGCAGCAA GTTCGACTTC GGCTTCGACT ACCGCGCCTC GAAGATGAAG
CAGGCCAGCA TCAACACCCA GCAGGACCTG GGCAGCTGGG GCATCTCGAA CCCCCGCGAC
GTGCAGCAGT ACGCTGGCAG TCTGGTCAAG GAGTTCTGCA TGAGCTGTCG GTTCGACAGC
TTCGATCCCA AGCAAAGCGG CGTCGGTCTG GTCGCCTTCC GAGCCAACGC GATCGACCTC
TACAACGCCC TGTCGGCGCC TTATGTCGCG CTCGGCAACA AGGTGGGCGT CACCAGCCAG
GACGACAACC GCGTCAACGA GGACGTCTGG GGCGTCTACG GCCAGTTGAC CTGGAAGGGC
GAACTGGCCG GCCGCGAAGC CAGCATGGTC GCCGGCGCGC GCTACGAGGA AACAAAGTCC
AAGTCCGTGT CGCTGATCCG GACGCCCCAG GCCATCGTCT GGACCGCCGA CAACGACTTC
CGCGTCGATA CCGCGACGAC CTACTCGCCC ATCTCGGGCA AAAACAAGTA TAGCAATCTG
CTGCCCGCTC TGGACTTCCA GGTGGAGCTG GCCAAGGACC TGTTGGGGCG CTTCTCGTTC
AGCCGCACGA TCGCGCGGCC GGACTACGGC AACCTGTTCG CCTCGGTCTC GGCCCAGGCG
CCGAACCGCC CGATCGCCAA CGGGGCCATT CCTCTTGGCA CGCGCGGCAA TCCGGAATTG
CAGCCGCTGA TCTCGGACAA CTTCGACGTC TCGATCGAGT GGTACTACAA GCCCAGCAGC
TATGTTTCGG CCGGCTTCTT CGAGAAGCGC GTGAACAACT TCGTGGGCAC GGGCACCTTC
AACCAGAGCC TGTTTGGCCT TCGTGACCCC AGCTCCGGCG CCGCCGGAAC GCGGTCGGGC
GACGCTCGCG CCTCTTTGAC CACGATCGGC GCCAACCAGA CCGACGTCAA TCTGTTCACG
ATGACCTCGT TGATCCAGAC CACGGGTTCG GTGGCGGCCG CCACGGCCGT CTTCCAGGCC
AACCGCGGTC CTTCAGGTGA CCTCAACCAA GCCTTCGTCG ACCAAGTTCT GGCCGCGACG
GACGTCTCGC CCACCGCCGC CGATCCGCTG TTCAACTTCC AGGTCGCCCA GCCGATCAAC
AACAAGACCG GCAAGATCCA CGGCTTCGAA ATCGCCGCCC AGCACTTCTT CGGCGATACC
GGCTTCGGGC TCTCGGGCGC ATACACCCTG GTGCGCGGCG ATGTGGGCTT CGATATCGCT
TCCGACCCGG GCCAGGACCA GTTCGCCTTG CTGGGTCTGA GCGACACGGC CAACGCGACC
CTGATCTACG AGAAGAACGG GCTCTCGGCC CGCGTGGCCT ACAACTGGCG CGATAAGTTC
CTGCAAGCCA CCAACCGCGG CGGCTCGCGC AACCCGGTGT TCGTCGCCCC GTTCGGCCAG
GTGGACTTCA ACGTCAGCTA CGACGTGACC TCCAGCCTGG CGATCTCGCT GGAAGGCATC
AACCTGACCA AGGAGAACCT GCGCACCTAC GCTCGCGATG AAAACGAGCT GTGGTACGCG
CAAGAACTGG ACCGGCGCTT CCTGCTCGGA GCGCGCTATC GCTTCTAG
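
The GC content field is presumably the fraction of G and C bases over the whole gene sequence. The sketch below shows one way to compute such a fraction from the space-separated 10-base blocks used above; `gc_content` is an illustrative helper, and the exact rounding used by the source database is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small demo input.
    first_blocks = "ATGATGCGAC ACCGCCGTTT"
    print(f"{gc_content(first_blocks):.0%}")
```

Passing the full gene sequence, with line breaks and spaces left in, gives the whole-gene value to compare against the listed percentage.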
 
Protein sequence
MMRHRRLKTS ASMLALSTAI FAAGQVQAQT NNDSQAVEEV VVTGIRASLE RSIEIKRTNS 
GVVDAISAED IGKFPDTNLA ESLQRITGVS IDRTNGEGSQ VTVRGFGGGF NLVTLNGRTM
PTANVSTVGG DQSADFQTGT SRSFDFSNLA SEGVSTLEVY KTGRAGIPSG GIGATINVKT
RRPFDAREHG LSGSIGAKAV YDTSMKERLE DASKVTPEVS GLLNWLDSEE KFGVTLFGSY
QKRNFTSRSA TSNDWNIRTY SDFLNPANGF VRNGGATQIT NAPANGSTLV AIPNDSRYHF
SQGNRERVNA QATFQWRPTE NVVITADGLY AQNKSFERRN DQTNWFNRPF DKVTFDKNPT
VATAVFLQET LSGTKDTGFE QQYRANEDKL QSFGLNGVWD VTDRFKVSLD GHVSKAESNP
DAPNGTSSTS VSIGAPIISS HSVDYSGKIP VQAVTINDAL PRGNGNGKLD IGDLGSQVAR
TSAQSQDQKI KEFRADASYE LDDNGSKFDF GFDYRASKMK QASINTQQDL GSWGISNPRD
VQQYAGSLVK EFCMSCRFDS FDPKQSGVGL VAFRANAIDL YNALSAPYVA LGNKVGVTSQ
DDNRVNEDVW GVYGQLTWKG ELAGREASMV AGARYEETKS KSVSLIRTPQ AIVWTADNDF
RVDTATTYSP ISGKNKYSNL LPALDFQVEL AKDLLGRFSF SRTIARPDYG NLFASVSAQA
PNRPIANGAI PLGTRGNPEL QPLISDNFDV SIEWYYKPSS YVSAGFFEKR VNNFVGTGTF
NQSLFGLRDP SSGAAGTRSG DARASLTTIG ANQTDVNLFT MTSLIQTTGS VAAATAVFQA
NRGPSGDLNQ AFVDQVLAAT DVSPTAADPL FNFQVAQPIN NKTGKIHGFE IAAQHFFGDT
GFGLSGAYTL VRGDVGFDIA SDPGQDQFAL LGLSDTANAT LIYEKNGLSA RVAYNWRDKF
LQATNRGGSR NPVFVAPFGQ VDFNVSYDVT SSLAISLEGI NLTKENLRTY ARDENELWYA
QELDRRFLLG ARYRF
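
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the forward reading frame starts at the first base and that translation stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; because this gene begins with ATG, the sketch does not handle alternative starts. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGATGCGAC ACCGCCGTTT AAAGACATCT"))
```

The first ten codons of the gene translate to MMRHRRLKTS, which matches the first block of the protein sequence.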