Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1850 |
Symbol | |
ID | 5899305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1970548 |
End bp | 1973655 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641562340 |
Product | TonB-dependent receptor |
Protein accession | YP_001683477 |
Protein GI | 167645814 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.922391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.182399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGAC ACCGCCGTTT AAAGACATCT GCGTCCATGC TGGCGCTTTC GACGGCGATC TTCGCCGCCG GGCAGGTGCA AGCTCAGACG AACAACGACA GCCAGGCCGT CGAGGAGGTT GTCGTCACCG GCATCCGGGC GTCCCTGGAG CGGTCGATCG AAATCAAGCG CACGAACAGC GGCGTCGTCG ACGCCATCTC GGCCGAAGAC ATCGGCAAGT TCCCGGACAC CAATCTGGCT GAATCGCTCC AGCGCATCAC CGGCGTGTCG ATCGACCGCA CCAACGGCGA AGGTTCGCAG GTCACGGTCC GGGGCTTTGG CGGCGGCTTC AACCTGGTGA CGCTGAACGG CCGCACCATG CCCACGGCCA ACGTCTCCAC CGTCGGCGGC GACCAATCGG CCGACTTCCA GACCGGCACC AGCCGTTCGT TCGACTTCTC GAACCTGGCT TCGGAAGGCG TCTCCACCCT GGAGGTCTAC AAGACCGGCC GCGCGGGTAT CCCGTCGGGC GGTATCGGCG CGACGATCAA CGTCAAGACC CGCCGCCCGT TCGACGCCCG CGAGCATGGC TTGAGCGGCA GCATCGGCGC CAAGGCCGTC TACGACACCA GCATGAAGGA GCGTCTCGAG GACGCCTCGA AGGTCACGCC GGAAGTCTCG GGCCTGCTGA ACTGGCTCGA TAGCGAGGAA AAGTTCGGCG TCACGCTGTT CGGCAGCTAT CAGAAGCGCA ATTTCACCAG CCGCAGCGCC ACCTCCAACG ACTGGAACAT CCGGACCTAT TCGGACTTCC TCAATCCGGC GAACGGCTTC GTGCGTAATG GCGGCGCCAC CCAGATCACC AACGCGCCTG CCAACGGCAG CACGCTGGTG GCGATCCCCA ACGACAGTCG CTACCACTTC TCGCAAGGCA ATCGCGAGCG GGTTAACGCC CAGGCGACAT TCCAATGGCG CCCGACCGAG AACGTCGTGA TCACCGCAGA CGGCCTGTAC GCCCAGAACA AGTCGTTCGA ACGCCGTAAC GACCAGACCA ACTGGTTCAA CCGCCCGTTC GACAAGGTGA CCTTCGACAA GAACCCCACC GTCGCCACGG CCGTCTTCCT GCAGGAAACC CTCAGCGGGA CGAAGGACAC GGGCTTCGAG CAGCAGTATC GCGCCAATGA AGACAAGCTG CAGTCGTTCG GCCTGAACGG CGTCTGGGAC GTCACCGACC GCTTCAAGGT CTCGTTGGAC GGCCACGTGT CCAAGGCCGA GAGCAACCCC GACGCGCCAA ATGGCACCAG CTCGACCTCG GTCAGCATTG GCGCACCGAT CATCTCGTCG CACTCGGTGG ACTACAGCGG CAAGATTCCG GTCCAGGCCG TGACGATCAA CGACGCCCTG CCGCGCGGCA ACGGCAACGG CAAGCTCGAC ATCGGCGACC TGGGCAGCCA GGTGGCCCGC ACCTCGGCCC AGAGCCAAGA CCAGAAGATC AAGGAATTCC GCGCCGACGC CTCCTATGAG CTGGATGACA ACGGCAGCAA GTTCGACTTC GGCTTCGACT ACCGCGCCTC GAAGATGAAG CAGGCCAGCA TCAACACCCA GCAGGACCTG GGCAGCTGGG GCATCTCGAA CCCCCGCGAC GTGCAGCAGT ACGCTGGCAG TCTGGTCAAG GAGTTCTGCA TGAGCTGTCG GTTCGACAGC TTCGATCCCA AGCAAAGCGG CGTCGGTCTG GTCGCCTTCC GAGCCAACGC GATCGACCTC TACAACGCCC TGTCGGCGCC TTATGTCGCG CTCGGCAACA AGGTGGGCGT CACCAGCCAG GACGACAACC GCGTCAACGA GGACGTCTGG GGCGTCTACG GCCAGTTGAC CTGGAAGGGC GAACTGGCCG GCCGCGAAGC CAGCATGGTC GCCGGCGCGC GCTACGAGGA AACAAAGTCC AAGTCCGTGT CGCTGATCCG GACGCCCCAG GCCATCGTCT GGACCGCCGA CAACGACTTC CGCGTCGATA CCGCGACGAC CTACTCGCCC ATCTCGGGCA AAAACAAGTA TAGCAATCTG CTGCCCGCTC TGGACTTCCA GGTGGAGCTG GCCAAGGACC TGTTGGGGCG CTTCTCGTTC AGCCGCACGA TCGCGCGGCC GGACTACGGC AACCTGTTCG CCTCGGTCTC GGCCCAGGCG CCGAACCGCC CGATCGCCAA CGGGGCCATT CCTCTTGGCA CGCGCGGCAA TCCGGAATTG CAGCCGCTGA TCTCGGACAA CTTCGACGTC TCGATCGAGT GGTACTACAA GCCCAGCAGC TATGTTTCGG CCGGCTTCTT CGAGAAGCGC GTGAACAACT TCGTGGGCAC GGGCACCTTC AACCAGAGCC TGTTTGGCCT TCGTGACCCC AGCTCCGGCG CCGCCGGAAC GCGGTCGGGC GACGCTCGCG CCTCTTTGAC CACGATCGGC GCCAACCAGA CCGACGTCAA TCTGTTCACG ATGACCTCGT TGATCCAGAC CACGGGTTCG GTGGCGGCCG CCACGGCCGT CTTCCAGGCC AACCGCGGTC CTTCAGGTGA CCTCAACCAA GCCTTCGTCG ACCAAGTTCT GGCCGCGACG GACGTCTCGC CCACCGCCGC CGATCCGCTG TTCAACTTCC AGGTCGCCCA GCCGATCAAC AACAAGACCG GCAAGATCCA CGGCTTCGAA ATCGCCGCCC AGCACTTCTT CGGCGATACC GGCTTCGGGC TCTCGGGCGC ATACACCCTG GTGCGCGGCG ATGTGGGCTT CGATATCGCT TCCGACCCGG GCCAGGACCA GTTCGCCTTG CTGGGTCTGA GCGACACGGC CAACGCGACC CTGATCTACG AGAAGAACGG GCTCTCGGCC CGCGTGGCCT ACAACTGGCG CGATAAGTTC CTGCAAGCCA CCAACCGCGG CGGCTCGCGC AACCCGGTGT TCGTCGCCCC GTTCGGCCAG GTGGACTTCA ACGTCAGCTA CGACGTGACC TCCAGCCTGG CGATCTCGCT GGAAGGCATC AACCTGACCA AGGAGAACCT GCGCACCTAC GCTCGCGATG AAAACGAGCT GTGGTACGCG CAAGAACTGG ACCGGCGCTT CCTGCTCGGA GCGCGCTATC GCTTCTAG
|
Protein sequence | MMRHRRLKTS ASMLALSTAI FAAGQVQAQT NNDSQAVEEV VVTGIRASLE RSIEIKRTNS GVVDAISAED IGKFPDTNLA ESLQRITGVS IDRTNGEGSQ VTVRGFGGGF NLVTLNGRTM PTANVSTVGG DQSADFQTGT SRSFDFSNLA SEGVSTLEVY KTGRAGIPSG GIGATINVKT RRPFDAREHG LSGSIGAKAV YDTSMKERLE DASKVTPEVS GLLNWLDSEE KFGVTLFGSY QKRNFTSRSA TSNDWNIRTY SDFLNPANGF VRNGGATQIT NAPANGSTLV AIPNDSRYHF SQGNRERVNA QATFQWRPTE NVVITADGLY AQNKSFERRN DQTNWFNRPF DKVTFDKNPT VATAVFLQET LSGTKDTGFE QQYRANEDKL QSFGLNGVWD VTDRFKVSLD GHVSKAESNP DAPNGTSSTS VSIGAPIISS HSVDYSGKIP VQAVTINDAL PRGNGNGKLD IGDLGSQVAR TSAQSQDQKI KEFRADASYE LDDNGSKFDF GFDYRASKMK QASINTQQDL GSWGISNPRD VQQYAGSLVK EFCMSCRFDS FDPKQSGVGL VAFRANAIDL YNALSAPYVA LGNKVGVTSQ DDNRVNEDVW GVYGQLTWKG ELAGREASMV AGARYEETKS KSVSLIRTPQ AIVWTADNDF RVDTATTYSP ISGKNKYSNL LPALDFQVEL AKDLLGRFSF SRTIARPDYG NLFASVSAQA PNRPIANGAI PLGTRGNPEL QPLISDNFDV SIEWYYKPSS YVSAGFFEKR VNNFVGTGTF NQSLFGLRDP SSGAAGTRSG DARASLTTIG ANQTDVNLFT MTSLIQTTGS VAAATAVFQA NRGPSGDLNQ AFVDQVLAAT DVSPTAADPL FNFQVAQPIN NKTGKIHGFE IAAQHFFGDT GFGLSGAYTL VRGDVGFDIA SDPGQDQFAL LGLSDTANAT LIYEKNGLSA RVAYNWRDKF LQATNRGGSR NPVFVAPFGQ VDFNVSYDVT SSLAISLEGI NLTKENLRTY ARDENELWYA QELDRRFLLG ARYRF
|
| |