Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0572 |
Symbol | |
ID | 5898027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 621332 |
End bp | 623608 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641561054 |
Product | TonB-dependent receptor |
Protein accession | YP_001682203 |
Protein GI | 167644540 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCACC GACAACTTGG CCTCACCGTC TCACGTCTGG CGATTTGCGC AGGCCTGATG GGATCATTCT TTGCAGCGGA GGCCCGCGCC CAGACAGAAC CTGGCGCCGT CGCCCACGGC GCCGACCAGC AACCCGCCGC CATCGGACTT GAAGAGATCG TTGTCACCGC GACGCGCCGC GAGACTTCAC TCCAAAAAAC AGGACAAGCG ATCTCGGTCG TCTCGGGGGC GGAGCAACTG CTGACGGGCC GTCAGGTCCT GGATGACCTG AAAACCTCAA TGCCCAACGT CAACTTCGCG TCGACCAGCA ATACGACGCA GATGTTCGTT CGCGGCATCG GCAACACGTT CCTGACGGCC GGCGGCGATC CGGGCGTCGC CCTCTATCAG GACACCGCCT ATATCTCGGA CCAAACCACC TCCAGCGTCA GTTTCTTCGA TGTGGAGCGT GTGGAAGTCC TGCGAGGACC GCAGGGCGGG CTCTATGGTC GTAACGCCAC GGGCGGCGCC ATCAATATCA TTTCGGCCAA GCCGACGTCC GAGATGACCG GTCGGCTTTC CGTGCTCGCC GGCGACTATG GTCGGCTGGA GAGCGAGGGC TTCGTCTCGG GGCCGCTGGG CTTTGCCAAC ACCGACTTTC GTCTTTCATA CCAGCTTCAT CGATTGGATG GCTTTGTCCG CAACATCTAT GAGCCGACCG TCGGCGCTGC GGGTTTCGCC GCTGCGCCGG ACCGGCTCGA TGACATGAAT TCCGATGCGG TGCGCCTTCA GACGGCGACG CAATTGGGTT CGGGCGGCGT CTTGCGTGTC ATCGTCACCC ACTATCGTGA AACGGACAAT GGCGCCGCCC TCGCGGTCGT GCCCGACTCT GGCTTCATCT ATCCGGCCCA GCATCTCTAC GGTCTTGTTC CAACCAGCGA TCCGCGCAAC ATCACCGTCG ATGAGGGCTA CAACAAGATC CGCCTGACCA ATTACAACGC CAACCTCGAT CAACCGATCG CGGACGGTAT GCTGACGGTC ACCGGCAACT ATCGCCGCTC CCACCGGGAC TTTCTCAACG ACTGCGACGG GACGTCCGCC AATAGCTGTT CGTTCCGGGT CGAAACCTCG AGCGACGACT ATTTCGGCGA TATTCACTAC GCCTCGTCAA ACGAAGGCCC GTTCCGTTAT ATTGTCGGCG CGACCTACAC GCACTTCATC CAAGATCAAA TCGCCACCAT ACCGTTTGAA TTTCCGACGT TTTATCTGAC GGGCAACCCG GCCGATCGAG CAGCGTTTGA TTTTCCGACG GCGTCGGGCG GAAGGCTCCG GACGAACGCG TGGGCGGCGT ATGCGGACGC GCGATACGCC TTGTCGGACG TCTGGTCGCT GATCGGACAA GTCCGTTATA GCCGCGCGAG GAAAAAGGCG CTCGAAACCC TGAAGTTGCC CGCCTTCGGT CTTGTCGTCG TCGACTCGCC CAATCGCGCC AGTGACTCCG GCGTACCCTT CAAGGTCGGT GTGGAAGGAC AACTGACGAA CGATGTCCTG GTCTATGGCA ACTTCTCCAC CGGGTTGAAG GACGCGGCAA TAAACCTTGG CACCCTGCAA ACGGCGCCGG TGGCCAAGGA AACGGTTCAA AGCGTCGAGG TTGGCTTCAA ATCAAGCTTG TTTGACCGCC GGCTGCGGAT CAACGGCGCG GCCTTCAACA GCGACTATAA GAACTTGCAG ATCTCCCAGC TCAAGGGGAC CTTGGCCACG CTCGCCAACG CCCCGAAAGC TCAGATCAGC GGCGCCGAAC TGGAGGTTAC CGCAGAGCCG GTGTCGGGGC TGCGATTCAA TGGCAGCGTC GGCTATCTCG ATCCCAAGCT CAAGCGGTTC GAAAACACGC CGAACCTGCC GGCCGATTTC GTTCCCCAGC CTGTCCTCGT GTCGTTGGAT GGAAATCAGC TGCCCTATGT CGCGAAGTGG AACGTCACCC TGGGCGCAAC CTACAGGTTC GAACCTACCG CGGGCATCGC CGTGGAGCTG GGCGGAAACT ACTACTACCA GAGCCGGATC TACTTCAACG AATTCAATGC ATTGAGAAAT TCTCAAAAGC CGGTCGGGCG GGTCGATCTG TCGGCTTCGA TCGGGCCGTC CAATGACCAA TGGAAAGTCT ATGGCTACAT CCGCAACCTG ACCGATGAAA CGGTGCTGGC CGGAACAACG ATCTATGCGG GACTGCTGGG GGCTGAAAAG GGTGTGTCCT ATGCGCCTCC CCGCAACTTC GGCATCGGAT TCTCCTACAA TTTCTGA
|
Protein sequence | MSHRQLGLTV SRLAICAGLM GSFFAAEARA QTEPGAVAHG ADQQPAAIGL EEIVVTATRR ETSLQKTGQA ISVVSGAEQL LTGRQVLDDL KTSMPNVNFA STSNTTQMFV RGIGNTFLTA GGDPGVALYQ DTAYISDQTT SSVSFFDVER VEVLRGPQGG LYGRNATGGA INIISAKPTS EMTGRLSVLA GDYGRLESEG FVSGPLGFAN TDFRLSYQLH RLDGFVRNIY EPTVGAAGFA AAPDRLDDMN SDAVRLQTAT QLGSGGVLRV IVTHYRETDN GAALAVVPDS GFIYPAQHLY GLVPTSDPRN ITVDEGYNKI RLTNYNANLD QPIADGMLTV TGNYRRSHRD FLNDCDGTSA NSCSFRVETS SDDYFGDIHY ASSNEGPFRY IVGATYTHFI QDQIATIPFE FPTFYLTGNP ADRAAFDFPT ASGGRLRTNA WAAYADARYA LSDVWSLIGQ VRYSRARKKA LETLKLPAFG LVVVDSPNRA SDSGVPFKVG VEGQLTNDVL VYGNFSTGLK DAAINLGTLQ TAPVAKETVQ SVEVGFKSSL FDRRLRINGA AFNSDYKNLQ ISQLKGTLAT LANAPKAQIS GAELEVTAEP VSGLRFNGSV GYLDPKLKRF ENTPNLPADF VPQPVLVSLD GNQLPYVAKW NVTLGATYRF EPTAGIAVEL GGNYYYQSRI YFNEFNALRN SQKPVGRVDL SASIGPSNDQ WKVYGYIRNL TDETVLAGTT IYAGLLGAEK GVSYAPPRNF GIGFSYNF
|
| |