Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1319 |
Symbol | |
ID | 5898774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1394550 |
End bp | 1397750 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641561804 |
Product | TonB-dependent receptor plug |
Protein accession | YP_001682947 |
Protein GI | 167645284 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0274612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATGA CTTCCACCAA GGGCGGCGCT CGCGCGCGTC TTTTGACGTC GACGCTGCTG GCTGGCCTGG CCACCGTCGC CGCGCCGCTG GCGATCACGG CGATCGCCAC GGCGATCCCG ACCCTGGCCT CGGCGCAGGA CTACACGAGC GGCACCCTCG TCGGCACCGT CCGTGACGCC AGCGGCGCTC CGGTCAGCGG CGCCGCCGTC ACCGTCAAGT CGCTGGGTCA AGGCTTCACC CGTCAACTGG TCACCGGCAG CGACGGTCAG TTCCGCGTGC CGCTGGTGCC GCAAGGCGGC TATTCGGTCG CGATCTCCAA GGAGGGCTTC CAGCCCACGA GCGACGGCGC CGTCGCCGTG CGTTCGGGCG GCGACAGCGC CTACAGCTTC ACCCTCTCGT CGGCTGACGC GTCGGTTTCG GAAGTCGTGG TCACCGCCAC CGCCAATCCG CAACTCGACT TCGGCGGCAC CACCACCGGC CTGTCGGTCG ACCTGGAAAC CCTGACCAAG CAGGTTCCCG TCAACCGCAC CATCACCAGC GTCGTTCTGC TGGCGCCCGG CGCGGTTCAG GGCAGCAACA CCAACTTCCG TGGCCAGCCT TCCATCGGCG GTTCGTCGGT CGCTGAAAAC GCGTTCTACG TGAACGGCCT GAACATCACG AACTTCGACA ACTACCTGGG CGGCTCGACC GTCCCGTTCG ACTTCTACAA GTCGGTGGAC GTGAAGACCG GCGGCTATCA AGCCGAATTC GGCCGTTCGA CCGGCGGCAT CGTCAACGCC GTCACCAAGG CCGGCACCAA CGAGTTCAAG TTCGCCGTCC GCGGCCAATG GGAACCCGAC AGCCTGCAAG AAGACCAGAA GGACACCTTC CTGCGTCGCG GCAAGCTGGC CAAGACCGAC AACAAGTCGC TGACCCTGGA AGCCGGCGGT CCGATCATCC CCGATCGCCT GTTCTTCTTC GCCATGACGC AGATGCGCGA CAACCAAACG ACGTTTGGCA GCATCACGGG CGGCAGCTAC AATAAAGAAA CCCAGCGCGA CCCCTTCTAT GGCCTGAAGC TGGACGGCTA CATCACCGAC CGCCAGCACC TCGAATTCAC CTATTTCGAC ACCAAGGGTT CGGCCAAGCG CAGCACTCGG CAATACGAGT TCGACGACAC CACCGGCACC GACACCTTCG GCGACAAGCT GGGCGGCACC CTGTTCTCGC TCGGCGGCGC AAACTATGTC GGCAAGTACA CCGGCACGTT CACCGACTGG TTCACCCTGT CGGCGGCCTA CGGCGTCACC AAGGACAGCT ATCGCGTCAC CCCGCAGGAT CTGTCGGGCA ACTACGTAAC GAACACGGCT GATCCCGCCC ACCCTGGCGA GACTTCGGTC ATCAGCCGTC AAAAGACGTC GTCCTATGAC TCGAGCTACG AAACCAAGCG CGAGTTCTAC CGCATCGACG CCGACTTCTA CTTCGACCTG CTGGGCAAGC ACCACATTCG CGCTGGCTAC GACCAAGAAG ACCTGACCCT GGATCACGTG AACCAGTATC CGGGCGCCGG CACCGATTGG GACTTCCTCC TGGCGGGTGC GACTGACGCG CGTGGCGTGG CTGCGGGGCA GACCTACGTC AAGGGCCGCA CGTTCAAGAC CGGCGGCATC TTTGAAGGCA CCAACAAGGC CTATTACATC CAAGACTCGT GGGACATTCT GTCCAACCTG ACCCTGAACC TGGGCATCCG CAAGGACCAG TTCCAGAACA GCGGCGCTCG CACCGCCAAG GGCAGCGAAA CCTTCGTCGA GTTCGACAAC GAAATCGGTC CGCGGATCGG CTTCACCTTC GACCCGTTCA GCACCGGCAA CGACAAGATC TTCGGTAACT TCGGTCGCTA CTACCTGCCG GTCGCGTCGA ACACCGCGTT CCGTCAAGCC ACGGCCAGCT ACGACATCGA CACCTTCTTC ACCGCGCCCC AGGGCGTGGC GCTGGGCGCC GATGGCACGC CGATCCGTGG CACGCAGATC ACTCAGACGA CCAACCCGGG CTTCGCCTCG GCGGCCGCCT GTCCGGCTCC GGTGGCTGGC GTGACCCCGC CCGGCGCCAC CGACGCGGTC GGCTGCGCCG TCCGTGGCGA CGGTTCGCTG CAACCCTTCG CCGCGAACAC CAGCAAGAAC CTGAAGTCCA CCCAGGAAGA CGAATACATC CTGGGTTACG AGCACCAGTT CAACTCGCTG TGGAAGGCCA GCGCCACGCT GACCTATCGC AACCTGAACC GGGTTTCGGA AGACGTCGCC ATCGACGCCG CGGTGCGCAA CTACTGTGTC AAGAACGGCA TCGCCGGCTG CGGCTCGACC TACAATGTCG CCGGTCCGAC CCCGGGCTGC ACCACCTTCT CGGCCGGTCC GCGCGCCGGG CAGACGCGTT GCGCGGGCTT CTCGGGCTTC CGTCAGTACA CGATCGTCAA CCCGGGCGAG GCCTCGACCA TCACCCTGCG GCAGCCGCTG CCGGGTGAAG CCACGGCTCG CACCATCAGC TTCTCGAAGG CTGACCTGGG CTATCCGACC GTGAAGCGCG AATATGTCGG TCTGGAAATG AAGGTCGAAC GCGCCTTCGA CGGCAAGTGG GGCTTCCAGG GCTCGTACGT CCTGGCGGAA TCGAAGGGCA ACTACGAAGG CTTCGTGAAG TCGGACGCCG GCAACGGTCA AACCGACTCG GGCATCACCC AGGACTTCGA CCAGGTGAGC CTGACCGACG GCGCCTACGG CCTGCTGCCG AACCACCACG CCCACCAGTT CAAGCTGTTC GGTTCCTATG CGATCACCGA CAACCTGCTG GTCGGCGGCA ACGCCCTGGT CCTGTCGCCC AAGCATTACG GCTGTATCGG TCTTCACCCG ACCGACGACA TCGTGAACTC GGGCTACGGC GTGGCGTCCT TCGCCTGCGG CGGCAAGATC GTTCCGCGCG GTTCGGCCTT CGAAACGCCC TGGACGGCTC GTCTGGATAT CGCGGTGCGT TATCTGGTGC CGACCACTAA GTTCATCCCG GGTGGCCTGA CCCTGCGCGC CGACATCAGC AACATCCTGA ATTCTCGGAC CGAGACCGAA GCTTGGGAAT TTGGCGACAG CGACGCGGGT GGTGCGGACG AGCACTACAA GGACCCGATC CAATACCAAG CGCCCCGTTC GGTGCGTCTG GGCTTCGACT GGGAGTTCTA G
|
Protein sequence | MKMTSTKGGA RARLLTSTLL AGLATVAAPL AITAIATAIP TLASAQDYTS GTLVGTVRDA SGAPVSGAAV TVKSLGQGFT RQLVTGSDGQ FRVPLVPQGG YSVAISKEGF QPTSDGAVAV RSGGDSAYSF TLSSADASVS EVVVTATANP QLDFGGTTTG LSVDLETLTK QVPVNRTITS VVLLAPGAVQ GSNTNFRGQP SIGGSSVAEN AFYVNGLNIT NFDNYLGGST VPFDFYKSVD VKTGGYQAEF GRSTGGIVNA VTKAGTNEFK FAVRGQWEPD SLQEDQKDTF LRRGKLAKTD NKSLTLEAGG PIIPDRLFFF AMTQMRDNQT TFGSITGGSY NKETQRDPFY GLKLDGYITD RQHLEFTYFD TKGSAKRSTR QYEFDDTTGT DTFGDKLGGT LFSLGGANYV GKYTGTFTDW FTLSAAYGVT KDSYRVTPQD LSGNYVTNTA DPAHPGETSV ISRQKTSSYD SSYETKREFY RIDADFYFDL LGKHHIRAGY DQEDLTLDHV NQYPGAGTDW DFLLAGATDA RGVAAGQTYV KGRTFKTGGI FEGTNKAYYI QDSWDILSNL TLNLGIRKDQ FQNSGARTAK GSETFVEFDN EIGPRIGFTF DPFSTGNDKI FGNFGRYYLP VASNTAFRQA TASYDIDTFF TAPQGVALGA DGTPIRGTQI TQTTNPGFAS AAACPAPVAG VTPPGATDAV GCAVRGDGSL QPFAANTSKN LKSTQEDEYI LGYEHQFNSL WKASATLTYR NLNRVSEDVA IDAAVRNYCV KNGIAGCGST YNVAGPTPGC TTFSAGPRAG QTRCAGFSGF RQYTIVNPGE ASTITLRQPL PGEATARTIS FSKADLGYPT VKREYVGLEM KVERAFDGKW GFQGSYVLAE SKGNYEGFVK SDAGNGQTDS GITQDFDQVS LTDGAYGLLP NHHAHQFKLF GSYAITDNLL VGGNALVLSP KHYGCIGLHP TDDIVNSGYG VASFACGGKI VPRGSAFETP WTARLDIAVR YLVPTTKFIP GGLTLRADIS NILNSRTETE AWEFGDSDAG GADEHYKDPI QYQAPRSVRL GFDWEF
|
| |