Gene Information |
Locus tag | Caul_1790 |
Symbol | |
ID | 5899245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1891092 |
End bp | 1893542 |
Gene Length | 2451 bp |
Protein Length | 816 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562280 |
Product | TonB-dependent receptor |
Protein accession | YP_001683417 |
Protein GI | 167645754 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
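The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon while Protein Length does not. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1891092
END_BP = 1893542


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2451
print(protein_length_aa(length))  # 816
```

Under these assumptions the computed values agree with the Gene Length (2451 bp) and Protein Length (816 aa) rows.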
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0607055 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAACC AACGGCTTTT GATGGCGGGA TCTTCCGTCG CTGTTCTGCT CGCCCTGTCC AGCCAGGCCT TCGCGGCCGA CCCTGTCGCG GCCGATACGT CGGTGGCCGT CGATGAGATC GTGGTCAAGG CGCGGGACAA GGCGGGTCTG CTGGAGACCC GGCCCAACAA CACGGTGTTC GGGCTGGACA AACCGCTGCT GGAGACGCCG CGCTCGGCCA GCTTCGTCAG CGACACCACC CTGCAACGCT ACGGCATCGA GACGATCGAC GGCCTGACCG CCGTCTCGCC CGGCACCTAC ACCGCCAGCT TCTACGGCGT GCCTGGCGCG CTGAACATCC GCGGCACCCT GGCCGAGAAC TATTTCCGCG GCTTCAAGCG CATCGAGAAC CGCGGCACCT ATTCGACCCC GATCGGCGCG GCCGACCAGA TCCAGATCGT CCGCGGCCCG CCGACCCCGA TCTACGGCTC GGGCAAGGTG GGCGGCATGC TCAACTTCAT TCCCAAGTCA GGGAAGAACG AGGGCGGCTA TCTGTCGGAA CCGACCGGCG AGGTGACCGC CACCTACGGC TCGTACAACA AGAAGAACGC CACCGCGCAG CTGGGCCTGC CGGTGAACTT CGGCCCCGTG ACCGGCGGCG TCTACGCCTA TGGCGAGGTC GAGGACAGCC ACAGCTTCTA CAAGGGCGTC TATCCGCGTC GACAGACCGG CGAGATCTCG GCCGACTTCG ACCTGGGCAA CGGCTGGAGC ACGGCGTTCG GCGGGATGAA GTACCACTCG GACGGCGACG TGCAGACGCC GGGCTGGAAC CGCCTGACCC AGAGCCTGAT CGACACCGGG ACCTACACCA CCGGCCGCGA CACCACCCTG GTCGACAGCG ACCACAACGG CCGCATGACC CTGAACGAGA TCAGCGGCAA CAGCGCCAAC CCCTACTATT ACGACCCGGC GTTCAACCCG CTCTACATCC CATACTACAA CTTCTACAAC ACCAACGCCG CCCACGTCCT GGACGCTGGC GTCGGCGCGA CCAAGCTGTC GCCGCGCACG GTCTATATCA GCCCGGCCGA CTTCTCGAAG ACCGACACCA ACACCCTCTA TTTCGACCTG GCCAAGACCC TGTCGCCGAG CAGCACGATC AAGGCCCAGC TGTTCTACGA CGACCAGGAG AACAAGCGCT TCGTCTCGTA CGGCTACCCA GCCTGGTTCG ACAGCTCGGT CTGGGAAGCG CGCCTGACCT ACAATTTCGA AAACGAGTTC ATGGACGGGG CGGTCAAGGC CAAGTCGTTC ATCGGGGCCT CATACCGCGA CTTCTCGGGC CGCCGGCGCG AAAGCTACAA CAGCGGCGTG ATCGCTCTGG ACCGTCGCGA CATCAGCTAC GGCGCCACGG CCACCGACAT CATCGACAGC CCGTTCACCA CCGAGACCGG TTCTGGCGTT CTGGGCCTGG CTTGGGAAAA CGACAACAAG TCCGACTGGC AACAGAAGGG CGTGTTCTTC ATGAGCGACG TCACGGTCGG CGAGAAGCTG AACCTGATGG TCGGCGGTCG CTACGACGAC TACGACGTCA AGTCGCACGA CACCGGCGTG CTCAGCTACC AGGTGTCGGG CGAGCAGAAG GCCAGCAAGG GCAAGTTCAC CTACACCGCC AGCGCCACCT ACAAGGCTCC GGCCGGGGTG ATGCCTTACA TCACCTACGC CAAGGCCTCG GCGCTCGAGA TGAGCCAGGC CGGCGACGTC GCCGCCAGCC TCGTCGCCGA CCAGAGCGAC GCCTGGCTGT CCAACAGCGA CCTGGCCGAG 
GCCGGGGTGA AGTTCCAATG GCTGAAGGGC ACCCTGGTCG GCTCGCTGGC CGGCTACCGC CAGAACCGCA CCCAGCTGAC CGGCATCAGC GGCACGCCGA CCGGCACCCG CGCCAAGGGC GTCGAGATGG AAGTCCGCTG GCTGGCCAGC GAGAACTTCA GCTTCACCTT CTCGGGCAAC ACCCAGCACA CCACGGTCAA GGGGCCGGAC AATTCGTTCC AGTACATCCC GGCCTACACC GCCGGCGTCC CGGGCTCACA GGCCTTCGGC GGCACCTATG TGGTCTGGGC CTTCAGCGGC CTGGCGGGCC GCGCGGGCGA CTACGACTAC ACCCTGATCC CCAAGTCGGT GGTCAGCCTG TACGGCGCCT ATACCAGCGA CGACCACGAC TGGGGCAAGG TCGGCGGCGC GCTGGGCGTC ACCCACGTGA CCAAGACCTC GGGCACCGTC CAGAACGCCG TGACCTACCC GGCCTACTAC GTCGCCAACG CCTCGGCCTA CTACGAGTAC GGGCCGTACA CGGTGACGGC CAACATCGAT AACCTGTTCG ACAAGCTCTA CTTCACGCCC GACGCCGACA GCTACGCCAA CCTTGGCGCG CTGCCCAGCA AGGGCCGCGA GTGGCGCGTG ACCCTGTCGC GCAAGTTCTA G
|
Protein sequence | MSNQRLLMAG SSVAVLLALS SQAFAADPVA ADTSVAVDEI VVKARDKAGL LETRPNNTVF GLDKPLLETP RSASFVSDTT LQRYGIETID GLTAVSPGTY TASFYGVPGA LNIRGTLAEN YFRGFKRIEN RGTYSTPIGA ADQIQIVRGP PTPIYGSGKV GGMLNFIPKS GKNEGGYLSE PTGEVTATYG SYNKKNATAQ LGLPVNFGPV TGGVYAYGEV EDSHSFYKGV YPRRQTGEIS ADFDLGNGWS TAFGGMKYHS DGDVQTPGWN RLTQSLIDTG TYTTGRDTTL VDSDHNGRMT LNEISGNSAN PYYYDPAFNP LYIPYYNFYN TNAAHVLDAG VGATKLSPRT VYISPADFSK TDTNTLYFDL AKTLSPSSTI KAQLFYDDQE NKRFVSYGYP AWFDSSVWEA RLTYNFENEF MDGAVKAKSF IGASYRDFSG RRRESYNSGV IALDRRDISY GATATDIIDS PFTTETGSGV LGLAWENDNK SDWQQKGVFF MSDVTVGEKL NLMVGGRYDD YDVKSHDTGV LSYQVSGEQK ASKGKFTYTA SATYKAPAGV MPYITYAKAS ALEMSQAGDV AASLVADQSD AWLSNSDLAE AGVKFQWLKG TLVGSLAGYR QNRTQLTGIS GTPTGTRAKG VEMEVRWLAS ENFSFTFSGN TQHTTVKGPD NSFQYIPAYT AGVPGSQAFG GTYVVWAFSG LAGRAGDYDY TLIPKSVVSL YGAYTSDDHD WGKVGGALGV THVTKTSGTV QNAVTYPAYY VANASAYYEY GPYTVTANID NLFDKLYFTP DADSYANLGA LPSKGREWRV TLSRKF
|
| |
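The relationship between the Gene sequence and Protein sequence rows can be illustrated with a short sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in their permitted start codons, which this sketch does not model). The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced here for illustration, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Compact codon table in the usual T, C, A, G ordering (standard assignments,
# which table 11 shares).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the Gene sequence row.
prefix = "ATGTCGAACC AACGGCTTTT GATGGCGGGA"
print(translate(prefix))  # MSNQRLLMAG
```

The output matches the first ten residues of the Protein sequence row. Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content row, assuming that figure was computed over the same span.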