Gene Information |
Locus tag | Caul_2543 |
Symbol | |
ID | 5899998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2757870 |
End bp | 2760260 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641563034 |
Product | TonB-dependent receptor |
Protein accession | YP_001684168 |
Protein GI | 167646505 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
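The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (an assumption that happens to agree with the listed Gene Length), and that the CDS length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table for locus tag Caul_2543.
START_BP = 2757870
END_BP = 2760260
GENE_LENGTH_BP = 2391
PROTEIN_LENGTH_AA = 796


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2391
    print(expected_protein_length(GENE_LENGTH_BP))  # 796
```

Both results match the Gene Length and Protein Length rows, which is consistent with an unspliced CDS on a single replicon.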
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.546908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACA CGCGTACCTC GCCGGCCTCA CGCCATTTCC GGGCCTTGCT GCTCGCCGCC ACCGTTCTGG GCGGCGCGAC GCCCGTCCTG GCCCAGGAGG CCGATAAGAC TTCAACCGTC GAGGAGGTGG TCGTCACCGG CAGCCGGGTG TCCGAAGCCA GCGTCGCCAT CGGCACCGAC CACGCCACCG CCACGGTCTC GATCACCCGC GAGGCCCTGC TGTCGGCGCC CGCCGGCGTG ACCGGCCTGA AAATGCTGGA GTCCCTGCCC GGCTTCAACG TCCAGGCCAA CGACGCCCTG GGCATGTACG AGTTCGGCAA TTCGGTCTCG GTGCGGGCCT TCAACTTCCA GCAGATCGGC TTCCTGCTCG ACAACATTCC AATGGGCCGC AGCGACCAGT TCGGCGGCAG CCCGATCTAT CGTTATGTCG ACAATGAGAA CCTCAACCGC GTGACAGCCT CGGCCGGGGC CGGCGACGTC TCCCTGCCCA GCTACGCCTC GCTGGGTCCC ATCGTCGACT ACTTCACCCA GAAGCCTTCG GACGAGGCCG GCGGCGCGGC CAGCGCCACC CTGGGCAGCG ACGCCCTCAA GCGCGGCTTC CTGCGCCTGG AGACCGGCAA GATCGGACCC GTCTCGGCCT ATGTCAGCGG CTCGTGGATC AAGGGCGACC TATGGCGCGG CCCGGGCACG ATCGACCGCA AGCACTATGA AGGCAAGCTG AACTACGAAC TGCCCAACGG CGGCGACATC AGCTTCCAGA CCGTCCACAA TGACTATTAC GATTATGACA GCCCCTCGAT CACCAAGGCC CAATACGCGG GCACGGCGGG TGACGTCTTC GGGCGCTCGG GTCGCAGCTT CGCCTATCTC GGCGAGGTGC CGCTGTCGGT CCCGCTGGGC ACGCCGGTGC TCATTCCGAC CGCCAGCCTA CCGCAGACCG TGGCGGGGAT CGTCTATTCC AACCCCAACT ACGCCAACTA CTACAAGTTC GCGGTCAACA AGAGGAAGGA CCACCTCTAC GGCCTGACCC TGACCACGCC GATCACCGAC ACGATCGACC TGACAACGAC GGCCTACTAC GAAGACAAGG GCGGCTATGG CGTCTCGCCC GAGGACTATG CGACGTCCAA GGGCAACTAC GACGCCGAGA TTCTGGCCGG TCTCACGGGA CTGACCGCGC CCAAGGGCTT GCAATACGGC CTGTCGGCGA TCGACGGCAC GCGCAAGGGC GTCACGGCCA AGGGCAGCTG GAAGGTCGGC TTCAACACGT TCGAGGCCGG GGTCTGGCTC GAGAAGGACG ACTATCACCG CACCCAGGCG CGCTACAACA CCGTCAACGG CGACCCCGAC GGCGCGCCGT TGCTGAACGA ACCAGTGCAC CTGCAACGAG ACTATGTCTC GACCCGCGAC ACCACCCAGT TCTTCCTGAA GGACACCCTG AGCCTGCTGG ACGACGCGCT GAAACTGGAA CTCGGCTTCA AGACGCTCGA CGTCGACTAC AACATCCACG GCAAGCGCCA GATCGCCGAC TACCGGACGG GCCGTACGCC GTCGATCGAC GCCAAGTGGA AGGACAACTT CCTGCCCCAG GTCGGCCTAG TCTACAGCGT CAGCAGCCGC GACCAGGTGT TCGCCTCTTA TTCGGAGAAC ATGGCCCTGC CGCGCGGCGC CGACGATGTG TTCTCGGCCG CCAGCCCGTC CGCGCCCGGT CCCAAGCCGG AAACCTCGAC CAACGTGGAA CTGGGCTATC GGGCCAATCG CGCGACCTTC AACGCCTCGT TCGTGGTCTA CAAGACCGAG 
TTCAAGAACC GGCTGCAGCA GTTCAACGCG GTGGTGCCCG GCAGCACCAC GCTGGAAAGT TTCTATCAGA ACGTCGGGGC GGTGAAGGCG TCCGGCGCCG AGTTCAGCGG CCAGTGGAAG CCGGAACTGC TGGGCGGCAA GATCTATTTC AACGCCAACG CCTCGTACAA TAAGTCGGAG TTCCAGGACG ACGTCCTGAA CTACCGGTCC AGCGCCACGG CCACGCCGGT GACCCTGTCG ACCAAGGGCA AGGCGGTGCC CGACTTCCCC GAGTGGCTGT TCCAGGGCGG GGTGACGGTC GAGCCGACGG ACGGCGTGGT GTTCAACCTC TCGGCCCGCC ACATCGACGA CCGCTTCACC AACTTCATCA ACAGCGAGAG CACCAAGGCC TATACCCTGT GGAACGCCTA TCTGGACCTG GGCGACGGCT TCGCGGCCGG GCCATTCAAG CAGGTCAAGA CCCGGGTCAA TATCGATAAC ATCTTCGACA AGGACTATCT GGGCACGATC AACACCACGG TGAACACCGC CGCCAGCTTC CGGCCCGGCT CGCACCGCAC GATCCAGTTC ACCGTCTCCG CCGACTTCTA G
|
Protein sequence | MTNTRTSPAS RHFRALLLAA TVLGGATPVL AQEADKTSTV EEVVVTGSRV SEASVAIGTD HATATVSITR EALLSAPAGV TGLKMLESLP GFNVQANDAL GMYEFGNSVS VRAFNFQQIG FLLDNIPMGR SDQFGGSPIY RYVDNENLNR VTASAGAGDV SLPSYASLGP IVDYFTQKPS DEAGGAASAT LGSDALKRGF LRLETGKIGP VSAYVSGSWI KGDLWRGPGT IDRKHYEGKL NYELPNGGDI SFQTVHNDYY DYDSPSITKA QYAGTAGDVF GRSGRSFAYL GEVPLSVPLG TPVLIPTASL PQTVAGIVYS NPNYANYYKF AVNKRKDHLY GLTLTTPITD TIDLTTTAYY EDKGGYGVSP EDYATSKGNY DAEILAGLTG LTAPKGLQYG LSAIDGTRKG VTAKGSWKVG FNTFEAGVWL EKDDYHRTQA RYNTVNGDPD GAPLLNEPVH LQRDYVSTRD TTQFFLKDTL SLLDDALKLE LGFKTLDVDY NIHGKRQIAD YRTGRTPSID AKWKDNFLPQ VGLVYSVSSR DQVFASYSEN MALPRGADDV FSAASPSAPG PKPETSTNVE LGYRANRATF NASFVVYKTE FKNRLQQFNA VVPGSTTLES FYQNVGAVKA SGAEFSGQWK PELLGGKIYF NANASYNKSE FQDDVLNYRS SATATPVTLS TKGKAVPDFP EWLFQGGVTV EPTDGVVFNL SARHIDDRFT NFINSESTKA YTLWNAYLDL GDGFAAGPFK QVKTRVNIDN IFDKDYLGTI NTTVNTAASF RPGSHRTIQF TVSADF
|
| |
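The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how such a field could be cleaned and sanity-checked before further use, the helpers below strip the whitespace, compute a GC fraction, and test whether a sequence looks like a complete open reading frame. The function names are hypothetical, and the frame check only recognises ATG as a start codon plus the three standard stop codons; translation table 11 also permits alternative start codons, which this sketch deliberately ignores.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the sequence starts with ATG, ends with a stop codon, and is in frame."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First two blocks of the Gene sequence field, used here only as a short demo input.
    demo = clean_sequence("ATGACCAACA CGCGTACCTC")
    print(demo)
    print(round(gc_fraction(demo), 2))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_fraction` and `looks_like_complete_cds` would allow a comparison against the GC content and Gene Length rows; the short demo input above is not meant to reproduce those values.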