Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3682 |
Symbol | |
ID | 4075651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 738877 |
End bp | 740649 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638005202 |
Product | oligopeptide/dipeptide ABC transporter, ATP-binding protein-like |
Protein accession | YP_611911 |
Protein GI | 99078653 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.732686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.333976 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCCCC TTTTGTCGGT CGAAAATCTC AGCATTGGCT TTGGTCGCGA TGCACCTGTG GTGCAGAACG TGAATTTCGA AGTGAATCCG GGTGAAACAC TGGCGCTGGT CGGTGAAAGT GGCTCTGGTA AGACCATCAG CTGCCGCGCG GTGCTGCGCA TTCTGCCGCG CACTGCACGC CTGCATTCGG GTCGCATGAT CCTGCGCGGC GACCAGGATG AGCTGGATCT GGCCCGCATC AGCGAGCGAC AAATGCGCCA TGATGTGCGC GGAAACCGCA TTGCGATGAT CTTTCAGGAG CCGATGCGCT CTTTGTCTCC GCTGCATCGG ATCGGCAATC AGGTGGTGGA GATCATCCAT CTTCACAGAA GCGTCTCTGA AGAAGCGGCC AAACGCGAGG TGCTGGAGTG TTTCGAGCGC GTTGGGTTCC CCGATCCCGA GCGCACCTGG CGCTCCTATC CGTTCGAGCT TTCGGGCGGC ATGCGCCAGC GCGCAATGAT CGCCATGGCC ATGGTTGCCA AGCCGGATCT GCTGATCGCG GATGAACCCA CAACTGCGCT TGATGTGACC ACTCAGGCAC AGGTTCTGGG GCTGATGAAG GATCTGCAGC GCGAGACCGG CATGGCCATG GTCCTCGTCA CCCATGATCT TGGCGTGGTG GCCAATATGG CCGAGCAGGT GGTGGTGATG CACAAGGGTC GTGTCATGGA GGCAGGCCCA GCCGAACCTA TCCTGCGCGC GCCCGCCCAT CCCTACACCA AGGATCTTTT TGAGGCGGCG CCCAAGATCC CGCCGGCGAT TTCCCCAGCG CCACAGGAGC AGCAGGATCT GATCCTCGAG CTGCGCAATG TCACCAAGAC CTTTACCATG CGCTCGGGCA AAAGCTGGAG CAAGCCGACC CTCGTGCGTG CCTGTGACAG CGTTGATCTG CGACTGCCGC GGGGCAAGAC CTTGGCAATT GTTGGCGAAA GCGGTTCAGG CAAGACCACA GCTGCACGCA TTGCGCTCGG CGCGGAAACG GTGGATGCGG GCGGCGAGGT GCTCTTTCGC CACGCCGCGG GGGCAGAGGC ACTCGAGGTG CATGATATGG ATCGCGACGC TCGCCGCGCG TTCCAGCGGC AGGCGCAGAT GGTGTTTCAA GACCCCTATT CCTCGCTCAG TCCGCGCCAG CGGATCTTTG ACACGCTGGC AGAGCCGCTC GAAATCCACG GCATCGGCGC GCGCGCCGAT CACAAGGCCC GTGCAGCCGA GATGCTGCGC CTTGTTGGGC TGCCCGGCGA TATGCTCAGC CGGTATCCGC ATGCGTTTTC CGGCGGTCAG CGCCAGCGCC TCTCTATAGC CCGAGCGCTG ATGCTCGACC CGGCTCTTCT GGTATGTGAC GAGCCGACCT CGGCGCTGGA TGTCTCGGTG CAGGAACAGA TCCTGACCCT GCTGGAAGAA ATCCGCGACG CGCGCCAGCT TTCCTATCTC TTTATCAGCC ACGACCTTGC GGTGGTGGCC CGGATCGCCG ATGAGGTCGC GGTGATGCGG CGTGGCCTCA TTGTGGAACA GGGCCCGCCC GAAGTGCTGT TTCACAACCC CAAACACCCC TACACCAAGG CACTTATTGC TGCCCAGCCG GTCCCCGATG TGGATCGCCC CATCAATCTC AAACTTGTGG CACAGGGCGC AGGCGCGCCC GAAAGCTGGC CGGAGGCCTT CCGCTTTGCC GGAGAGAATG CCCCGCCGTT GGTGCCACTG GATCCCGGAC ATAAGGTACG CTGTCATGTC TAA
|
Protein sequence | MRPLLSVENL SIGFGRDAPV VQNVNFEVNP GETLALVGES GSGKTISCRA VLRILPRTAR LHSGRMILRG DQDELDLARI SERQMRHDVR GNRIAMIFQE PMRSLSPLHR IGNQVVEIIH LHRSVSEEAA KREVLECFER VGFPDPERTW RSYPFELSGG MRQRAMIAMA MVAKPDLLIA DEPTTALDVT TQAQVLGLMK DLQRETGMAM VLVTHDLGVV ANMAEQVVVM HKGRVMEAGP AEPILRAPAH PYTKDLFEAA PKIPPAISPA PQEQQDLILE LRNVTKTFTM RSGKSWSKPT LVRACDSVDL RLPRGKTLAI VGESGSGKTT AARIALGAET VDAGGEVLFR HAAGAEALEV HDMDRDARRA FQRQAQMVFQ DPYSSLSPRQ RIFDTLAEPL EIHGIGARAD HKARAAEMLR LVGLPGDMLS RYPHAFSGGQ RQRLSIARAL MLDPALLVCD EPTSALDVSV QEQILTLLEE IRDARQLSYL FISHDLAVVA RIADEVAVMR RGLIVEQGPP EVLFHNPKHP YTKALIAAQP VPDVDRPINL KLVAQGAGAP ESWPEAFRFA GENAPPLVPL DPGHKVRCHV
|
| |