Gene TM1040_3682 details

Gene Information

Locus tag: TM1040_3682
Symbol:
ID: 4075651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 738877
End bp: 740649
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 62%
IMG OID: 638005202
Product: oligopeptide/dipeptide ABC transporter, ATP-binding protein-like
Protein accession: YP_611911
Protein GI: 99078653
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
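
The coordinates and lengths listed above are consistent with one another, which a little arithmetic can confirm. The Python sketch below assumes that the start and end positions are inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 738877
end_bp = 740649

# Assumption: both coordinates are inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1773
print(protein_length)  # 590
```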


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.732686
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.333976
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCCCC TTTTGTCGGT CGAAAATCTC AGCATTGGCT TTGGTCGCGA TGCACCTGTG 
GTGCAGAACG TGAATTTCGA AGTGAATCCG GGTGAAACAC TGGCGCTGGT CGGTGAAAGT
GGCTCTGGTA AGACCATCAG CTGCCGCGCG GTGCTGCGCA TTCTGCCGCG CACTGCACGC
CTGCATTCGG GTCGCATGAT CCTGCGCGGC GACCAGGATG AGCTGGATCT GGCCCGCATC
AGCGAGCGAC AAATGCGCCA TGATGTGCGC GGAAACCGCA TTGCGATGAT CTTTCAGGAG
CCGATGCGCT CTTTGTCTCC GCTGCATCGG ATCGGCAATC AGGTGGTGGA GATCATCCAT
CTTCACAGAA GCGTCTCTGA AGAAGCGGCC AAACGCGAGG TGCTGGAGTG TTTCGAGCGC
GTTGGGTTCC CCGATCCCGA GCGCACCTGG CGCTCCTATC CGTTCGAGCT TTCGGGCGGC
ATGCGCCAGC GCGCAATGAT CGCCATGGCC ATGGTTGCCA AGCCGGATCT GCTGATCGCG
GATGAACCCA CAACTGCGCT TGATGTGACC ACTCAGGCAC AGGTTCTGGG GCTGATGAAG
GATCTGCAGC GCGAGACCGG CATGGCCATG GTCCTCGTCA CCCATGATCT TGGCGTGGTG
GCCAATATGG CCGAGCAGGT GGTGGTGATG CACAAGGGTC GTGTCATGGA GGCAGGCCCA
GCCGAACCTA TCCTGCGCGC GCCCGCCCAT CCCTACACCA AGGATCTTTT TGAGGCGGCG
CCCAAGATCC CGCCGGCGAT TTCCCCAGCG CCACAGGAGC AGCAGGATCT GATCCTCGAG
CTGCGCAATG TCACCAAGAC CTTTACCATG CGCTCGGGCA AAAGCTGGAG CAAGCCGACC
CTCGTGCGTG CCTGTGACAG CGTTGATCTG CGACTGCCGC GGGGCAAGAC CTTGGCAATT
GTTGGCGAAA GCGGTTCAGG CAAGACCACA GCTGCACGCA TTGCGCTCGG CGCGGAAACG
GTGGATGCGG GCGGCGAGGT GCTCTTTCGC CACGCCGCGG GGGCAGAGGC ACTCGAGGTG
CATGATATGG ATCGCGACGC TCGCCGCGCG TTCCAGCGGC AGGCGCAGAT GGTGTTTCAA
GACCCCTATT CCTCGCTCAG TCCGCGCCAG CGGATCTTTG ACACGCTGGC AGAGCCGCTC
GAAATCCACG GCATCGGCGC GCGCGCCGAT CACAAGGCCC GTGCAGCCGA GATGCTGCGC
CTTGTTGGGC TGCCCGGCGA TATGCTCAGC CGGTATCCGC ATGCGTTTTC CGGCGGTCAG
CGCCAGCGCC TCTCTATAGC CCGAGCGCTG ATGCTCGACC CGGCTCTTCT GGTATGTGAC
GAGCCGACCT CGGCGCTGGA TGTCTCGGTG CAGGAACAGA TCCTGACCCT GCTGGAAGAA
ATCCGCGACG CGCGCCAGCT TTCCTATCTC TTTATCAGCC ACGACCTTGC GGTGGTGGCC
CGGATCGCCG ATGAGGTCGC GGTGATGCGG CGTGGCCTCA TTGTGGAACA GGGCCCGCCC
GAAGTGCTGT TTCACAACCC CAAACACCCC TACACCAAGG CACTTATTGC TGCCCAGCCG
GTCCCCGATG TGGATCGCCC CATCAATCTC AAACTTGTGG CACAGGGCGC AGGCGCGCCC
GAAAGCTGGC CGGAGGCCTT CCGCTTTGCC GGAGAGAATG CCCCGCCGTT GGTGCCACTG
GATCCCGGAC ATAAGGTACG CTGTCATGTC TAA
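
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal Python sketch, assuming the sequence is pasted as plain text in 10-base blocks as shown here; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGCGCCCCC TTTTGTCGGT CGAAAATCTC AGCATTGGCT TTGGTCGCGA TGCACCTGTG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Applying `gc_percent` to the full 1773-base sequence gives the value to compare with the rounded figure in the record.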
 
Protein sequence
MRPLLSVENL SIGFGRDAPV VQNVNFEVNP GETLALVGES GSGKTISCRA VLRILPRTAR 
LHSGRMILRG DQDELDLARI SERQMRHDVR GNRIAMIFQE PMRSLSPLHR IGNQVVEIIH
LHRSVSEEAA KREVLECFER VGFPDPERTW RSYPFELSGG MRQRAMIAMA MVAKPDLLIA
DEPTTALDVT TQAQVLGLMK DLQRETGMAM VLVTHDLGVV ANMAEQVVVM HKGRVMEAGP
AEPILRAPAH PYTKDLFEAA PKIPPAISPA PQEQQDLILE LRNVTKTFTM RSGKSWSKPT
LVRACDSVDL RLPRGKTLAI VGESGSGKTT AARIALGAET VDAGGEVLFR HAAGAEALEV
HDMDRDARRA FQRQAQMVFQ DPYSSLSPRQ RIFDTLAEPL EIHGIGARAD HKARAAEMLR
LVGLPGDMLS RYPHAFSGGQ RQRLSIARAL MLDPALLVCD EPTSALDVSV QEQILTLLEE
IRDARQLSYL FISHDLAVVA RIADEVAVMR RGLIVEQGPP EVLFHNPKHP YTKALIAAQP
VPDVDRPINL KLVAQGAGAP ESWPEAFRFA GENAPPLVPL DPGHKVRCHV
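
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The Python sketch below illustrates this on the first and last codons of the gene. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which start codons are permitted); the function name `translate` and the `X` placeholder for unrecognized codons are choices made for this example.

```python
# Codon table in the conventional TCAG ordering (standard code; the
# amino-acid assignments are the same for translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Step through complete codons; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence.
print(translate("ATGCGCCCCC TTTTGTCGGT CGAAAATCTC"))      # MRPLLSVENL
print(translate("GATCCCGGAC ATAAGGTACG CTGTCATGTC TAA"))  # DPGHKVRCHV
```

The two outputs match the first and last ten residues of the protein sequence listed above.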