Gene TM1040_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1420 
Symbol 
ID4078050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1514770 
End bp1516857 
Gene Length2088 bp 
Protein Length695 aa 
Translation table11 
GC content59% 
IMG OID638006730 
Productoligopeptide/dipeptide ABC transporter, ATP-binding protein-like 
Protein accessionYP_613415 
Protein GI99081261 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0353718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.112708 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAC TCGCAGACTA TGACGGCCCG ATCCTTGAAA TCGACAAGCT GTCGATCTCA 
TTTTTTACAA GGCTGCGCGA GATTCCAGCG GTGATGGACT TTTCGGTCGC CGTGCAACCG
GGCGAAGCGG TTGGGCTGGT CGGAGAATCC GGCTGCGGCA AATCCACCGT GGCGTTGGGC
GTCATGCAGG ATCTGGGCAA GAACGGGCGC ATCGTCGGTG GTTCGATCAA GTTCAAGGGC
CGTGATCTCG CGGAGATGAG CGCCGAGGAG CTGCGAGATG TGCGCGGCAA CGAGATCGCG
ATGATCTATC AGGAGCCGAT GGCCTCGCTC AATCCGGCGA TGAAAATCGG CAAGCAACTG
ATGGAAGTGC CGATGATTCA TGAGGGTGTC AGCGAGAAAG AGGCCTATGA TCGCGCGCTC
GAGGTGGTCA CGGATGTAAA ACTGCCGGAT CCCAAGCGGA TGCTGAATTC CTATCCGCAT
CAGCTCTCGG GCGGGCAGCA GCAGCGGATC GTCATTGCAA TGGCACTGAT GTCCAAGCCT
GCACTCTTGA TCCTCGATGA GCCTACCACC GCACTTGATG TAACTGTTGA GGCCGCCGTG
GTCGAACTGG TCAAGGATCT GGGCAAGAAA TACGGCACCT CAATGCTGTT TATCTCGCAC
AACCTCGGGC TTGTTCTTGA GACTTGTGAC AGGATTTGCG TGATGTATTC CGGTGAGGCG
GTGGAGCGCG GGTCGATTGA GGATGTCTTT GACCACATGC GCCACCCCTA CACGCAGGCC
TTGTTCCGCT CAATTCCCCT GCCGGGCGCC GACAAGAACG CGCGTCCGCT GGTGGCGATC
CCCGGCAACT TTCCCCTGCC CCATGAACGC CCGCGCGGCT GCAACTTTGG CCCACGCTGT
GACTATTTCG AAGCCGGGCG CTGTGATGCA AGCGACATCG CAATGGCGGC GGTGCCGGGC
AATGAACGCC ACCATACCCG CTGTCTGCGC TTTGAAGAAA TCGACTGGAA CGCACCGCTC
GCTCTGGCGG AGCAAACCAG CAAGACAGAG CCCGGGCGGG TTGTGCTCAA GATGGACAAG
CTCAAGAAAT ATTACGAGGT CGCGGCCAAT GCGCTGTTTG GGGGCGGCGC GCGAAAGGTT
GTCAAAGCCA ATGAGACACT GAGCTTTGAG GCGCGCGAAT CCGAGACGCT TGCCATCGTG
GGCGAGTCCG GCTGCGGTAA ATCCACATTT GCCAAGGTAC TGATGGGGCT GGAAACAGCG
ACCGAGGGCG AGATCCTTCT GGACGACCGC AACATCGAGG ACGTGCCGAT CGAGGCGCGG
GATGCAAAGA CCATTTCCGA TGTGCAAATG GTGTTCCAGA ACCCCTTCGA CACGCTCAAT
CCTTCGATGA CGGTCGGGCG GCAGATCATC CGGGCGCTGG AGATCTTTGG CATCGGTGAC
AGCGATGGCG CAAGAAAGCA GCGCATGCTC GAACTTCTGG ATCTGGTGAA GCTGCCACGT
GCCTTTGCCG ACCGGATGCC GCGACAGCTC TCGGGCGGGC AGAAACAGCG GGTTGGTATT
GCCCGTGCCT TTGCCGGGGG CGCGCGGATC GTGGTTGCGG ATGAACCCGT CTCGGCGCTG
GATGTGTCGG TGCAGGCTGC TGTGACCGAT CTCCTGATGG AGATCCAGCG CAATGAGAAG
ACGACCCTGC TCTTTATCAG TCACGATCTC TCGATTGTGC GCTATCTCAG CGATCGGGTG
ATGGTGATGT ATCTCGGTCA TGTGGTCGAG CTTGGCGAAA CCGAGCAGGT GTTCTCACCG
CCCTATCACC CCTATACCGA GGCGCTGTTG TCTGCTGTGC CAATCGCGGA CACATCGGTC
GAAAAGACCC ATATCGTGCT TGAGGGGGAT ATCCCTTCGG CCATGAACCC GCCCTCCGGG
TGCCCGTTTC AGACTCGTTG CCGTTGGAAA TCGAAGGTTC CAGGTGGCCT CTGCGAGGCT
GAGGTGCCGC CCATCCAGAC CCTTGAAAAC GGGCACCAGA TCAAATGCCA TCTGAGCGGC
GAAGTTCTGG AGAGTATGGA GCCGGTCATC AAGATCGCGG CGGAGTGA
 
Protein sequence
MSKLADYDGP ILEIDKLSIS FFTRLREIPA VMDFSVAVQP GEAVGLVGES GCGKSTVALG 
VMQDLGKNGR IVGGSIKFKG RDLAEMSAEE LRDVRGNEIA MIYQEPMASL NPAMKIGKQL
MEVPMIHEGV SEKEAYDRAL EVVTDVKLPD PKRMLNSYPH QLSGGQQQRI VIAMALMSKP
ALLILDEPTT ALDVTVEAAV VELVKDLGKK YGTSMLFISH NLGLVLETCD RICVMYSGEA
VERGSIEDVF DHMRHPYTQA LFRSIPLPGA DKNARPLVAI PGNFPLPHER PRGCNFGPRC
DYFEAGRCDA SDIAMAAVPG NERHHTRCLR FEEIDWNAPL ALAEQTSKTE PGRVVLKMDK
LKKYYEVAAN ALFGGGARKV VKANETLSFE ARESETLAIV GESGCGKSTF AKVLMGLETA
TEGEILLDDR NIEDVPIEAR DAKTISDVQM VFQNPFDTLN PSMTVGRQII RALEIFGIGD
SDGARKQRML ELLDLVKLPR AFADRMPRQL SGGQKQRVGI ARAFAGGARI VVADEPVSAL
DVSVQAAVTD LLMEIQRNEK TTLLFISHDL SIVRYLSDRV MVMYLGHVVE LGETEQVFSP
PYHPYTEALL SAVPIADTSV EKTHIVLEGD IPSAMNPPSG CPFQTRCRWK SKVPGGLCEA
EVPPIQTLEN GHQIKCHLSG EVLESMEPVI KIAAE