Gene TM1040_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2549 
Symbol 
ID4076680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2690971 
End bp2692563 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content59% 
IMG OID638007873 
ProductABC transporter related 
Protein accessionYP_614543 
Protein GI99082389 
COG category[R] General function prediction only 
COG ID[COG4172] ABC-type uncharacterized transport system, duplicated ATPase component 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.165477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAAG CGCCAATTCT GCAGGTCAAG GACCTTCGGG TCGCCTTTCG TCAGGACGGT 
CAGCGGGTTG AGGCCGTCAA AGGGGTGTCC TTTGCCCTGT CTCGTGGCGA GACGGTGGCT
TTGGTGGGCG AGTCGGGCTC CGGCAAATCG GTCTCGGCGC TTTCGACGGT GCAGCTCCTT
GGTGATAGCG CCGAGGTCTC CGGCTCGGTG AACTATGACG GACAGGAAAT GATCGGCGCC
GATGAACGTA CCCTGCGACG GGTGCGCGGC AATGACATCT CCTTCATTTT CCAGGAGCCG
ATGACCTCTC TGAACCCGCT GCATACGATC GAAAAACAGT TGGGCGAGGC GCTTGCGCTA
CATCAGGGAG TGATGGGAGA GGACGCCCGC GCGCGTGTGC TGGACCTGTT GAATAAGGTT
GGTATCCGGG ACGCGGAAAC CCGACTGGGG GCTTATCCGC ACCAGCTTTC TGGCGGTCAA
CGTCAGCGTG TAATGATCGC AATGGCGCTC GCCAACAAAC CCGACGTTCT GATCGCGGAC
GAGCCCACAA CGGCGCTCGA TGTGACCATT CAGGCGCAGA TCCTCGATCT CCTTGCGGAT
CTGAAACAGA GCGAGGGGAT GGGGCTTTTG TTCATCACCC ATGACCTCTC GATTGTGCGC
CGTATTGCGG ATCGGGTCTG CGTGATGAAA TCCGGTGAAA TTGTCGAGGA AGGTCCAACG
GCTGAGCTGT TTGCAAACCC ACAGCACCCC TACACCCAGA AACTCCTGGC GGCAGAACCC
TCCGGTCGAC CCGCGCCGAT CCCGGAAGGG GCAACCGAAT TCGTCAGTGC CAAGGATCTC
AAGGTCTGGT TTCCCATCCA GCGGGGGCTA CTGAAGCGCA TGGTTGGCCA TGTCAAGGCG
GTCAATCCCA TGTCGCTCTC GGTGCGCGCC GGGGAAACTT TGGGAATTGT AGGCGAATCC
GGCTCGGGCA AAACCACGAT GGCACTTGCG ATCATGCGCC TTATCGCCTC CGAAGGCGAG
GTGCGCTTTC AGGATCAGGA CCTGCGCCAA TGGTCCACGC GGGACCTGCG CCGGTTGCGC
AAGGACATGC AGATCGTGTT TCAGGATCCG TTTGGCTCGC TGTCTCCGCG TATGACATGC
CAGCAGATCA TTGCCGAAGG TCTGGCAATT CATGAGGTTG ACCAACATCG TAAGCCGCGT
GATCTTGTGG CTGATGTAAT GCAGGAAGTG GGTCTCGACC CTGCCACCAT GGACCGCTAT
CCACATGAAT TCTCAGGTGG GCAGCGTCAG CGCATTGCCA TCGCCCGTGC AATGGTTCTG
CGCCCCAAGC TGGTGGTTCT GGACGAGCCG ACCTCGGCCC TTGATATGAC GGTCCAGGTG
CAGATCGTGA ACCTGCTGCG CGATCTTCAG GCCCGCTACG GGCTTGCCTA TCTCTTCATC
AGCCATGACC TCAATGTGGT GCGCGCCATG TCGCATGACA TCCTGGTGAT GAAAGCCGGA
GACGTGATCG AGCAAGGCCC CGCCGAGGAG GTATTTGCCA ATCCCAAAGA AGAATACACA
CGGCATCTGC TCTCTGCCGT GACGGGTGCC TAG
 
Protein sequence
MTQAPILQVK DLRVAFRQDG QRVEAVKGVS FALSRGETVA LVGESGSGKS VSALSTVQLL 
GDSAEVSGSV NYDGQEMIGA DERTLRRVRG NDISFIFQEP MTSLNPLHTI EKQLGEALAL
HQGVMGEDAR ARVLDLLNKV GIRDAETRLG AYPHQLSGGQ RQRVMIAMAL ANKPDVLIAD
EPTTALDVTI QAQILDLLAD LKQSEGMGLL FITHDLSIVR RIADRVCVMK SGEIVEEGPT
AELFANPQHP YTQKLLAAEP SGRPAPIPEG ATEFVSAKDL KVWFPIQRGL LKRMVGHVKA
VNPMSLSVRA GETLGIVGES GSGKTTMALA IMRLIASEGE VRFQDQDLRQ WSTRDLRRLR
KDMQIVFQDP FGSLSPRMTC QQIIAEGLAI HEVDQHRKPR DLVADVMQEV GLDPATMDRY
PHEFSGGQRQ RIAIARAMVL RPKLVVLDEP TSALDMTVQV QIVNLLRDLQ ARYGLAYLFI
SHDLNVVRAM SHDILVMKAG DVIEQGPAEE VFANPKEEYT RHLLSAVTGA