Gene TM1040_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1823 
Symbol 
ID4076969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1917425 
End bp1919062 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content62% 
IMG OID638007138 
ProductABC transporter related 
Protein accessionYP_613818 
Protein GI99081664 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0401601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.812901 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC CACTCCTCAA GGTCCGCGAC CTGAAAATCG GCGCCACGGT TTATCCGCCG 
GGCGAGAAAC CCCATGACAT CGAAATCGTG CATGGCGTCA GCTTTGACCT CATGCCAGGC
AAGGTGCTGG GTCTCATCGG GGAATCCGGT GCAGGTAAAT CCACCATCGG TCTGTCGTCC
ATGGCCTATG GCCGGGGCGG TGTGAAAATC ACCGGTGGCG AAGTCTGGGT CAACGGGCGC
GACATCCTCA AATCCAAGCT GAGCGACATC CGCAAGCTGC GTGGGGGTGA GGTGACCTAT
GTGTCGCAAT CTGCCGCCGC GTCGTTCAAC CCGGCCAAGA CCATCATGGA ACAGGTGATC
GAAGCCTCGG TCGAGCAGGG CAAATTCTCC CGCAGAGTGG CCGAAGACCG CGCCCGCGCG
CTCTTTGCCA AGCTGGGCCT GCCCGACCCC GACAACATCG GCGCCCGCTA TCCGCATCAG
GTGTCCGGTG GTCAGCTGCA GCGCTGCATG ACCGCACTTG CGCTCTGTCC GGAACCCGAT
CTCGTGGTCT TTGACGAGCC CACCACGGCG CTTGATGTGA CCACGCAGAT CGACGTTCTG
ATGGCGATCA AGGAAGCGAT CCGCGACACC GGTGTGGCCG CGCTTTATAT CACCCACGAT
CTTGCGGTTG TGGCACAGGT CTCTGATGAC ATCATGGTGC TGCGCCACGG CAATACCGTG
GAATACGGCT CGGTCGATCA GATCATCAAC AACCCGCAAG AAGAGTACAC GCAGGCGCTG
GTCTCCGTGC GCTCGATCGA GCACGAGGAA AAGGCCCCCA CCGAGGAGCC GATCCTGTCG
GTGCGCAACA TCACTGCGCG CTACAAGGGC ACCAAGTTCG ACGTGCTGCA CAACGTGAAC
GTCGATCTCT ACCCCGGTCA GACCCTGGCC GTGGTGGGCG AGTCCGGTTC GGGCAAATCG
ACGCTGGCGC GGGTGATCAC CGGCCTTCTG CCCCCGCGCG AAGGCGAGAT CTACTTCAAC
GGGCGCACGC TCACGCCGGA CTTCAACAAC CGCAGCCGCG AGGATCTGCG CGAGTTGCAG
ATGATCTACC AGATGGCGGA TGTGGCGATG AACCCGCGTC AGACCGTAGG CACCATCATC
GGCCGGCCGC TAGAGTTCTA TTTCGGCCTG AAGGGCGCGG AAAAGCGCAA GCGGATCATC
GAGTTGCTCG ACGAGATTGA ACTCGGGGAA GGCTTTATCG ACCGCTACCC GGCAGAGCTG
TCGGGCGGGC AGAAACAGCG TGTCTGTATC GCCCGGGCGC TGGCGGCCAA GCCCAAGATG
ATCATCTGTG ACGAGGTCAC CTCGGCGCTC GATCCACTGG TGGCGGACGG CATCCTGAAA
CTGTTGCTGA ACCTGCAAAA GATCGAGGAT GTGGCGTTTC TCTTCATCAC CCACGATCTC
GCGACGGTGC GCGCGATCTC TGACAACATC GCGGTGATGT ACAAGGGCAA GGTGCAGCGC
TACGGCGGCA AGACGCAGGT GCTGAGCCCG CCCTTTGACG ACTACACCGA CCTTCTGCTG
AGCTCGGTGC CGGAGATGAA GCTCGGCTGG CTCGAAGAGG TGATCGCCAA CCGCAAGATG
GAGAGCGCGG GCAACTGA
 
Protein sequence
MSEPLLKVRD LKIGATVYPP GEKPHDIEIV HGVSFDLMPG KVLGLIGESG AGKSTIGLSS 
MAYGRGGVKI TGGEVWVNGR DILKSKLSDI RKLRGGEVTY VSQSAAASFN PAKTIMEQVI
EASVEQGKFS RRVAEDRARA LFAKLGLPDP DNIGARYPHQ VSGGQLQRCM TALALCPEPD
LVVFDEPTTA LDVTTQIDVL MAIKEAIRDT GVAALYITHD LAVVAQVSDD IMVLRHGNTV
EYGSVDQIIN NPQEEYTQAL VSVRSIEHEE KAPTEEPILS VRNITARYKG TKFDVLHNVN
VDLYPGQTLA VVGESGSGKS TLARVITGLL PPREGEIYFN GRTLTPDFNN RSREDLRELQ
MIYQMADVAM NPRQTVGTII GRPLEFYFGL KGAEKRKRII ELLDEIELGE GFIDRYPAEL
SGGQKQRVCI ARALAAKPKM IICDEVTSAL DPLVADGILK LLLNLQKIED VAFLFITHDL
ATVRAISDNI AVMYKGKVQR YGGKTQVLSP PFDDYTDLLL SSVPEMKLGW LEEVIANRKM
ESAGN