Gene TM1040_2684 details


Gene Information

Locus tag: TM1040_2684
Symbol:
ID: 4077595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2822202
End bp: 2824085
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 64%
IMG OID: 638008009
Product: oligopeptide/dipeptide ABC transporter, ATP-binding protein-like
Protein accession: YP_614678
Protein GI: 99082524
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
        [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
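
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2822202, 2824085)
print(length)                     # 1884
print(protein_length_aa(length))  # 627
```

Both results agree with the Gene Length and Protein Length fields listed in the record.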


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00440045
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.764263
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTCC TGCGTCTTTT GTCCCGCAAC CGCCTCGCGC TGGCGGGGCT CATCGTGATG 
TCGGTGGTGC TGCTGCTGGC GGTGCTGACA CCCATTTTGC CGCTGCCCGA CCCGGATGTG
ACCAACACTG CAGAGCGGTT CAAGAAACCC TTTAGCGAGG GGGCCTTGCT GGGCACTGAC
CACCTTGGTC GGGATCTCGC CAGCCGCCTG ATGTGGGGCA CGCGGCTGTC GCTGGCAGTG
GGCTTTGCGG CAGCGGTGGC GGCGGCCACC ATCGGGGCGG CCATCGGCGT GATCGCCGGT
TTTTATGGCG GGCGCGTGGA CAATGTGATC ATGCGCGGCG TCGATATGCT GATGGGCTTT
CCCTATATCC TCCTGGCGCT GGCGATTGTC GCAGCACTTG GTCCGGGGCT GATGAATGCG
CTGATCGCTG TGGCCGCCGT CAACATTCCC TTCTTCGCGC GCAACATTCG CGGTGTCACC
GTCGGCATTG CGCACAAGGA ATTTATTGAT GCGGCGCGGC TGTGCGGGAT GTCGAATGCG
CGCATTATCA TCACCGAGGT GGTGCCAAAC GTGATCCCGG TGATCGTGAT CGCCATGTCC
ACCACTGTCG GCTGGATGAT CCTCGAGACG GCAGGTCTCA GTTTCCTTGG CCTTGGTTCG
CAACCGCCGC AGGCGGATCT GGGCTCCATG CTGGGGGAGG CACGCTCGGC GCTGATTACC
AATCCGCATA CCTCCGTGGT GCCCGGTGCG ATGATCCTCG TGATCGTGAT GGCGATCAAC
CTTCTGGGCG ACGGCGTGCG CGACGCGCTT GATCCGCGCC TGAAATCCGG CGCGCTCAGC
CGCCCGATGC CGACCACCAT GGTGCGCCGC ACAGACCCCG TACCGCAGCC CGAAGGCGAC
GGTATCCTGA GCCTTTGCAA CCTGCAAACC CAGTTCCACA TCAAGGATCG CATCTACAAG
GCCGTGGGCG GCGTGGATCT CTCGGTAAGG CCGGGCGAAT GCCTTGGGAT CATTGGTGAA
AGTGGCTCTG GTAAATCCGT GACGGCGCTG TCGATCATGG GGCTGGTGGC CTCGCCCCCC
GGTGTCATCA CCGGCGGTGC AGTGCATTAC AAGGGCGAGG ATCTGATCGG TGCGCCCTAT
GAGACCCTGC GCCGTCTGCG CGGTGACCGC GTGGCCTATA TCTTTCAGGA TCCTCTGGCG
ACGCTGCACC CGCTCTATAC GGTTGGCGCG CAGCTCATCG AGGCGATCCA GAGCCATCAT
CGCACCAGCA CCTCCGAGGC GCGTGCCCGC GCGATTGAGC TTTTGAAATC CGTGCGCATC
CCCAATGCCG AGGCGCGCGT GGACAATTAC CCGCATGAGA TGTCGGGCGG CATGCGCCAG
CGGGTCGGCA TCGCCATGGC GCTGGCCAAC GACCCTGAGG TCATCATCGC GGATGAGCCC
ACAACCGCGC TGGATGTGAC GGTGCAGGCG CAGATCCTTG CGCTCCTGGA TGACTTGCGG
CGCGAGCGGG GCTTGGCGAT CATCTTCATC ACCCATGATT TTGGCGTGGT GGCGCAGCTC
TGTGATCGGG TGGCGGTGAT GTATGCGGGC CGCATTGTCG AGGAAGGCCC CACCGATGCC
ATTCTCAATG CGCCTGCTCA CCCCTATACC GCGCGGCTGA TGGCCTGCGT GCCGGAACTG
GGCCAGGGCA GGCGCGAGCT TGCGGCCATT CCCGGTCTAC CGCCTGTGGT GGACAAACTG
CCCGCGGGGT GCGCCTTTGC GGATCGCTGC CCCAAGGCCG CCAAAGCCTG CCGCAGCGGG
GACATTGCGC TCGATGGGTT TGGTGCGGGG CGCAAGATCC GCTGTATAGA CCCTGAAATT
CCAGTGCAGG AGGCCACGGC ATGA
 
Protein sequence
MSFLRLLSRN RLALAGLIVM SVVLLLAVLT PILPLPDPDV TNTAERFKKP FSEGALLGTD 
HLGRDLASRL MWGTRLSLAV GFAAAVAAAT IGAAIGVIAG FYGGRVDNVI MRGVDMLMGF
PYILLALAIV AALGPGLMNA LIAVAAVNIP FFARNIRGVT VGIAHKEFID AARLCGMSNA
RIIITEVVPN VIPVIVIAMS TTVGWMILET AGLSFLGLGS QPPQADLGSM LGEARSALIT
NPHTSVVPGA MILVIVMAIN LLGDGVRDAL DPRLKSGALS RPMPTTMVRR TDPVPQPEGD
GILSLCNLQT QFHIKDRIYK AVGGVDLSVR PGECLGIIGE SGSGKSVTAL SIMGLVASPP
GVITGGAVHY KGEDLIGAPY ETLRRLRGDR VAYIFQDPLA TLHPLYTVGA QLIEAIQSHH
RTSTSEARAR AIELLKSVRI PNAEARVDNY PHEMSGGMRQ RVGIAMALAN DPEVIIADEP
TTALDVTVQA QILALLDDLR RERGLAIIFI THDFGVVAQL CDRVAVMYAG RIVEEGPTDA
ILNAPAHPYT ARLMACVPEL GQGRRELAAI PGLPPVVDKL PAGCAFADRC PKAAKACRSG
DIALDGFGAG RKIRCIDPEI PVQEATA
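
Both sequences are printed in space-separated blocks of ten residues, so they need to be normalized before any downstream use. The following is a minimal sketch of that cleanup plus a GC percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used for the GC content field in the record is not known.

```python
def clean_sequence(text):
    """Drop spaces and line breaks and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGAGCTTCC TGCGTCTTTT GTCCCGCAAC CGCCTCGCGC TGGCGGGGCT CATCGTGATG"
print(len(clean_sequence(first_line)))  # 60 bases on the first line
print(gc_percent("ATGC GGCC"))          # 75.0
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.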