Gene TM1040_2251 details


Gene Information

Locus tag: TM1040_2251
Symbol:
ID: 4077318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2365793
End bp: 2366992
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 63%
IMG OID: 638007573
Product: putative ABC transporter solute-binding protein
Protein accession: YP_614245
Protein GI: 99082091
COG category: [R] General function prediction only
COG ID: [COG4134] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
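
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2365793, 2366992)
print(length_bp)                  # 1200
print(protein_length(length_bp))  # 399
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.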


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.11793
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCA TCTTGTCCCT TGCCCTCGCC CTTGCGCCGC TTTCGGCCTG GGCCACACCC 
GATCCGGCGG ATTGGCCCGC CGTGGTGGAA GATGCAAAAG GGCAGACGGT GCATTGGCAT
GCCTGGGGCG GCTCCACCGC GACCAATGAC TTTATCGCCT GGGTGGGCGA ACGTCTTGAG
GACGACTACG ACATCACCCT CAACCATGTG AAACTCGAAA GCACCGCCGA TGCGGTCACG
CGGGTGCTGA CCGAGAAATC CGCGGGCCAG GATGACGATG GCGCGGTGGA TCTCATCTGG
ATCAACGGGG CGAATTTCGC CGCGATGAAG GACGCCGATC TGTTGTTTGG CCCCTTTGCC
GAGGCCCTGC CCAACTGGCA GCTGGCCGAC ACCGAAAACA AGACGCTGCA GCATGATTTC
ACCGTCCCCA CCGAAGGCTA CGAGTCCCCA TGGGCGATGG CGCAGGTGGT CTTCATGCAT
GACACCGCCG ACCTGCCCGA GCGGCTTGGC TCGATGGAGG CGCTGTTGGA CTGGGCGCGC
GAACATCCCG GTCGCTTCAC CTATCCGCAG CCCCCGGATT TCCTCGGCAC CACCTTTCTG
AAACAGGCAC TGGTTGACCT GAGCGACAAT GCCGATGCGC TGTCCAAGCC CGTGAATGAA
GACAACTACC AAGAGGTCAC TGCCCCGCTC TGGGCCTTTC TCGAAGAGCT GACGCCGCTG
TTGTGGCGCG AGGGGCGCGC CTATCCCGCC ACTGGCCCGC GTCAGCTGCA GTTGATGAAT
GACGACGAGA TTGATCTCGC GATCTCTTTC AGCCCCGGCG AAGCGAGCAC CGCCATCGCC
AACTACCAGC TGCCCGAAAG CGTGCGCACC TTCGTGCTCG ACAAGGGCAC GATCGGCAAT
GCATCCTTTG TGGCGATCCC CTATAATTCC GGCTCCAAGG CGGCCTCCAT GGTGGTTGCG
AACTTCCTGA TGTCGCCCGA GGCCCAACTG CGCGCCCAGG ACCCGGACGT TCTGGGCTAT
GGCACCGTGC TCGATCTCAA TGCGCTCTCG GTGCAGGACC GCGCGGCCTT CCGCACGCTC
GATCTCGGGA TCGCCACCCT GACCCCCGAG GAACTCGGCC CTGTGCAGCC TGAACCGCAC
CCCAGCTGGA TGACCCGCAT CTCCGAGGAC TGGGTGGCGC GCTACGGCGT TGGCAACTGA
 
Protein sequence
MKRILSLALA LAPLSAWATP DPADWPAVVE DAKGQTVHWH AWGGSTATND FIAWVGERLE 
DDYDITLNHV KLESTADAVT RVLTEKSAGQ DDDGAVDLIW INGANFAAMK DADLLFGPFA
EALPNWQLAD TENKTLQHDF TVPTEGYESP WAMAQVVFMH DTADLPERLG SMEALLDWAR
EHPGRFTYPQ PPDFLGTTFL KQALVDLSDN ADALSKPVNE DNYQEVTAPL WAFLEELTPL
LWREGRAYPA TGPRQLQLMN DDEIDLAISF SPGEASTAIA NYQLPESVRT FVLDKGTIGN
ASFVAIPYNS GSKAASMVVA NFLMSPEAQL RAQDPDVLGY GTVLDLNALS VQDRAAFRTL
DLGIATLTPE ELGPVQPEPH PSWMTRISED WVARYGVGN
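
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a minimal sketch, not the database's own method: it strips the block spacing, computes GC percentage, and translates codons until the first stop. It assumes that the elongation codon assignments of translation table 11 match the standard genetic code and it ignores alternative start codons. All names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); assumed here
# to apply to translation table 11 for non-start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_line = "ATGAAACGCA TCTTGTCCCT TGCCCTCGCC CTTGCGCCGC TTTCGGCCTG GGCCACACCC"
print(translate(first_line))  # MKRILSLALALAPLSAWATP
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.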