Gene TM1040_2745 details


Gene Information

Locus tag: TM1040_2745
Symbol:
ID: 4077617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2889841
End bp: 2891496
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 59%
IMG OID: 638008070
Product: extracellular solute-binding protein
Protein accession: YP_614739
Protein GI: 99082585
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
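
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length; the record states neither convention, so both are assumptions. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2889841
end_bp = 2891496

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1656
print(protein_length)  # 551
```

Both values agree with the Gene Length and Protein Length fields under these assumptions.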


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCA CTGCTACCGC CGGGCTCTTG GCCTCGGTTG CGCTCTGGAC CACTGCCGCA 
CAGGCAGAGG AAACCGTGCT GAGCGCGTTG CCGGAACAGG TCACCGCTTG GGTGGAGAAC
TTCAACCCGT TCAACCAGAC CACCGCGGCG CCGTCCGTCA TGCATTTCAT GTACGAGCCG
CTGATCATTT TCAACGCGCT CGACGGCGGC AAGCCGATCT ACCGTCTGGC GACCGCATTT
GAGTATTCCG ATGATCTGAG TTCCATCACC GTGACCCTGC GCGATGGCGT TCAGTGGTCC
GATGGCGAAG CATTCACCGC CGATGATGTG GTCAAATCCT TTGATCTGGC GCTGAGCGAT
CCGGCCCTCG ACAGCGTCGG CATGGCGCAG ATGCTCTCTG GTGTCGAGAA ACTCGACGAG
ATGACGGTGA AGTTCAACCT CTCCACCCCT TCAAGCCAGG CCATGTACCA GATCGTGCGT
GTGCCAATCG TCCCCGAACA CGTCTGGAGC AACGTCTCCG ATCCCGTGAC CTTTACCAAC
CCTGATCCCG TGGGGTCCGG TCCGCTGACA GAGATCCGCC GTTTCACGCC GCAGGAATAC
ATCCAGTGCC GCAACAACAA TTACTGGGAC GCTGAGAGCC TCAAGGTCGA CTGCATGCGT
TTCCCGCAGA TCGCCAACAA CGATCAGGCG CTGGCAGCGG CCGCGAATGG CGAACTGGAC
TGGATGGGGT CCTTCCTGCC CGACATCGAC AACACCTTTG TCGCCAAGGA CCCCGAGCAT
CACAGCTATT GGCTGCCCGC AGGCTCTCTC GTGGCGTTCT ACATGAACTT CGAGGCCAAA
GAAGCCGGCG ACAAAGAAGC CGTGAACAAC GTCGCCTTCC GCCGCGCGGT GTCGATGGCC
TTCGACCGCG AAGCGATGGT GGAAATTGCA GGCTATGGCT ATCCGACGAT CAACCAGTAT
CCCTCTGGTC TGGGCCGCGC TTATCACGCG TGGAACAACC CCGAAGTCGA GGACAAATTT
GGCGCGTTTA CCCAATATGA CATCGAGGGC GCCAAGGCAC AGCTGGCCGA GGCCGGGTTC
AAGGACATTG ACGGTGACGG CTTTGTGGAA ACCCCAAGCG GTGAGCAGAT CGACATTGAA
GTCATCGTGC CCAACGGCTG GACCGACTGG GTCAACAGCA GCCAGATCGC GGTCGAGGGC
CTGAATGCAG CCGGGATCAA GGCCAATGTC TCAACACCTG AATCCGCGAT CTGGACCGAA
AAGCTGATCA AGGGCGACTA TGACATGGCG ATCAACTCGG TTCGTGTTGG TGCGACCCCC
TTCAACCAGT ATCTGGACTC GCTCCACGAG ATTAATCAGG CCAAGTCGCG TTTTGCCGCG
TCGCGGTACT ACAACGAAGA GCTGAGCGAC CTTCTGGATG CCTTCACCCA GACCAGCGAC
ACCGACAAGC AGATGGCGAT CATGTCCGAT GTACAAGAGA TCGTCGGTGA AGACATGCCG
CTGGCCTATG TGTTCAACAA CCCGCGCTGG TATCAGTACA ACACCAAGCG TTTCGAAGGC
TTCTTCAACG CTGACAACCC GGTGGCCAAC CCGGTGGTTC ACAAAACCAA CCCGGCCCGT
CTGATCCACC TCCTGAACCT GCGCCCGGTC GAGTAA
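
The gene sequence is displayed in blocks of 10 bases, 60 per line. A minimal sketch of how such a block-formatted sequence might be cleaned before computing its length or GC fraction follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGAAATTCA CTGCTACCGC CGGGCTCTTG GCCTCGGTTG CGCTCTGGAC CACTGCCGCA"

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying the same two helpers to all displayed lines joined together would give the figures to compare against the Gene Length and GC content fields.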
 
Protein sequence
MKFTATAGLL ASVALWTTAA QAEETVLSAL PEQVTAWVEN FNPFNQTTAA PSVMHFMYEP 
LIIFNALDGG KPIYRLATAF EYSDDLSSIT VTLRDGVQWS DGEAFTADDV VKSFDLALSD
PALDSVGMAQ MLSGVEKLDE MTVKFNLSTP SSQAMYQIVR VPIVPEHVWS NVSDPVTFTN
PDPVGSGPLT EIRRFTPQEY IQCRNNNYWD AESLKVDCMR FPQIANNDQA LAAAANGELD
WMGSFLPDID NTFVAKDPEH HSYWLPAGSL VAFYMNFEAK EAGDKEAVNN VAFRRAVSMA
FDREAMVEIA GYGYPTINQY PSGLGRAYHA WNNPEVEDKF GAFTQYDIEG AKAQLAEAGF
KDIDGDGFVE TPSGEQIDIE VIVPNGWTDW VNSSQIAVEG LNAAGIKANV STPESAIWTE
KLIKGDYDMA INSVRVGATP FNQYLDSLHE INQAKSRFAA SRYYNEELSD LLDAFTQTSD
TDKQMAIMSD VQEIVGEDMP LAYVFNNPRW YQYNTKRFEG FFNADNPVAN PVVHKTNPAR
LIHLLNLRPV E
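
The protein sequence can be related back to the gene sequence by codon translation. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons it permits, and that is not modelled here). The function name `translate` is hypothetical, and only the first 30 bases of the gene are translated.

```python
from itertools import product

# Standard amino-acid assignments with codons enumerated in T, C, A, G order.
# Translation table 11 uses these same assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned nucleotide sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record, with spaces removed.
print(translate("ATGAAATTCACTGCTACCGCCGGGCTCTTG"))  # MKFTATAGLL
```

The result matches the first ten residues of the protein sequence shown above. The final six bases of the gene, GAGTAA, translate to E followed by a stop, which is consistent with the protein ending in E.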