Gene TM1040_2686 details

Gene Information

Locus tag: TM1040_2686
Symbol:
ID: 4077597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2825147
End bp: 2826733
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 62%
IMG OID: 638008011
Product: extracellular solute-binding protein
Protein accession: YP_614680
Protein GI: 99082526
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
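
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2825147
END_BP = 2826733
GENE_LENGTH_BP = 1587
PROTEIN_LENGTH_AA = 528


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions, the span 2825147 to 2826733 covers 1587 bp, and 1587 / 3 = 529 codons, one of which is the stop codon, leaving 528 amino acids.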


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000382125
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.980281
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTTC TAGGGATGAC GCGGCGTGGC GCGATGGCTG CGATGCTTGC GACGACGGCA 
ATGGCGGGGG TGGCGATGGG CGTGGCGCCT GCCGCAGCGC AGACACCGCC TGGCGTGCTG
ATCGTGGGCC AGATCGCAGA GCCAAAAGCG CTGGACCCGG CGGCAGTGAC GGCGGTAAAT
GACTTCCGCA TCCTGATGAA CGTCTATGAC GGTCTGGTGC GCTACAAGGA CGGCACGCTC
GAGGTCGAAC CCGCGCTGGC GACCGACTGG AGCATCTCCG AAGATGGCAC CGAATATACA
TTCACGCTGC GCGAAGGGGT GTCGTTTCAT GACGGCAGCG CCTTTGATGC CGAGGCGGTG
GTGTTCAACT TTGAGCGCAT GCTCAATGAG GATCACCCCT ATCACAACAC CGGCCCCTTC
CCGCTGGCCT TCTTCTTTTC TGCCGTGGAG AGCGTCGAGG CCGTTGATGA TCTGACGGTG
AAATTCAAAC TGAACGCGCC CTATGCGCCG TTCCTGTCGA ATCTCGCTTA TCCTACAGGC
CTGATTGTAT CGCCTGAGGC GGTCAAGACC CATGGCGCGG AGTTCGGCCG CAACCCCTCC
GGCACCGGTG CTTTCAAATT TGCCGAGTGG CGCTCCAATG AGGCCGTGGT GGTCGAGAAA
AATCCCGACT ACTGGGATGG CGCGGCAGAG CTGGACGCGG TGGTCTTTCG CCCGATCACC
GATGCCAACA CCCGCACGGC AGAAATGCTG GCAGGTGGCA TTGATCTGAT GGTCGAGGTG
CCGCCGGTGG CACTGTCGGA GTTTCAGGGC GATGCTTTCA CCGTGCATGA ACAAGCCGGC
CCGCACGTCT GGTTCCTGAT CCTCAACGCC AAGGAAGGCC CCTTTGCCGA CAAGCGCGTC
CGCCAAGCGG CGAATTACGC GATCAACAAA TCCGCGATTG TGAACGATGT GCTTGAGGGC
ACGGCGGAGG TGGCCGCAGG CCCGACCCCG CCCGCCTTTG CCTGGGCCTA CAATGAAACG
CTCGAACCCT ATCCCTATGA CCCCGACAAG GCGCGGGAAC TCCTGGCCGA GGCGGGTGCA
GAAGGGGCGG AGCTGACGTT CTATGTGACC GAGGGCGGCT CCGGCATGCT CGACCCTATC
GCCATGGGCA CTGCCATTCA GGCGGATCTC AACGCCGTGG GGCTGGATGT GAAGATCGAA
ACCTACGAGT GGAACACCTT CCTGGGCGAG GTCAATCCGG GGCTGGAGGG CAAGGCCGAC
ATGGCCGAGA TGGCCTGGAT GACCAACGAC CCCGACACGC TCCCCTTCCT GGCGCTGCGC
ACCGAAGCCT GGCCTGACAA GGGCGGCTTC AACTCCGGCT ATTATTCCAA CCCGAAGGTG
GATGAGCTGT TGGAAGCGGC CCGCGTTGCG ACCGATCAGG ACGAGCGCGC CAAGCTTTAT
CAGGAGATGC AGACCATCGT GCAGGAAGAT GCGCCTTGGG TCTTTGTCGC CAACTGGAAG
CAGAATGCAG TGACCTCGGA TCGGGTGGGC GATTTTGCCC TGCAGCCCTC GTTCTTCCTG
CTGCTCGATG ATGTGACCAA GAACTGA
 
Protein sequence
MKLLGMTRRG AMAAMLATTA MAGVAMGVAP AAAQTPPGVL IVGQIAEPKA LDPAAVTAVN 
DFRILMNVYD GLVRYKDGTL EVEPALATDW SISEDGTEYT FTLREGVSFH DGSAFDAEAV
VFNFERMLNE DHPYHNTGPF PLAFFFSAVE SVEAVDDLTV KFKLNAPYAP FLSNLAYPTG
LIVSPEAVKT HGAEFGRNPS GTGAFKFAEW RSNEAVVVEK NPDYWDGAAE LDAVVFRPIT
DANTRTAEML AGGIDLMVEV PPVALSEFQG DAFTVHEQAG PHVWFLILNA KEGPFADKRV
RQAANYAINK SAIVNDVLEG TAEVAAGPTP PAFAWAYNET LEPYPYDPDK ARELLAEAGA
EGAELTFYVT EGGSGMLDPI AMGTAIQADL NAVGLDVKIE TYEWNTFLGE VNPGLEGKAD
MAEMAWMTND PDTLPFLALR TEAWPDKGGF NSGYYSNPKV DELLEAARVA TDQDERAKLY
QEMQTIVQED APWVFVANWK QNAVTSDRVG DFALQPSFFL LLDDVTKN
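
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the tool that generated this record. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helpers `clean`, `gc_percent`, and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table in the compact T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAACTTC TAGGGATGAC GCGGCGTGGC"))  # MKLLGMTRRG
# Final line of the gene sequence above (already in frame).
print(translate("CTGCTCGATG ATGTGACCAA GAACTGA"))     # LLDDVTKN
```

Pasting the full gene sequence into `translate` should give the listed protein sequence if the assumptions hold, and `gc_percent` applied to the same string can be compared with the GC content field.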