Gene TM1040_2713 details

Gene Information

Locus tag: TM1040_2713
Symbol:
ID: 4077020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2857581
End bp: 2859170
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 60%
IMG OID: 638008038
Product: extracellular solute-binding protein
Protein accession: YP_614707
Protein GI: 99082553
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
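
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that counts the terminal stop codon; neither is stated on this page, but both are consistent with the listed values. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2857581, 2859170)
print(length)                     # 1590
print(protein_length_aa(length))  # 529
```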


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAAACACC TGAAGACAAC TCTGCTCGCC AGCGCCTTGA TGCTGCCGCT TGCAGCCCCT 
GCCGTACTGG CTGATACGCC CGAAGGCGTG CTCGTGGTTG CACAGAACAT CGACGATGTC
GTCGCCATCG ACCCGGCGCA GGCCTATGAG TTCACCTCCG GCGAGCTCGT GACCAACCTC
TATGACCGTC TGGTGCAATA CGATGCCGAA GACACCACCG TTCTGGCCGC AGGTCTGGCC
TCGGAATGGG TCACCGATGC GGATGCCAAG ACCATCACTT TCACCCTGCG CGATGGGGCG
ACCTTTGCCT CTGGCAATCC CGTGACCGCA GAAGATGTGG TCTATTCCTT CTCCCGCGTG
GTGAAGCTGA ACCTGACCCC GGCGTTCATC CTGACTCAAC TGGGCTGGAC GGCAGATAAC
ATCGGCGAAA TGGTCACGGG CGAGGGCAAC ACCGTCACCG TGAAATACGC CGGCGACTTC
TCTCCGGCGT TTGTACTGAA CGTCCTGGCG GCACGTCCTG CCTCCATCGT CGACAGCAAG
CTGGTGCAGG AAAACGAAGT TGACGGCGAC ATGGGCAATG CCTGGCTCAA CGCCAATGCC
GCCGGTTCTG GCCCCTTCAC GCTGCAACGC TATGCGGCGG GTCAGATGGT GCGCATGCAG
GCCAATCCGA CCTATTTCAA CGGCGCGCCC AAGATCGACA GCGTGATCAT CCGCCATGTG
GCCGAAAGCG CGACCCAGCA GCTTTTGCTG GAGCAGGGCG ACGTGGATCT GGCCCGCAAC
ATGACACCCG ATCAGGTGGC TTCCCTTGAA AGCGGCGAGA TCAAGGTCGA GACATACCCG
CAGGCGGCTG TGCATTTCCT GTCGTTCAAC CAGAAGACCG AGAGCCTTAC GCCCCCCGCC
GTTTGGGAAG CCGCGCGCTA TCTGGTGGAC TACAAGGGCA TGACCGAGAC CATCATCAAA
GGTCAGATGG AAGTCCACCA GGCGTTCTGG CCCAAGGGCT TCCCCGGTTC CTATGACGAA
ACGCCGTTCT CTTATAACCC GGAAAAAGCC AAGAGCATTC TGTCCGAGGC CGGGATCGAG
ACCCCGATCA CCGTGTCGCT CGACGTGATC AACGCCGCGC CCTTTACCGA CATGGCGCAA
TCGTTGCAGG CGAGCTTTGC CGATGCGGGC ATCAACTTTG AGATCCTGCC CGGCACCGGC
AGCCAGGTCA TCACCAAGTA CCGCGAGCGC AGCCATGAGG CGATGCTGCT GTACTGGGGC
CCGGACTTCA TGGATCCGCA CTCCAACGCC AAGGCCTTCG CCTATAACTC CAACAACGCA
GACGACTCCT ATGCCGCCAC AACCACATGG CGCAATGCAT GGGCCGTGCC GGATGCGCTC
AACGAGAAAA CCATGGCGGC TCTGACCGAG AGCGACGCCG AGGCCCGTCT CAACATGTAT
CGCGAGCTGC AAAAAGAAGT GCAGGCCGAG TCGCCCATCG TGATCATGTT CCAGGCCGCC
TATCAGGTTG CCATGAACGA GGCCGTTTCT GGCTATGTGA ACGGCGCCAC CTCGGATTTT
GTCTTCTACC GTCTGGTTGA AAAACAGTAA
 
Protein sequence
MKHLKTTLLA SALMLPLAAP AVLADTPEGV LVVAQNIDDV VAIDPAQAYE FTSGELVTNL 
YDRLVQYDAE DTTVLAAGLA SEWVTDADAK TITFTLRDGA TFASGNPVTA EDVVYSFSRV
VKLNLTPAFI LTQLGWTADN IGEMVTGEGN TVTVKYAGDF SPAFVLNVLA ARPASIVDSK
LVQENEVDGD MGNAWLNANA AGSGPFTLQR YAAGQMVRMQ ANPTYFNGAP KIDSVIIRHV
AESATQQLLL EQGDVDLARN MTPDQVASLE SGEIKVETYP QAAVHFLSFN QKTESLTPPA
VWEAARYLVD YKGMTETIIK GQMEVHQAFW PKGFPGSYDE TPFSYNPEKA KSILSEAGIE
TPITVSLDVI NAAPFTDMAQ SLQASFADAG INFEILPGTG SQVITKYRER SHEAMLLYWG
PDFMDPHSNA KAFAYNSNNA DDSYAATTTW RNAWAVPDAL NEKTMAALTE SDAEARLNMY
RELQKEVQAE SPIVIMFQAA YQVAMNEAVS GYVNGATSDF VFYRLVEKQ
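
The relationship between the gene sequence, translation table 11 and the protein sequence can be illustrated with a short sketch. It builds the table 11 codon map from its standard layout (bases ordered T, C, A, G), reads an initiator codon such as GTG as methionine, and stops at the first stop codon. It also includes a helper for the GC fraction. The names `translate_cds` and `gc_fraction` are hypothetical helpers written for this example, and whitespace in the input is ignored so that the 10-base blocks shown above can be pasted directly.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Codons that table 11 permits as translation initiators.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove whitespace and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initiator first codon as M."""
    seq = clean(dna)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence must be a non-empty multiple of 3 bases")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    first = codons[0]
    protein = ["M" if first in START_CODONS else CODON_TABLE[first]]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence listed above.
first_row = "GTGAAACACC TGAAGACAAC TCTGCTCGCC AGCGCCTTGA TGCTGCCGCT TGCAGCCCCT"
print(translate_cds(first_row))  # MKHLKTTLLASALMLPLAAP
```

Passing the full gene sequence to `translate_cds` should reproduce the protein sequence listed above, and `gc_fraction` on the full sequence can be compared against the GC content field.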