Gene TM1040_2552 details

Gene Information

Locus tag: TM1040_2552
Symbol:
ID: 4076683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2694826
End bp: 2696676
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 59%
IMG OID: 638007876
Product: extracellular solute-binding protein
Protein accession: YP_614546
Protein GI: 99082392
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
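
The start, end, gene length, and protein length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length(2694826, 2696676)
print(length)                  # 1851
print(protein_length(length))  # 616
```

Both values agree with the Gene Length (1851 bp) and Protein Length (616 aa) fields of this record.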


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.437338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0565568
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCG CGGCCCTGAT GACCTCGGCG CTTGCGGTAA AGAGCACAGA GAGCGAGGCG 
ATCACCGTCA GCCATGGCTA TTCGTTTTAC GGCGATCTGG ATTATCCGGC TGATTTTGAA
CACTTCGACT ATGTAAACCC GGACGCGCCG AAGGGAGGGG AGCTTGCGAT TCCCTTCATC
GGGACTCTCG ACAGCATGAA CCCCTATTCT GGCAAGGGTC GGGCGCATGC GTTTTCTGTC
TACCCTTACG AGAGCCTCCT CTCGGATGCG GCACCTGCGG ATAAATATGG TCAGTCCTAC
TGCCTGCTCT GCGAAACCGT GGAATATCCC GAAGACAAGA GCTGGGTGAT CTTTCGCATG
CGTCCCGAGG CGCGGTTCTC CGACGGCACA CCGGTCACCG CACATGACAT CGCCTTCAGC
CACAACCTGC TGCTGGATCA AGGTTTGAAG TCCTACGCAG ACTTCATTCG CCGGCTGATC
CCAAAAGTTG AGGTGATCGA CGATCACACC ATCAAGTTCT ATTTTGCTGA TGGCGTCTCA
CGCCGGAGCC TGATCGAGCA GGTGGGCGGC GTACCGGCCT GGTCGAAGAA GTGGTTCGAT
GAAACGGGAC AGAAGCTCAA CGAGAACTGG ATCGAGGGGC CTCCGGGCTC TGGTCCCTAT
GTGATCGAGG AAATCGACCT GTCGCGTCGT ATCGTGCTGA AACGCAACCC TGATTATTGG
GGCAAGGATC TGCCGGTCAA CCAGGGGCGG CACAACTTTG ATTCGCTGCG GGTGGAAATC
TTTGCCGACG ACACGGCAGC CTTTGAGGCC TTCAAGGCGG GCGAGTATAC CTTCCGCGCC
GAAGGCGACA GCAAGAAATG GGCCAGCGGC TATGACTTTC CGAAGGTCGA TAGCGGCGCG
GTGAAACTCG AAGAGCTGCC AAATGGTGCA CCGCCTGCGT CATCGGGCAT CGTGTTCAAT
CTCGCCTCTC CCGAATTGCA AGATCGGCGT GTGCGCGAAG CGCTGGCTCT GGCGTTCAAC
TTCGAGTGGT CCAACGAGAG CCTTCTCTAT GGTCTCTTTG ATCGCCGTGC GTCGTTCACC
CAGAACTCCC CCCTGATGGC CAAAGGTGTG CCCGAGGGCG CCGAGCGCGC GTTTCTGCAG
AGCCTCGGCG ACCTTGTTCC GGACGAGATG CTGACCGAGG AGGTCTATAT CCCGCCGACC
TCCGATCCTT CGCGACTGTT TGACCGCCGC AACGCACGCA AGGCAGCGGC CTTGCTGGAT
GCGGCCGGGT ATACGGTGGG CGATGGCGGC ATGCGCATGT CTCCGGATGG CTCTGCGTTT
GAGCTGGATT TCCTGTTTTC CTCCTCATCC TCGCCCACCA CGCGCGGGGT GATGGAGAAC
TTCGTCGACA ACCTCGAAAA CCTTGGCGTT CAGGTCAATT TCGAGGTGGT GGATACGGCG
CAATACACCA GCCGGGAACG GGACCGCGAT TATGATCTGG TGGTCGACAG CTACACCACG
TTCCTTGGCA CCGGTACCGG TCTGGAGCAG CGGTTTGGGT CCGAGGCCGC AGCCTTCTCG
CTGTTCAACC CCGCCGGTTT GGCCTCCCCG CTGGTGGATG AGATCATCAC GAGATCGCTG
CACGCTGAGA CCCGCGAAGA AGAAACTACC GTCATGACGG CGCTGGATCG CGCTTTGCGG
CATGAGTTCA TCATGATCCC GCTGTGGTAT AACCCGAACC ATTGGGCCGC CTATTACGAT
CAATATGAGC ACCCGGCGGA AATCCCGCCC TATAGCCTCG GGTATCTGGA CTTCTGGTGG
TATAACGAGG ACAAGGCCAA AGCGCTGCGC GATGCTGGCG CGCTGAGGTA A
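
The gene sequence above is printed in space-separated blocks of ten bases. As a rough sketch, the blocks can be joined and the GC fraction computed as shown below. The helper name `gc_fraction` is hypothetical, and it is an assumption that the GC content field of this record was computed over the gene sequence alone.

```python
def gc_fraction(dna_blocks):
    """Fraction of G and C bases in a sequence given as whitespace-separated blocks."""
    dna = "".join(dna_blocks.split()).upper()  # drop spaces and line breaks
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in dna)
    return gc_count / len(dna)


# Toy input, not taken from the record: 6 of 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence to `gc_fraction` and rounding to a percentage gives a value that can be compared with the GC content field.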
 
Protein sequence
MSTAALMTSA LAVKSTESEA ITVSHGYSFY GDLDYPADFE HFDYVNPDAP KGGELAIPFI 
GTLDSMNPYS GKGRAHAFSV YPYESLLSDA APADKYGQSY CLLCETVEYP EDKSWVIFRM
RPEARFSDGT PVTAHDIAFS HNLLLDQGLK SYADFIRRLI PKVEVIDDHT IKFYFADGVS
RRSLIEQVGG VPAWSKKWFD ETGQKLNENW IEGPPGSGPY VIEEIDLSRR IVLKRNPDYW
GKDLPVNQGR HNFDSLRVEI FADDTAAFEA FKAGEYTFRA EGDSKKWASG YDFPKVDSGA
VKLEELPNGA PPASSGIVFN LASPELQDRR VREALALAFN FEWSNESLLY GLFDRRASFT
QNSPLMAKGV PEGAERAFLQ SLGDLVPDEM LTEEVYIPPT SDPSRLFDRR NARKAAALLD
AAGYTVGDGG MRMSPDGSAF ELDFLFSSSS SPTTRGVMEN FVDNLENLGV QVNFEVVDTA
QYTSRERDRD YDLVVDSYTT FLGTGTGLEQ RFGSEAAAFS LFNPAGLASP LVDEIITRSL
HAETREEETT VMTALDRALR HEFIMIPLWY NPNHWAAYYD QYEHPAEIPP YSLGYLDFWW
YNEDKAKALR DAGALR
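
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from the standard NCBI base ordering and translates up to the first stop codon. It is an illustration under stated assumptions rather than the database's own procedure: the names are hypothetical, and it ignores the alternative start codons that table 11 allows, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_blocks):
    """Translate whitespace-separated DNA blocks, stopping at the first stop codon."""
    dna = "".join(dna_blocks.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGTCCACCG CGGCCCTGAT GACCTCGGCG CTTGCGGTAA AGAGCACAGA GAGCGAGGCG"
print(translate(first_line))  # MSTAALMTSALAVKSTESEA
```

The output matches the first twenty residues of the protein sequence, and the final codons of the gene (GAT GCT GGC GCG CTG AGG TAA) translate to the closing residues DAGALR followed by the stop.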