Gene TM1040_1493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1493 
Symbol 
ID4077049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1598143 
End bp1599129 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content59% 
IMG OID638006806 
ProductABC transporter, periplasmic substrate-binding protein 
Protein accessionYP_613488 
Protein GI99081334 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.140791 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACG TGCTAACAGC CGCCGCCATG GCACTGATGG CAACCTCTGC TGCGCAGGCC 
GCAGATGAGG TGAAGCTGCA ACTGAAATGG GTCACCCAGG CGCAGTTTGC AGGCTACTAT
GTGGCGCTGG ACAAAGGGTT TTACGGGGAA GAAGAGCTGG ATGTGACCAT TCTTCCCGGT
GGCCCCGACA TCGCGCCCAC GCAGGTGATC GCAGGGGGCG GCGCGGATGT GACCGTGGAA
TGGATGCCCG CAGCTCTGGC TGCGCGGGAA AAGGGGCTGC CGCTGGTCAA CATTGCCCAG
CCGTTCAAAT CTTCTGGCAT GATGCTCACC TGCTGGAAAG ACACCGGCAT CTCTGCACCC
AAGGATCTCG CGGATCGCAC TCTTGGGGTC TGGTTCTTCG GCAATGAGTT CCCCTTCATG
AGCTGGATGG GCAAGCTCGG TATCTCGACA GAGGGCAAGG GTCCCGAAGG CGTAGAGGTT
TTGAAACAGG GCTTTAACGT CGATCCTCTG CTGCAGCGGC AGGCCGATTG CATCTCCACC
ATGACCTATA ACGAATATTG GCAGGTGATT GATGCGGGGG TCTCGGCTGA TGAGCTTGTG
ACCTTCAAAT ACGAGGACCA AGGCGTCGCA ACGCTCGAGG ATGGCCTCTA TGTGCTCGAG
GACAACCTCT CCGATCCGGC CTTCGTCGAC AAAATGGAGC GCTTTGTCCG AGCGTCTATG
AAGGGTTGGA AATACGCCGA AGAGAACCCG GAGGAGGCCG CAGAAATCGT GCTCGACAAT
GATGCCTCTG GTGCCCAGAC CGAAACCCAC CAAAAGCGGA TGATGGGCGA GATTGCCAAA
CTCACTTCCG GCAGCAATGG CGCGCTTGAC GTGGCAGACT ACGAGCGCAC GGTGCAGACC
CTGCTGAGCG GCGGCTCTGA CCCGGTCATC ACCAAGGCGC CTGAAGGGGC GTGGACCCAT
GCGATCACCG ATGCGGCGCT GAACTGA
 
Protein sequence
MKNVLTAAAM ALMATSAAQA ADEVKLQLKW VTQAQFAGYY VALDKGFYGE EELDVTILPG 
GPDIAPTQVI AGGGADVTVE WMPAALAARE KGLPLVNIAQ PFKSSGMMLT CWKDTGISAP
KDLADRTLGV WFFGNEFPFM SWMGKLGIST EGKGPEGVEV LKQGFNVDPL LQRQADCIST
MTYNEYWQVI DAGVSADELV TFKYEDQGVA TLEDGLYVLE DNLSDPAFVD KMERFVRASM
KGWKYAEENP EEAAEIVLDN DASGAQTETH QKRMMGEIAK LTSGSNGALD VADYERTVQT
LLSGGSDPVI TKAPEGAWTH AITDAALN