Gene TM1040_2656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2656 
Symbol 
ID4077567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2791376 
End bp2792497 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content61% 
IMG OID638007980 
Productsaccharopine dehydrogenase 
Protein accessionYP_614650 
Protein GI99082496 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1748] Saccharopine dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATGGA ACATCTGTGT TGTGGGCGCG GGCAAGATCG GCCAGGCGAT TGCCACGTTT 
TTAAAGACAT CTGCCAACTA TCAGGTGACC CTCGCGGATC ATGACCTGAA TGCGCTGGGC
GCGGTGGCGG AGCTGGGGGT GCCGACCCGG CAGATCGACG CCAAGGATCC GGTGTCGCTG
GCAAAGGGGC TTCAGGGGTT TGACGCGGTG ATTTCTGCCG CGCCGTTCTT TTTGACGCCA
ATGATAGCGG AGGCCGCAAA AACCGCCGGC GCGCATTATT TCGACCTCAC CGAGGATGTG
GCCGCCACCG AAGCGGTGCG CAAACTGGCC GAAGGCAGCG AGACGGTGTT CATGCCCCAG
AGCGGCCTTG CGCCCGGTTT CGTGGGCATC GCGGGCGCGT CACTGGCGGC AGAATTCGAT
GAGCTGGACA GCCTGCACAT GCGGGTCGGC GCGCTGCCGA AGTTTCCGAC CAACGCGTTG
AAATATAATC TCACCTGGTC CACCGACGGG CTGATCAACG AGTATTGCAA CCCCTGCGAT
GCCATCGTGA ATGGCGCGCG CACCAAGACA GCGCCGCTCG AAGATTACGA GCGTCTGAGC
CTTGATGGGG TTGAGTATGA ATGCTTCAAC ACCTCGGGTG GGCTTGGCAC CTTGCCAGAG
ACGCTGGACG GGAAGGCGCG GGCGGTCTCG TATCGGTCGA TCCGCTATCC CGGTCACTGC
GACATCCTGA AAATGCTGCT GCATGATCTG GGGCTGGAAC GCCGCCGCGA CCTGATGAAA
GAGATTTTCG AGAGCGCATT GCCGCGCACC GATCAGGACG TGGTGCTGGT CTATTGCACC
GCGCGGGGCC GCATCAATGG CGAGCTGCGT GAAAAGAGCC TCATCAACAA GAGCTATGCC
CGCCAGATCG GCGGCAAGAC CTGGAGCGCG ATCCAAGTCA CCACCACCGC CGGGGTGCTG
GGGGTTGTGG ATCTGGTGCG GCAAGGCGTC CTGCCCGCGC GCGGCTTTGT AAGCCAGGAA
CAGGTGAAGC TGCAGGACTT CCTCGAGACA GAATTTGGCC AGCTCTACCG GGCGGGCGAC
ATCGACCACA TGACAGACAC AACAAAATTG GCAGCTGAGT GA
 
Protein sequence
MQWNICVVGA GKIGQAIATF LKTSANYQVT LADHDLNALG AVAELGVPTR QIDAKDPVSL 
AKGLQGFDAV ISAAPFFLTP MIAEAAKTAG AHYFDLTEDV AATEAVRKLA EGSETVFMPQ
SGLAPGFVGI AGASLAAEFD ELDSLHMRVG ALPKFPTNAL KYNLTWSTDG LINEYCNPCD
AIVNGARTKT APLEDYERLS LDGVEYECFN TSGGLGTLPE TLDGKARAVS YRSIRYPGHC
DILKMLLHDL GLERRRDLMK EIFESALPRT DQDVVLVYCT ARGRINGELR EKSLINKSYA
RQIGGKTWSA IQVTTTAGVL GVVDLVRQGV LPARGFVSQE QVKLQDFLET EFGQLYRAGD
IDHMTDTTKL AAE