Gene TM1040_0106 details


Gene Information

Locus tag: TM1040_0106
Symbol:
ID: 4078691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 112632
End bp: 113774
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 62%
IMG OID: 638005393
Product: saccharopine dehydrogenase (NADP+, L-glutamate forming)
Protein accession: YP_612101
Protein GI: 99079947
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1748] Saccharopine dehydrogenase and related proteins
TIGRFAM ID:
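
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 112632
END_BP = 113774
GENE_LENGTH_BP = 1143
PROTEIN_LENGTH_AA = 380


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1143
print(expected_protein_length(GENE_LENGTH_BP))  # 380
```

Both results agree with the Gene Length and Protein Length fields of the record.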


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATTC ACTGGTGCGG CACCGGCCTC TCCGCCATTC CCGGCCTGCG TCGCCTGCTC 
GAAGCGGGTC ACGACGTCGC CGTCTGGAAC CGCACACCCG AAAAAGCCGC CGAGGCTGTT
GGGGATCTGA CCACCAACAT CCACAAATTC TCCATTGCAC GCCTCTCGGA GCTTCTGAGC
CCGGCGGACG TCGTGGTCTC CATGCTGCCC GGCGACTGGC ATGTGGAACT CGCCGAGCTC
GCAATTTCCA AGGGGGCGCA TTTTGTGTCC TCCTCCTACA TCTCGCCGGA GATGCGCGCC
CTCGACCAAA AGGCCAAAGA CGCCGGAGTC GCGCTGGTCA ATGAGGTCGG GCTTGATCCG
GGCATCGATC ACCTGATGGC CCATGCCCTC GTGGCTGAAT ACGCGGAATC TCCGGCCTTC
GACGCGGACA ATGAGATCAG CTTTCTGTCC TATTGCGGCG GCATCCCAAA GATCCCGAAC
CCATTTCGCT ACAAGTTCAG CTGGTCGCCC CTCGGCGTGC TGAAGGCCCT GCGCTCGCCC
TCGCGCTCGA TCCGCGATTT TGAGGTTCTG GACGTGGCGC GCCCCTGGGA TGCGATCTCG
AGCTATGACG CGCCGCTTGC GACGCCCGAA ACCTTTGAGG TTTATCCCAA CCGCGACAGT
CTGCCGTTCA TGGAGCAGTA TCACTTCGAC AAGGACTGGA AGGTCAAAAC CTTCGTGCGC
GGCACCCTAC GTCTGAATGG CTGGACCGAG GCCTGGGCGG ATGTCTTCAA AGAAGTCGAA
ACGCTTGAAG GCCCCGAAGG CGATGCTCGC CTCAAGGAAA TGTCCGATCA GTTCTGGGAC
GAAAACGCCT ATGACGAAGG CGAGCCGGAT CGCGTGGTGC TCTGTGTGGA CCTCAAGGCG
GAAAAAGACG GCCAGACCAA GTGGCACAAG ACCTATGTGA TGGACGCATG GGGCGACGAG
CGCGGAAGCG CCATGGCGCG TCTGGTGTCC TATCCGGTGT CCTACGCCAT TGAGGCCGCG
ATGAACGGCA AGATCGCACC CGGCGTCAGC GCCGCGCCCA GCGATCCGGC GCTGGTGGAC
AGCTGGATGG GGCGCATCGG CGCACTGGCG CAGCACCTTC AGGTGGTGTC CCACCGCTCC
TGA
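
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation such as the GC content reported in the record. A minimal sketch of that cleanup and a GC calculation follows; the helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed fraction describes that row alone and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above (six blocks of 10 bases).
first_row = "ATGACTATTC ACTGGTGCGG CACCGGCCTC TCCGCCATTC CCGGCCTGCG TCGCCTGCTC"

print(len(clean_sequence(first_row)))   # 60
print(f"{gc_fraction(first_row):.0%}")  # GC fraction of this row only
```

Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.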
 
Protein sequence
MTIHWCGTGL SAIPGLRRLL EAGHDVAVWN RTPEKAAEAV GDLTTNIHKF SIARLSELLS 
PADVVVSMLP GDWHVELAEL AISKGAHFVS SSYISPEMRA LDQKAKDAGV ALVNEVGLDP
GIDHLMAHAL VAEYAESPAF DADNEISFLS YCGGIPKIPN PFRYKFSWSP LGVLKALRSP
SRSIRDFEVL DVARPWDAIS SYDAPLATPE TFEVYPNRDS LPFMEQYHFD KDWKVKTFVR
GTLRLNGWTE AWADVFKEVE TLEGPEGDAR LKEMSDQFWD ENAYDEGEPD RVVLCVDLKA
EKDGQTKWHK TYVMDAWGDE RGSAMARLVS YPVSYAIEAA MNGKIAPGVS AAPSDPALVD
SWMGRIGALA QHLQVVSHRS
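
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACTATTC ACTGGTGCGG CACCGGCCTC"))  # MTIHWCGTGL
```

The output matches the first block of the protein sequence listed above.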