Gene TM1040_3620 details


Gene Information

Locus tag: TM1040_3620
Symbol:
ID: 4075047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 676087
End bp: 677670
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 62%
IMG OID: 638005139
Product: saccharopine dehydrogenase
Protein accession: YP_611849
Protein GI: 99078591
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1748] Saccharopine dehydrogenase and related proteins
TIGRFAM ID:
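
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both are assumptions about this record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(676087, 677670)
print(length)                     # 1584
print(protein_length_aa(length))  # 527
```

Both results match the Gene Length (1584 bp) and Protein Length (527 aa) fields.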


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00695172
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGAGGGTTT TGATAGTCGG TGGCACTGGC GTTTTTGGCG CGCGCCTGGC CGAGCTTTTA 
GTTCGAGACG GACATGATCT GACCCTTGCG GCGCGCAATT TTAGGCGCGC GCAGCGGCTG
GCCTCCAAGC TGGGATGCGC TGCGCTGCGC CTTGATCGGC AGGGCGACCT GACCGGCATT
GCAGGCTTTG ATGTGGTGGT AGATGCTGCG GGGCCGTTTT CCACCGAAGG CAAAGACCCC
TACCGACTGG CCCGTGCCGC GTTGAAGGCA GGGCAACACT ATCTCGATCT ATCTGACAAC
GCGGCTTTTT GCGCGGGCAT TCGCAGTTTG GACGCAGAGG CGCGTGCGGC CGGGCGCGCG
GCGATTTCAG GTCTATCGAC AGTGCCCGCA CTTTCTAGTG CGGCTGTCAG AGCATTGTCT
GCGGGTGCGC GACCAGAGGT CATCGAAAGC GCGATTTTGC CGGGCAATCG CAGCCCGCGT
GGCCTTGCGG TCATGCGCTC TATTTTGATG CAGGCCGGTC GTCCCATGCG GGTCTGGCGC
GGCGGTGCAT GGGAGACGGT GTCGGGTTGG TCGCAGCCAA AGAGCTATGA TTTGCCCCAA
GGCTTGCAAC GCCAAGCGTG GCAGATCGAG GTGCCGGATC AAAGGCTCTT TCCCGATCAT
TTTGGGGCGG ACAGTGTGGC GTTCCGGGCC GGGCTCGAAC TTGCGGTCAT GCGCTATGGC
TTGGCCGCAT TTGCGTATCT GCGCAGATTG GTTCCTGTGC CTATCAACGG TTTTGTTCTG
GGGATCTTTA AACTGGGAGC CGATCTTCTG GCTCCGTTCG GGAGTGGGCG CGGCGGCATG
TCTGTCATGG TTATCACCAA TGGCGAGCGG CGTTTTTGGC GTATGCTCGC CGAGGGGGGA
GATGGGCCTT ATGTTCCCGC GAGTGCGATA CGCGCTTTGC TGCGTCGCGG TGAGTTTCCG
GTTGGGGCGC AACCCGCGCT GGAGGTGATT TCGCTCGCTG AGGCGGAGGG CGCAATGGGC
GATCTCTCAG TCACGACCGA AGTGGTCTCG GAGCCTGTGC AAGCCATCTT TCCGCGGGTT
TTGGGCGCGT CATTTGACGA CCTGCCCGAA GTCGTGCGCG CAACTCATCA GACCTCGGAC
CTGAGCCGCT GGCAGGGGCA GGCGAGTGTG CGTCGGGGTC GCAGCCTCTG GAGTCGTTTT
CTTGGTTGGG TGTTTGGATT TCCGGCCCAG GCGGCGCATA TCGATGTTGA GGTCGTAAAA
ACAGTCAGCG GCGACAGTGA GCATTGGCAA CGCCGGTTTG GGGGTCGGCT GTTTCATTCC
GTTCTGACCA GAACACCTGC GGGAATGACG GAGCGGTTTG GGCCGTTCAC GTTTCTTCTC
GGGCTTAGGG TTTCAGAGGG CGCGCTGCAT TTCCCTGTCC GCTCGGCTCG ATTGGGCCCT
CTGCCGTTGC CCCGTTGGCT CTTGCCCGTG TCGATTGCGC GAGAGCATGA GCGGGATGGA
GGCTTCTGTT TCGATGTGAA GCTTCTGACG CCGCTTACTG GAGATCTGCT GGTGCACTAT
CAGGGCCAGC TCGCCCCCGC CTAG
 
Protein sequence
MRVLIVGGTG VFGARLAELL VRDGHDLTLA ARNFRRAQRL ASKLGCAALR LDRQGDLTGI 
AGFDVVVDAA GPFSTEGKDP YRLARAALKA GQHYLDLSDN AAFCAGIRSL DAEARAAGRA
AISGLSTVPA LSSAAVRALS AGARPEVIES AILPGNRSPR GLAVMRSILM QAGRPMRVWR
GGAWETVSGW SQPKSYDLPQ GLQRQAWQIE VPDQRLFPDH FGADSVAFRA GLELAVMRYG
LAAFAYLRRL VPVPINGFVL GIFKLGADLL APFGSGRGGM SVMVITNGER RFWRMLAEGG
DGPYVPASAI RALLRRGEFP VGAQPALEVI SLAEAEGAMG DLSVTTEVVS EPVQAIFPRV
LGASFDDLPE VVRATHQTSD LSRWQGQASV RRGRSLWSRF LGWVFGFPAQ AAHIDVEVVK
TVSGDSEHWQ RRFGGRLFHS VLTRTPAGMT ERFGPFTFLL GLRVSEGALH FPVRSARLGP
LPLPRWLLPV SIAREHERDG GFCFDVKLLT PLTGDLLVHY QGQLAPA