Gene Information |
Locus tag | TM1040_3620 |
Symbol | |
ID | 4075047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 676087 |
End bp | 677670 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638005139 |
Product | saccharopine dehydrogenase |
Protein accession | YP_611849 |
Protein GI | 99078591 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1748] Saccharopine dehydrogenase and related proteins |
TIGRFAM ID | |
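
The length fields above are consistent with the coordinates if Start bp and End bp are read as inclusive positions, and if the protein length counts one residue per codon excluding the terminal stop codon. A minimal sketch of that arithmetic follows; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: one per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(676087, 677670)
print(length)                  # 1584
print(protein_length(length))  # 527
```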
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00695172 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAGGGTTT TGATAGTCGG TGGCACTGGC GTTTTTGGCG CGCGCCTGGC CGAGCTTTTA GTTCGAGACG GACATGATCT GACCCTTGCG GCGCGCAATT TTAGGCGCGC GCAGCGGCTG GCCTCCAAGC TGGGATGCGC TGCGCTGCGC CTTGATCGGC AGGGCGACCT GACCGGCATT GCAGGCTTTG ATGTGGTGGT AGATGCTGCG GGGCCGTTTT CCACCGAAGG CAAAGACCCC TACCGACTGG CCCGTGCCGC GTTGAAGGCA GGGCAACACT ATCTCGATCT ATCTGACAAC GCGGCTTTTT GCGCGGGCAT TCGCAGTTTG GACGCAGAGG CGCGTGCGGC CGGGCGCGCG GCGATTTCAG GTCTATCGAC AGTGCCCGCA CTTTCTAGTG CGGCTGTCAG AGCATTGTCT GCGGGTGCGC GACCAGAGGT CATCGAAAGC GCGATTTTGC CGGGCAATCG CAGCCCGCGT GGCCTTGCGG TCATGCGCTC TATTTTGATG CAGGCCGGTC GTCCCATGCG GGTCTGGCGC GGCGGTGCAT GGGAGACGGT GTCGGGTTGG TCGCAGCCAA AGAGCTATGA TTTGCCCCAA GGCTTGCAAC GCCAAGCGTG GCAGATCGAG GTGCCGGATC AAAGGCTCTT TCCCGATCAT TTTGGGGCGG ACAGTGTGGC GTTCCGGGCC GGGCTCGAAC TTGCGGTCAT GCGCTATGGC TTGGCCGCAT TTGCGTATCT GCGCAGATTG GTTCCTGTGC CTATCAACGG TTTTGTTCTG GGGATCTTTA AACTGGGAGC CGATCTTCTG GCTCCGTTCG GGAGTGGGCG CGGCGGCATG TCTGTCATGG TTATCACCAA TGGCGAGCGG CGTTTTTGGC GTATGCTCGC CGAGGGGGGA GATGGGCCTT ATGTTCCCGC GAGTGCGATA CGCGCTTTGC TGCGTCGCGG TGAGTTTCCG GTTGGGGCGC AACCCGCGCT GGAGGTGATT TCGCTCGCTG AGGCGGAGGG CGCAATGGGC GATCTCTCAG TCACGACCGA AGTGGTCTCG GAGCCTGTGC AAGCCATCTT TCCGCGGGTT TTGGGCGCGT CATTTGACGA CCTGCCCGAA GTCGTGCGCG CAACTCATCA GACCTCGGAC CTGAGCCGCT GGCAGGGGCA GGCGAGTGTG CGTCGGGGTC GCAGCCTCTG GAGTCGTTTT CTTGGTTGGG TGTTTGGATT TCCGGCCCAG GCGGCGCATA TCGATGTTGA GGTCGTAAAA ACAGTCAGCG GCGACAGTGA GCATTGGCAA CGCCGGTTTG GGGGTCGGCT GTTTCATTCC GTTCTGACCA GAACACCTGC GGGAATGACG GAGCGGTTTG GGCCGTTCAC GTTTCTTCTC GGGCTTAGGG TTTCAGAGGG CGCGCTGCAT TTCCCTGTCC GCTCGGCTCG ATTGGGCCCT CTGCCGTTGC CCCGTTGGCT CTTGCCCGTG TCGATTGCGC GAGAGCATGA GCGGGATGGA GGCTTCTGTT TCGATGTGAA GCTTCTGACG CCGCTTACTG GAGATCTGCT GGTGCACTAT CAGGGCCAGC TCGCCCCCGC CTAG
|
Protein sequence | MRVLIVGGTG VFGARLAELL VRDGHDLTLA ARNFRRAQRL ASKLGCAALR LDRQGDLTGI AGFDVVVDAA GPFSTEGKDP YRLARAALKA GQHYLDLSDN AAFCAGIRSL DAEARAAGRA AISGLSTVPA LSSAAVRALS AGARPEVIES AILPGNRSPR GLAVMRSILM QAGRPMRVWR GGAWETVSGW SQPKSYDLPQ GLQRQAWQIE VPDQRLFPDH FGADSVAFRA GLELAVMRYG LAAFAYLRRL VPVPINGFVL GIFKLGADLL APFGSGRGGM SVMVITNGER RFWRMLAEGG DGPYVPASAI RALLRRGEFP VGAQPALEVI SLAEAEGAMG DLSVTTEVVS EPVQAIFPRV LGASFDDLPE VVRATHQTSD LSRWQGQASV RRGRSLWSRF LGWVFGFPAQ AAHIDVEVVK TVSGDSEHWQ RRFGGRLFHS VLTRTPAGMT ERFGPFTFLL GLRVSEGALH FPVRSARLGP LPLPRWLLPV SIAREHERDG GFCFDVKLLT PLTGDLLVHY QGQLAPA
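
The gene and protein sequences can be cross-checked against each other and against the listed GC content. The sketch below assumes the standard codon assignments (which translation table 11 shares) and treats an initial ATG, GTG, or TTG as methionine. That start-codon set is a simplification of table 11 chosen for illustration, and all names in the block are hypothetical helpers rather than anything provided by the source page.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# assignments and differs mainly in which codons may initiate translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG or TTG is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGAGGGTTT TGATAGTCGG TGGCACTGGC"))  # MRVLIVGGTG
```

Passing the full gene sequence to `translate_cds` should give a string that can be compared with the protein sequence once its block spacing is removed, and `gc_percent` on the same input can be compared with the GC content listed in the gene table.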