Gene Information |
Locus tag | Elen_1347 |
Symbol | |
ID | 8415645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1614277 |
End bp | 1615482 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645024316 |
Product | aminotransferase class V |
Protein accession | YP_003181705 |
Protein GI | 257791099 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.145912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.878821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCCG TGCCGAACGA CTACGTGTAC CTCGACTACG CCGCGACGGC GCCCTTATGC GAGGAGGCCG CCGAGGCCAT GGCCCCTTAT CAGGTGCCCG GCCGCGCGAA CCTCGCGGTC GGCGGCAACG CGAATTCGCT CCACGGGCCC GGCCGTGCCG CGTTCGCCGC GCTCGAGGAA GCGCGCCGAT CCATCGCGCG CGACCTCGGC GCGCGTCGTC CCGACGAGAT CGTGTTCACC AGCGGCGCCA CCGAGGCCGA CGACGCGGCC CTGCTGGGCA TCGCGCAGGC CGCCGCGGAC GAGCGCCGTC GGCGCGGAGC AGGGGATTTC GTCCCGCACG TCGTGGTCAC CGCGGTCGAG CACGACGCTG TGCTGGCGCC CGCGAAGCGT CTGGAATCGC AGGGTTTCCG CGTCACGCGG CTCGCCCCGA ACCGTCAGGG CTTCATCGAG GAGCGCGCGT TGGAGGCGGC GCTCGACGCC GATACGGTGC TCGTGTCGGT GCAGGCCGCC AACAGCGAAG TCGGCAGCAT CCAGCCCATC GCCGATCTCG CCCGTGTCGC GCACGATCAT GCCGCGCTGT TCCACACCGA TGCCGTGCAG GCGCTGGGGA AAGCTCGCGT GAACCTGCAG GAGCTCGACG TGGACGCCGC GTCCTTCTCG GCTCATAAGG TGGGCGGCCC CAAAGGCGCC GGCGCGCTGT ATCTGCGCGC CCGCACGCCG TTTCATGCCT ACGCTATTGG CGGCGGCCAG GAAGGAGGCC GGCGCAGCGG CACGCAGAAC GTGGCCGGCA TCGTCGGGTT CGCGGCAGCC GTGCATGCGG CGACCGCGAT GCAGGAGGCG GAGGCGGCTC GCCTGCGGGT TCTGCGCGAC AGGCTGTACG AGCGGCTGGG CGCCATCGAC GCGGTGGAGG CCACCGTGGA CGTTGCGCCG GGCAGCGAGG ATTTCCTTCC GAACATCGTG CATGTGCTGG TGGACGGTTT GGAAAGCGAA ACGCTCATCC TTCGCTTCGA CATGCAGGGC TTCGGCGTGT CGGGCGGGTC TGCCTGCTCG TCGCACTCGC TGGAACCCAG CCACGTGCTG CGTTCCCTCG GCATCGACGC CGACCGCGCG CACGGCGCCC TGCGCATCTC GATGGGGCGC TACACCGACG AAGCCGATGT CGAAGCCTTC GCGGTCGCCA TGGAGAAGAG CCTGAACTGG AACTGA
|
Protein sequence | MASVPNDYVY LDYAATAPLC EEAAEAMAPY QVPGRANLAV GGNANSLHGP GRAAFAALEE ARRSIARDLG ARRPDEIVFT SGATEADDAA LLGIAQAAAD ERRRRGAGDF VPHVVVTAVE HDAVLAPAKR LESQGFRVTR LAPNRQGFIE ERALEAALDA DTVLVSVQAA NSEVGSIQPI ADLARVAHDH AALFHTDAVQ ALGKARVNLQ ELDVDAASFS AHKVGGPKGA GALYLRARTP FHAYAIGGGQ EGGRRSGTQN VAGIVGFAAA VHAATAMQEA EAARLRVLRD RLYERLGAID AVEATVDVAP GSEDFLPNIV HVLVDGLESE TLILRFDMQG FGVSGGSACS SHSLEPSHVL RSLGIDADRA HGALRISMGR YTDEADVEAF AVAMEKSLNW N
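The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence above ends in TGA, which is consistent with that assumption).

```python
# Values copied from the record above.
START_BP = 1614277
END_BP = 1615482
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones reported in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length fields.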