Gene Elen_1347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1347 
Symbol 
ID8415645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1614277 
End bp1615482 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content70% 
IMG OID645024316 
Productaminotransferase class V 
Protein accessionYP_003181705 
Protein GI257791099 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.145912 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.878821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCCG TGCCGAACGA CTACGTGTAC CTCGACTACG CCGCGACGGC GCCCTTATGC 
GAGGAGGCCG CCGAGGCCAT GGCCCCTTAT CAGGTGCCCG GCCGCGCGAA CCTCGCGGTC
GGCGGCAACG CGAATTCGCT CCACGGGCCC GGCCGTGCCG CGTTCGCCGC GCTCGAGGAA
GCGCGCCGAT CCATCGCGCG CGACCTCGGC GCGCGTCGTC CCGACGAGAT CGTGTTCACC
AGCGGCGCCA CCGAGGCCGA CGACGCGGCC CTGCTGGGCA TCGCGCAGGC CGCCGCGGAC
GAGCGCCGTC GGCGCGGAGC AGGGGATTTC GTCCCGCACG TCGTGGTCAC CGCGGTCGAG
CACGACGCTG TGCTGGCGCC CGCGAAGCGT CTGGAATCGC AGGGTTTCCG CGTCACGCGG
CTCGCCCCGA ACCGTCAGGG CTTCATCGAG GAGCGCGCGT TGGAGGCGGC GCTCGACGCC
GATACGGTGC TCGTGTCGGT GCAGGCCGCC AACAGCGAAG TCGGCAGCAT CCAGCCCATC
GCCGATCTCG CCCGTGTCGC GCACGATCAT GCCGCGCTGT TCCACACCGA TGCCGTGCAG
GCGCTGGGGA AAGCTCGCGT GAACCTGCAG GAGCTCGACG TGGACGCCGC GTCCTTCTCG
GCTCATAAGG TGGGCGGCCC CAAAGGCGCC GGCGCGCTGT ATCTGCGCGC CCGCACGCCG
TTTCATGCCT ACGCTATTGG CGGCGGCCAG GAAGGAGGCC GGCGCAGCGG CACGCAGAAC
GTGGCCGGCA TCGTCGGGTT CGCGGCAGCC GTGCATGCGG CGACCGCGAT GCAGGAGGCG
GAGGCGGCTC GCCTGCGGGT TCTGCGCGAC AGGCTGTACG AGCGGCTGGG CGCCATCGAC
GCGGTGGAGG CCACCGTGGA CGTTGCGCCG GGCAGCGAGG ATTTCCTTCC GAACATCGTG
CATGTGCTGG TGGACGGTTT GGAAAGCGAA ACGCTCATCC TTCGCTTCGA CATGCAGGGC
TTCGGCGTGT CGGGCGGGTC TGCCTGCTCG TCGCACTCGC TGGAACCCAG CCACGTGCTG
CGTTCCCTCG GCATCGACGC CGACCGCGCG CACGGCGCCC TGCGCATCTC GATGGGGCGC
TACACCGACG AAGCCGATGT CGAAGCCTTC GCGGTCGCCA TGGAGAAGAG CCTGAACTGG
AACTGA
 
Protein sequence
MASVPNDYVY LDYAATAPLC EEAAEAMAPY QVPGRANLAV GGNANSLHGP GRAAFAALEE 
ARRSIARDLG ARRPDEIVFT SGATEADDAA LLGIAQAAAD ERRRRGAGDF VPHVVVTAVE
HDAVLAPAKR LESQGFRVTR LAPNRQGFIE ERALEAALDA DTVLVSVQAA NSEVGSIQPI
ADLARVAHDH AALFHTDAVQ ALGKARVNLQ ELDVDAASFS AHKVGGPKGA GALYLRARTP
FHAYAIGGGQ EGGRRSGTQN VAGIVGFAAA VHAATAMQEA EAARLRVLRD RLYERLGAID
AVEATVDVAP GSEDFLPNIV HVLVDGLESE TLILRFDMQG FGVSGGSACS SHSLEPSHVL
RSLGIDADRA HGALRISMGR YTDEADVEAF AVAMEKSLNW N