Gene Elen_2535 details

Gene Information

Locus tag: Elen_2535
Symbol:
ID: 8416859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2967174
End bp: 2968379
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 54%
IMG OID: 645025516
Product: hypothetical protein
Protein accession: YP_003182879
Protein GI: 257792273
COG category: [S] Function unknown
COG ID: [COG4260] Putative virion core protein (lumpy skin disease virus)
TIGRFAM ID:
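
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2967174, 2968379)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields in the record.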


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.944384
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTCA TCAAGGCAAT GGGAGGTTCT ACTCGGGGAG TTCTTGCCGA TTCGTGGAGA 
GATTTCTTCT ACTGCGAGTC GCTTGATGCC TCCACATTGG CGTCAAAAGG ACAGAAGAAG
ACAGGGAACC CTGATCGTTC CTCAAACGCC AAGGGAGACG AAAACGTTAT CTCGAACGGT
TCAATTGTTG CCATAAACGA TGGCCAGTGC ATGATCATAG TCGAATCAGG AGCTGTCGTT
GACCTTTGCG CCGAACCCGG CGAATACCTA TATGAAACGT CAAGCGAGCC GAGCGTCTTC
TACGGCCCGT TGGGCGCAAA CGTCAAGAGC ACGTTCAAAG AGATGCAACG TCGTATAGGA
TTCGGCGGCA GTCCCGGGAA AGACCAGCGC GTTTACTACT TCAACATCAA GGAGATCGTC
GGAAACAAAT ACGGAACCCC TAACCCCGTT CCCTTCCGCG TCGTCGATGC CAATATTGGC
CTTGATATCG ACATAGCCGT ACGCTGCAAT GGCGAATATT CGTACAGGAT AGATAACCCC
CTGTTGTTCT ACCGCAACGT TTGCGGGAAC GTTGAAACCA CCTACACAAA GGATCAACTG
GATTCTCAGT TGAAAAGCGA GCTATTGACC GCCCTGCAAC CTGCATTTTC CCGCATCTCG
GCTGCTGGCG TGCGATACAG CAACGTTCCC GCGCATACTC GCGAACTTGC AGCGCTTCTA
AACGAAGAGC TCACTGACAC ATGGCGAAGC CTTCGCGGAA TGTCCGTCGT GTCCTTTGGA
ATGAATTCTA TTCGAGCTTC GGAAGAGGAC GAACTCGTTA TCAAGCGACT TCAGAGCGCT
GCGGTGATGC GCGATCCGAA TATGGCAGCC GCCAATCTGG TAGCCGCCCA ATCCGACGCC
ATGCGCATCG CGGCAGGAAA CGCAAACGGA GCAGCTAACG GCTTTATCGG TTTAGGGCTA
GCGAACATGA CAGGCGGAAC GGATGCGGGA CGTTTGTTCA CCGACGCAGC GACCAGCTTT
CATCATTCCG GATCCTTCAA TCAGCAGAAC TGGACTTGCT CTTGCGGAGT AAAGAACTCG
GGGAACTTCT GCCAAAACTG TGGCAAAGAG CGCTGCAGTG ATTCCGCATG GACTTGCCCT
TCATGCGGTA CGAGCAGCGC AGGGAACTAC TGCTCGCAGT GCGGCAAAGC CAGAACGCAG
CCCTGA
 
Protein sequence
MGLIKAMGGS TRGVLADSWR DFFYCESLDA STLASKGQKK TGNPDRSSNA KGDENVISNG 
SIVAINDGQC MIIVESGAVV DLCAEPGEYL YETSSEPSVF YGPLGANVKS TFKEMQRRIG
FGGSPGKDQR VYYFNIKEIV GNKYGTPNPV PFRVVDANIG LDIDIAVRCN GEYSYRIDNP
LLFYRNVCGN VETTYTKDQL DSQLKSELLT ALQPAFSRIS AAGVRYSNVP AHTRELAALL
NEELTDTWRS LRGMSVVSFG MNSIRASEED ELVIKRLQSA AVMRDPNMAA ANLVAAQSDA
MRIAAGNANG AANGFIGLGL ANMTGGTDAG RLFTDAATSF HHSGSFNQQN WTCSCGVKNS
GNFCQNCGKE RCSDSAWTCP SCGTSSAGNY CSQCGKARTQ P
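
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons), and that the sequence contains only A, C, G and T. The helper names `clean`, `translate` and `gc_percent` are hypothetical; this is an illustration, not the method the database used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above
print(translate("ATGGGACTCA TCAAGGCAAT GGGAGGTTCT"))  # MGLIKAMGGS
```

The same `translate` call can be applied to the full gene sequence and compared with the protein sequence listed in the record, and `gc_percent` can be compared with the GC content field.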