Gene Elen_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2136 
Symbol 
ID8416458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2510674 
End bp2511843 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content65% 
IMG OID645025123 
ProductNLP/P60 protein 
Protein accessionYP_003182488 
Protein GI257791882 
COG category[S] Function unknown 
COG ID[COG3883] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000118144 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.895402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC ATACAATCGG ACTGAGCAGG CGAACATTCC TCACCGGCGC GGCCGCGCTC 
GGCGCCCTTT CCGTTCTGGC TCCGACGACC GCGTTCGCAG AGACGGCTGC TGAGAAGCAG
GCCGAGGCCG ATGCGGTGCG CAACCAGCTG ATCGGCTTGC AGGCCGATCT CGAGGCTGCC
GAAATCAGCT ATTACTCCGC CCTCGACGAG CGCGATGCCG CGCAGAAGGC GATGGAGGAT
CAGCAGGCCA AAATCGACGA CGCTAACAGC CAGATCAGCG ATCTGCAGGA CAAGCTGGGC
ACGCGTGCCC GCAACATGTA CCGCAACGGA TCCACGAGCT TCGTCGATTT CGTCCTGGGT
GCCGCCTCGT TCGAAGAGTT CACCCAGAAT TGGGATCTTC TCAACAAGAT GAACGAGAAC
GACGCCGATA TGGTCGACCA GACGAAGACC TTGCGCGAAG AGCTGCAGGC TGCCAAGGAC
GAATTCGCTC GCCAAGAGCA AATCGCCTCG GCCAAGGCTG CGGAAGCCAA ACAGATCCAA
AGCGATGTCC AGGCCAAGGT CGACCAGGCT ACCGAGCTGG TCAGCTCCCT CGATGCCGAA
GCGCAGGAGC TCCTTCAGCA GGAGCAGGCT GCGGCGGCGG CCGCTGCGGC GGCGGAGGCT
GCGGCCGAGG CCGAGCGTCA GCGCCAGGCG GAGCAGGCGG TGAATCCCGG CGGCGGCGGT
GGCGGCGCTA GCGGCGGTTC AGGTTCCGGC TCCGGCGGCG GAAGCGGTTC GTCCGGCGGC
GGCGGTGGTG GCGGTTCTGT AGTGTATCCG TCCCGTCCGG TCGGTTCCTA CGACTCGGTC
GTGGGCTACG CTATGAGCCG TATCGGCTGC CCCTACATCT GGGGTGCCGA AGGCCCCGAC
TCCTTTGACT GCTCCGGCTT GGTCACGTGG GCGTACCGCC AGGTGGGCAT GTATCTGCCG
CACCAGAGCG AGGCGCAGTA CGCGGCAGCC GCGCGCGTCG TATCGGTTTC CGAGGCGCGT
CCGGGCGACG TGCTGTGGCG TTACGGTCAC GTCGGTATCG CGGTGAGTGC AGGCGGCTCG
CACTACGTGC ACGCTCCCAC CTTCAACGCG TACGTGCGCG ACACCGATCC GCTGTCGTGG
GCGCAGTTCA CGAACGCGCT GCAGTTCTAA
 
Protein sequence
MSEHTIGLSR RTFLTGAAAL GALSVLAPTT AFAETAAEKQ AEADAVRNQL IGLQADLEAA 
EISYYSALDE RDAAQKAMED QQAKIDDANS QISDLQDKLG TRARNMYRNG STSFVDFVLG
AASFEEFTQN WDLLNKMNEN DADMVDQTKT LREELQAAKD EFARQEQIAS AKAAEAKQIQ
SDVQAKVDQA TELVSSLDAE AQELLQQEQA AAAAAAAAEA AAEAERQRQA EQAVNPGGGG
GGASGGSGSG SGGGSGSSGG GGGGGSVVYP SRPVGSYDSV VGYAMSRIGC PYIWGAEGPD
SFDCSGLVTW AYRQVGMYLP HQSEAQYAAA ARVVSVSEAR PGDVLWRYGH VGIAVSAGGS
HYVHAPTFNA YVRDTDPLSW AQFTNALQF