Gene Elen_2856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2856 
Symbol 
ID8417187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3317547 
End bp3318623 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content68% 
IMG OID645025835 
Productpeptidase M24 
Protein accessionYP_003183191 
Protein GI257792585 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.296278 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGAAC AGCGCATCAA AACCGTGAGG CGCAACCTCG CGAACCGAGG TCTCGAGCAG 
ATGCTCGTAT GCGATCCCCG TTCCATCCAC TATCTCACCG GAGCGTTCAT CGAGCCGGGC
GAGCGCTTCC TCGGGCTGAT CGTCGGATCG GACGCTCGAC CCACCCTCGT GCTGAATGCG
CTGTTCGCCG CGCCGGCCGA CGCCGCCTGC ACCGTGCGCT CGTTCACCGA CACCGACGAT
CCGCTGGCCA TCGTCGAAGG GCTGTGCGAT GCCGACAAGC CGCTGGGCTG CGACAAGAAC
CTGCCTGCCC GCTTCCTGCT GCCGCTCATG GAGCGCGGCG CGGCGAGCGG CTTCGTGCTG
GCCTCTGATG CGGTAGACGA CGCGCGCGCC ATCAAGGACG ATACCGAGCG CGAGCTTATG
CGCGCCGCCA GCGCCGCGAA CGATGCCGCC ATGGACCGTT TCCGCCGGCT CGTGCACGAG
GGCGTCACCG AGGCCGACGT GGCCGGCCAG CTGGAGGCGA TCTACCGCGA ACTGGGCGCG
CAGGGCCACT CGTTCACCCC CATCGTCAGC TTCGGCGCGA ACGCGGCCGA TCCGCACCAC
GAGCCCGACG ACACGCCGCT GGCCAGCGGT GACGTGGTCC TGTTCGACGT GGGTTGCCGC
AAGGGCGAGT ACTGCTCCGA CATGACGCGC ACCTTCGTGT TCGGCGAGCC CAGCGAAAAA
CTGCGCGAGG TGCACGACAC CGTGCGGCGC GCGAACGAGG CGGCGCGCAA GCTGGTAGCT
CCCGGCGTGC GCTTCTGCGA CATCGACGCC GCAGCTCGCT CAATCATCGA AGAGGCGGGC
TACGGCTCCT ACTTCACGCA TCGCCTGGGA CACCAGATCG GCCTCGACGT GCACGAGCCC
GGCGACGTGT CGGCCGCGCA CGACGCGCCG GTGCAAGCGG GCATGGTGTT CTCCATCGAG
CCGGGCATCT ACCTGCCCGG CGAGTTCGGC GTGCGCATCG AGGACCTCGT GCTGGTCACC
GAAGACGGCT GCGAAGTGCT CAACAGCTAC CCCCGCGAGC TGGTTTCGAT CGGCTAG
 
Protein sequence
MYEQRIKTVR RNLANRGLEQ MLVCDPRSIH YLTGAFIEPG ERFLGLIVGS DARPTLVLNA 
LFAAPADAAC TVRSFTDTDD PLAIVEGLCD ADKPLGCDKN LPARFLLPLM ERGAASGFVL
ASDAVDDARA IKDDTERELM RAASAANDAA MDRFRRLVHE GVTEADVAGQ LEAIYRELGA
QGHSFTPIVS FGANAADPHH EPDDTPLASG DVVLFDVGCR KGEYCSDMTR TFVFGEPSEK
LREVHDTVRR ANEAARKLVA PGVRFCDIDA AARSIIEEAG YGSYFTHRLG HQIGLDVHEP
GDVSAAHDAP VQAGMVFSIE PGIYLPGEFG VRIEDLVLVT EDGCEVLNSY PRELVSIG