Gene Elen_2261 details

Gene Information

Locus tag: Elen_2261
Symbol:
ID: 8416585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2657558
End bp: 2658817
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 62%
IMG OID: 645025247
Product: HipA N-terminal domain protein
Protein accession: YP_003182610
Protein GI: 257792004
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
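
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions match the figures in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2657558, 2658817)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```

Both results agree with the Gene Length and Protein Length fields listed in the record.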


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.573422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCGC TCTCGGTATT CAGAATGGAC GGTTCCGAAC CGCGCCTACT CGGACGCATC 
AACCTCGATC CCGTCACGTT CTCCTACGAT CCCGGCTACC TCGCCGACCC CGACGCGCGA
GCGGTCTCCT GCTCGCTCCC TCTGCGAAAC GAGGCCTATA GCGAATCGAC GCTCGCTCCC
TACTTCGACG GCCTCTTGCC CGAAGGTGCG GCACGGCGGG CGATCGCGGC GCAGCTCGGC
ATCGAGCCCG ACAACTACCT GCTGTTGTTG CTCCGATGCG GGCTGGAAAC CATAGGGGAC
GTCGCGATCA CGAACGGGGA GGCGCCGAAT CCGCGAGGTT CGTACCGCCG TTTGAGCGGC
TTCGATCTTC GCGCCCTGTT CTCGGCCATG GAAGAGCTTG CCGAATCGAA TTCCGAAGCG
CGGCTTTCCC TTGCCGGCAC GCAAGGGAAA GCGGGGCTCG CGCATATGCC GGGCGCGCCC
ATGGATGAAG GCTGGCTCCA ACCTTTGGAC GGTGCCGCTT CGACCCATAT CCTCAAAACG
GGATCTCTTC CCGACATCCC GCTGCTCGAG TACCTGTGCA TGAAAGCGGC GGCCGCATGC
GGGATCGCCG TCGCACCAGT GCATCTGCTC GACTTCGGGA GGCCGGTACT ATGCGTCGAA
CGCTACGACC GACGGCAACT TCGAGACCGA GGAGAGCCAG TCGTTTCAAG GCTTCATCAG
GAAGACCTCT CACAAGCGTT CGGGGTGCCG CCCGAAGCGA AATACCGCGA GCTCGAGCCC
TCCACCGCCG CAGCGGTCGG CTCGTTCATC CGGATGCGAT CCTCTCGCCC TATCGAGGAT
CTGGAAGCGT TTGCGCGGAT CACGTGCTTC AACTACCTCG TAGGCAACTG CGACAATCAC
TTGAAGAACC TGTCGATCCT GTACACGAGC ACCTGGAAGA GCTTCCGGCT TGCGCCTGCG
TACGACCTCG TTTCGACAAC TCGGTTCGAA CGGTTCTCGC GATCGATGGG GATGAGGATC
GGATCGGCAT CGGTCATCGA CGACGTCTCG CCTAGCAGCA TTTTGGAATT CGGCTCCGCG
ATCGGCGTGG ATCGCAAGGA CATCGCGCGT ATCTGCGCCG AACTCGCGAG CAGTGTACCC
AGCGCCATCG AAATCGCAGG TGAAACCGCT CCTGCATTCG AAGCCTTGCC GTACGCAGCC
TGGGACATGC GCGAAGACAT GGGCGAGCGG ATGCGCGTGC TTGAAGAAGT CGCCGGCTGA
 
Protein sequence
MDALSVFRMD GSEPRLLGRI NLDPVTFSYD PGYLADPDAR AVSCSLPLRN EAYSESTLAP 
YFDGLLPEGA ARRAIAAQLG IEPDNYLLLL LRCGLETIGD VAITNGEAPN PRGSYRRLSG
FDLRALFSAM EELAESNSEA RLSLAGTQGK AGLAHMPGAP MDEGWLQPLD GAASTHILKT
GSLPDIPLLE YLCMKAAAAC GIAVAPVHLL DFGRPVLCVE RYDRRQLRDR GEPVVSRLHQ
EDLSQAFGVP PEAKYRELEP STAAAVGSFI RMRSSRPIED LEAFARITCF NYLVGNCDNH
LKNLSILYTS TWKSFRLAPA YDLVSTTRFE RFSRSMGMRI GSASVIDDVS PSSILEFGSA
IGVDRKDIAR ICAELASSVP SAIEIAGETA PAFEALPYAA WDMREDMGER MRVLEEVAG
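
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. As a minimal sketch, the helpers below strip the block spacing and compute a GC percentage that could be compared with the GC content field in this record. The names `clean_sequence` and `gc_percent` are hypothetical, and the code assumes the input contains only sequence letters and whitespace.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments splits on any run of spaces or newlines.
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First block of the gene sequence, as printed in the listing.
first_block = "ATGGATGCGC"
print(clean_sequence("ATGG ATGC\nGC"))  # ATGGATGCGC
print(gc_percent(first_block))          # 60.0
```

Passing the full gene sequence listing to `gc_percent` gives the value for the whole coding region.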