Gene Elen_1644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1644 
Symbol 
ID8415943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1944501 
End bp1945637 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content70% 
IMG OID645024613 
Producthypothetical protein 
Protein accessionYP_003182001 
Protein GI257791395 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.112699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.00252347 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGACG ACGAGGCGCG CGAGCGGGCG CGCAGGAGGC TCGAGGAGCG CAAGGCGCGC 
ATGCGCGGGG AAGCGCCCAC CGGCCGCGAT ACGCACGGCG TGCATGACGC GCGCGAGCCG
ATAAGCTCGG GTCGTCTCCG TTCGAGCCGT CCTGGAACGG GCAATCGGCA CTCATGGGAG
GCGCACGGCT CGGAGCTCCT GTTCGCCTTC TCCGAGGCTG CGACGAACCT CGTGCGCGCC
GTCGGTCCCA AGCGCCTCGC GATCGCGGCG GCCGCGATCG TGCTGGTCGT CGTGCTTGTC
GCGGGCGTTC GCGGCTGCAT GGCCGCCGGC GCTGCGACGC AGGCGCCGGA CGAGGCTGAC
CGGGCTCCCG TCCAGCAGCA GACGCAGCGC GATCCTATCG ACGAGGCCAA GCTCAAGGCC
GTGCTCGGCG ACGATTTGGC CGCCCAGCTC GTCCAAGCGG CTTCGGCGAG CGACGACGCG
GCATGGATCG CCGCCCATCC GGACGCCTAC GCCGTGGACG GCGAAGCGGT GCAGCGCAAG
CTGCTCAAGC TGGCCGCCGT CGAGCCCGAG GCCGTGCCCT TCGTGCGCAC GTTTCCCGAC
GCCTATCCGG CCGAGAGCGC CCTCGGTACG GACGACCCCG CCTCAGGCGA GGTGCCGCGT
CTCTACCAGT GGGATCAGCG CTGGGGCTCC ACCGTGTACA GCTCCACGAC GTTCGCGCTG
ACGGGATGCT GCCCCACGTC GCTTTCCATG GTGTACCAGG GCCTCACCGG CAAGGGCGAT
CTGTCGCCCT ACGATATGGG GAAACGTGCG AGCGACGGCG GCTTTGAGAC GGCGTTCGAC
GGCACCGACT CCTCGTTCCT CGTGAGCGAG GCAGCCTCGC TCGGCCTTTC CTGCGAGGCG
CTCTCGGTCG ATGCGGGCAG CGTGCGCGCG GCGCTCGAAG GCGGCGCCGT GCTCGTCTGC
AACGTCGGCC CTGGAGACTT CACCGACAAC GGCCACTTCT TCGTCGTCAC CGGCATCGAC
GGCGACGGGA ACCTGCGCAT CAACGATCCG TACTCGGCCG AGCGCTCGAA CAGAGCCTGG
AACGTGGACA CGGTGCTCGG CCAGACGAAG GCGCTGTGGG CCTACCGGCT GGCCTGA
 
Protein sequence
MGDDEARERA RRRLEERKAR MRGEAPTGRD THGVHDAREP ISSGRLRSSR PGTGNRHSWE 
AHGSELLFAF SEAATNLVRA VGPKRLAIAA AAIVLVVVLV AGVRGCMAAG AATQAPDEAD
RAPVQQQTQR DPIDEAKLKA VLGDDLAAQL VQAASASDDA AWIAAHPDAY AVDGEAVQRK
LLKLAAVEPE AVPFVRTFPD AYPAESALGT DDPASGEVPR LYQWDQRWGS TVYSSTTFAL
TGCCPTSLSM VYQGLTGKGD LSPYDMGKRA SDGGFETAFD GTDSSFLVSE AASLGLSCEA
LSVDAGSVRA ALEGGAVLVC NVGPGDFTDN GHFFVVTGID GDGNLRINDP YSAERSNRAW
NVDTVLGQTK ALWAYRLA