Gene Elen_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2032 
Symbol 
ID8416343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2380434 
End bp2381660 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content50% 
IMG OID645025009 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_003182385 
Protein GI257791779 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0688157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.000748168 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGCGA ACAACACTCA AATCAAGAAA GACTACTTCT GGAACACGCT CGGTAGCGTA 
ATGAGCGCGC TTGCGAGCGT GCTCCTGCTG ATGATTGTGA CACAGGTCCT CGGAGCGTAC
GAGGGCGGAA TATTTGCCTT GGCGTTTGCC GTAGCGCAGC AGTTTCAAAC GCTGGGTCAG
TATGAAATGA GGCCATATCA GGCGACCGAC GTCGAAAGCA AATACTCATT CGGGGTATAC
TATGCGTCGC GCATAGTGAC TTGCGTTCTC ATGGTGGTTG GAGTTATGTT GTACGCGATG
TACTCGAATG GATTCGGGTA CGAGGCGCTG TTGCTTATTC TTCTTGCAGG GTTGCGGTTC
TTTGACGCAT TCGAAGACGT ATTTCATGGA ATGTTTCAGC AGCGTGGACG GCTGGATATT
GCGGGTAAAG CATTCTTTGG TCGCGTATTG GCGACGGTTG TGTCATTCGC TCTATCGATA
TTTGTTTTTC GGGACATCCT GGTGGCGTGC GTCATTTCCA TTATCGTTTC AACGGTATTT
ATGATCGTCT TGAACATTCC GGCTGCCAGG ACGTTCGTCA GAGTGAAGCC GATGTTTGAC
TTTCGAAAAA TACGACAATT GCTCGTCGCT TGCTTTCCTC TTTTTTTGGG CTCGTTCTTG
TTGATATATC TCGTAAACGC TCCGCGCTAC GGTATCGAGA GCCTGCTCTC GAAGGAGTAC
CAGACGTATT ATGCCATTCT TTTCATGCCG GCCATGGTCA TCAACCTCGT AAGCGGTTTT
ATATTCAAGC CGTTGCTTAC CACATTGGCG AAACGATGGT CCGAAGGCTC GAAGCGGGGT
TTCGTGGCGA TTATCGCAAA AGGGTTGCTA GCGGTGTTCG CTACATCCGT TCTTGCATTG
CTGATAGCAT ATCCTTTGGG CATCCCGGTG CTTTCGTTTC TGTACGGCGT CGATTTGTCG
GAGTTCCGAT CGGTGCTTCT CGTGCTTTTG GTCGGAGGCG GATTCAATGC TGCGAGCGTT
ATTCTCTATT ACGGGATCAT AACCACGCGC CACCAAAACA TTGTTTTGCC GGGATATGCG
CTTGCTGCAG CGGTTGCCTT TCTTCTGACC AATGCGATGA TCGGGCAATT TGGAATGATG
GGTGCAGCCT TTCTCTATGA TGGCATCATG GCGCTTCTTG TGCTCATCTT CGGTCTGTGC
AATGTGTATT ACATCAAGCG CGGGTAA
 
Protein sequence
MAANNTQIKK DYFWNTLGSV MSALASVLLL MIVTQVLGAY EGGIFALAFA VAQQFQTLGQ 
YEMRPYQATD VESKYSFGVY YASRIVTCVL MVVGVMLYAM YSNGFGYEAL LLILLAGLRF
FDAFEDVFHG MFQQRGRLDI AGKAFFGRVL ATVVSFALSI FVFRDILVAC VISIIVSTVF
MIVLNIPAAR TFVRVKPMFD FRKIRQLLVA CFPLFLGSFL LIYLVNAPRY GIESLLSKEY
QTYYAILFMP AMVINLVSGF IFKPLLTTLA KRWSEGSKRG FVAIIAKGLL AVFATSVLAL
LIAYPLGIPV LSFLYGVDLS EFRSVLLVLL VGGGFNAASV ILYYGIITTR HQNIVLPGYA
LAAAVAFLLT NAMIGQFGMM GAAFLYDGIM ALLVLIFGLC NVYYIKRG