Gene Elen_2222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2222 
Symbol 
ID8416544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2609372 
End bp2611003 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content62% 
IMG OID645025207 
Producthypothetical protein 
Protein accessionYP_003182572 
Protein GI257791966 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0859777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATG AGAGGTTCGA CAACGAAGAC GAGCTGCGCC GGGCGGTGAT CCGGAATTTG 
GACGCGAGCC CGATTCCCCC GCAGGTGCAG GTCAAGCTCG ACGGCGTGTA CGCGTCGCTG
GGCTCCATTC CTCAGGATCG CCCCACGCCG TCGGGCGCGG GCGCTCCGCA GCGTCGGCAG
CCGGTCAAGC GGCGGTCTGC CGAGCCTGCG CACGGCAAGC GCAAGGGCGC CAGCGTGGCG
CGCCGCGGTG CGATGGTGGC GGTTGCGGCC GTGCTCGTCG TGCTGTTGAG CGGTGTCGCC
TTCGCCGCAT CGCGCCTGGT GCAGATGCAG CCGGGCGATG TCGGATTTTT CGGGGGCGGT
AACAACCTGC CCATATACAA CAGCTTGCAG CCCGGGGTTT CCAGCCTGAA CGCCGAGGTG
GGCGATACCG TTGAAGTCGA CGGCGTGCAG GTGACGCTCG ATTCGGTGTC GTGCGATCGC
AACATCGTCA ACCTGTTCTT CACGTTGGAG AAGGAGGGCG GCTTCGACCT GACCGAGCAG
TCGAACTACG AGGGCTCCCA GGAAAACGAA TGGGCGCGCT TGCAGCGGCT TGCTCCGCGC
TTCTCGTACA GTCTTTCGAG CAACGGCGAG GCGATCGGCA AAGATTCCGT CTACGTGCTC
GATGCGTACC AAAAAGACGG CAAGGTGAAG ATCATGGAGC GCATCGTGCC GGAAGCGACG
CTTCCCGACC AGGTGGACAT CGCGCTGGAA GGCTATGCGA TGTGGAAGCA GTTCGAAGAA
GGAGACGAGC CCTTCACGTT CGATGTCGGC CTCGACCTGA GCACGGTGGC CAGTCCGCGC
GAGCTGGGCG CGCACGACCT CGTGTTCAAC ACGAGCGACG GCGACAAGAC GATGGGCATC
CAGCGTTTCA CGGCATCCGA GCTGGGCACC GTGATGGTCG TGCGCAACGA CAACGAGTGG
ACGGGAGAGC AAGGCGAATA CGGTTCTTCC TACGGCCCGC CCGAGAACGT GCTGAGTCCT
CATTTGCTTA AGGTAACCGA TGACCAGGGC AACGTTTTGA CTCCGGTCGA AGCTGGCGAT
GGTTCGGGCG TCAATCCGGA GGGTTCGCAG ATTATCGAGT TCTCCAATCT CTCGCCCGAA
GCGCATAGCG TCACGTTCAC GCCGATGTTG AACGCGCTCG ACTGGGACTC GATGACGGTC
GAGGAGCGTA AAGCGAGGAA TGAGGAAAAC GTACAACATG TGGACGTCTC TCGAATCGGC
ACCACATTGG AGACGAGCGA GTTCGGCGGC TACGAGCTGA CCGGCTGGGA CGTGACCGAT
GGAACGGTGA GCATATCGCT CAAGCCCTAC GGATGGCAGG CTATGGGACC GTACATGGAG
CTCATTTCCG AAGACGATGT GACGCTCTTG GAGAGCACAT GGACGGATCC CGAGACGGGC
GAGACGGGAA CCGGCTACCA TTCGGGCATC ATGTATCGCA AGCACGACTA TATGACCGGC
GAGTTCGTCC AGATGGTGTC GTACTACGCC GCAGACGATG ATGAGCTGCG CGGGCTCACG
AACTACAGTT ACCGCTCTGC GTTCGGTGAG TATCGGGAAG AGCCAGACGC GGCGCAGACG
CTCTCGTTCT AA
 
Protein sequence
MSHERFDNED ELRRAVIRNL DASPIPPQVQ VKLDGVYASL GSIPQDRPTP SGAGAPQRRQ 
PVKRRSAEPA HGKRKGASVA RRGAMVAVAA VLVVLLSGVA FAASRLVQMQ PGDVGFFGGG
NNLPIYNSLQ PGVSSLNAEV GDTVEVDGVQ VTLDSVSCDR NIVNLFFTLE KEGGFDLTEQ
SNYEGSQENE WARLQRLAPR FSYSLSSNGE AIGKDSVYVL DAYQKDGKVK IMERIVPEAT
LPDQVDIALE GYAMWKQFEE GDEPFTFDVG LDLSTVASPR ELGAHDLVFN TSDGDKTMGI
QRFTASELGT VMVVRNDNEW TGEQGEYGSS YGPPENVLSP HLLKVTDDQG NVLTPVEAGD
GSGVNPEGSQ IIEFSNLSPE AHSVTFTPML NALDWDSMTV EERKARNEEN VQHVDVSRIG
TTLETSEFGG YELTGWDVTD GTVSISLKPY GWQAMGPYME LISEDDVTLL ESTWTDPETG
ETGTGYHSGI MYRKHDYMTG EFVQMVSYYA ADDDELRGLT NYSYRSAFGE YREEPDAAQT
LSF