Gene Elen_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0044 
Symbol 
ID8414323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp58422 
End bp59630 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content65% 
IMG OID645023019 
Producttype II secretion system protein 
Protein accessionYP_003180427 
Protein GI257789821 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCT TCACTTACAC CGGGATAACG GCCGCCGGCC AGCAGATCGA CGGCGTGGTC 
GAGGCGTTCG ACGAGATCGA GGCCATGGAA CGCGCGCGCG AGCAGTGCCG TGTCGTCCAA
AGCGTGGTCC CCGTGCGCGA GGGCAAAAAC CTGCTGTCCA TGGACATCAC GAAGCCGAAG
GCCAAGCAGA AGAACCTTGC CGTCATGTGC GCCCAGTTCG CCACCATCCT GAACGCGGGC
GTGCCCGCCG CGCGCGCCAC GTCGCTGGTG GCCGACCAGG TGACCGACAA GTATCTGAAA
CGCGTGCTGG CCGACGTGGC CGCCGACGTG GCGTCGGGCC ACAGCCTGGC CGAGAGTTTC
CAAAGCAAGG GCGAGAACCT GCCGCGCGTG TTCATCGAGA CGGTGCGCGC CGGCGAGGAG
AGCGGGCATC TGCCCGAGAG CTTCCAACGC CTGCACGGGT ACTTCGACAA GCGCGCGAAG
GTGTCGGCCA AGGTGCAAAG CGCCGTGACC TACCCCATCT TCGTGGCGGT GATCGCCGTG
GTGGTGCTGG CCATCATGAT GGTGATGGTG ATCCCCTCGA TGACGGGCAT GATCGCATCG
CTGGGGGCGG ATACGCCGGC CATGACCCAG TTTTTGATCG ACGCTTCGAA CTTCGTCACC
GACAACTTCC TGCTGATCGC GGTGGTGCTG GCGCTGATCG TCGTGGGCGT GAAGCTGTTC
GGCACCACGG AGCGCGGCAA GACGACGTTC GCCGTTTTGA AGCTGCGGCT GCCGGTGCTG
GGCGCGGTGG GCGTGTGCTC GGGCGCGGCT CAGTTCGCGA ACACGATGGC CATGCTGGTG
ACGGCCGGCC TTCCTGCCAC GCGGGCGGTG GCCATCACGT CGCGCGTGAT GAGCAACTAT
GTGCTGTCGC GCGAGGTGGG GCGCTTGGAG GCCGGTTTGG AGGAAGGGCG CACGCTGGGC
GAGGGCCTGG AGGCCAGCAC GTACCTGCCG CGCACGCTGG TCGAGATGGT CACCGTGGGT
GAGCACACCG GCGAGCTGGA GGAGACGCTG GAAACCATGG GCGCGTTCTA CGACGACGAG
ACGCAGCGCG TGACCAACAA GGCGATTTCC ATCATGGAGC CCGCCCTGCT TGTTTTGATG
GCGTTGTTCG CGGGTTTCAT CGTCATCGCG TTGTATTTGC CGATGTTTTC ATTGTATGCT
GCAATGTAA
 
Protein sequence
MPTFTYTGIT AAGQQIDGVV EAFDEIEAME RAREQCRVVQ SVVPVREGKN LLSMDITKPK 
AKQKNLAVMC AQFATILNAG VPAARATSLV ADQVTDKYLK RVLADVAADV ASGHSLAESF
QSKGENLPRV FIETVRAGEE SGHLPESFQR LHGYFDKRAK VSAKVQSAVT YPIFVAVIAV
VVLAIMMVMV IPSMTGMIAS LGADTPAMTQ FLIDASNFVT DNFLLIAVVL ALIVVGVKLF
GTTERGKTTF AVLKLRLPVL GAVGVCSGAA QFANTMAMLV TAGLPATRAV AITSRVMSNY
VLSREVGRLE AGLEEGRTLG EGLEASTYLP RTLVEMVTVG EHTGELEETL ETMGAFYDDE
TQRVTNKAIS IMEPALLVLM ALFAGFIVIA LYLPMFSLYA AM