Gene Elen_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0038 
Symbol 
ID8414317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp49982 
End bp54097 
Gene Length4116 bp 
Protein Length1371 aa 
Translation table11 
GC content59% 
IMG OID645023013 
ProductGLUG domain protein 
Protein accessionYP_003180421 
Protein GI257789815 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4970] Tfp pilus assembly protein FimT 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGA GGGGACGGCA TACGGAGGGC TTTACGCTCG CCGAACTTCT GATGAGCGTT 
GCCATCATAC TCATTCTTGC TGCCATCGCC TTTCCTTCCA TCGTGTCGGC GCAGAACAAC
ATGCGCATGT TGGAGCTGAA CAACGCCGCC CAATCCATCG CGAACGCCGC CCAAGCCCAG
ATGACGGCCA AGAAAGTATC GGGCACGTGG GTGGATGCCG TGAAGGACGG CGACTCTTAT
CGCGCCTGCT TCCCCGCCGC GCTTGCGGGC TCATCCCATT CGCAAGGCGA AACGGAAGAT
GTTTCACGTG AAACTCTCGG GTCCGACGAG GTTGCTCCTC CTTCCGCTTC TGCTGCGGAG
TCTTCCAAAC GTTACATGAC GGCGGACACC GCCCGCGAGC AGGGCATCGT CCCGGCGCTC
GCCGTGGAGG AGGCTGTGCG CGACGCCGAC TATATCATCG AGTTCGATGC CGACACGGCG
CAGGTGACCG GCGTGTTCTA CGCCGACGGA AGGTCCGGCT TCTTCGGATC GACCCCTGCC
TCCACGAACG CCGCGAAGAC GTATTACGAA ACCGAGGGCG CCTCCACCGA TCAGGCGGCG
CGCATGGGCC ACGACCCCAT GATCGGCTAC TATGGCGGCA CCCCTGCCGG CGCCACGCCC
GAGAAAGCGC TCGCGAACCC GGTGATCTGG GTGGACGAAG CAACGGGCTG TCTGATGGTG
CAGGACCCCA ACATTGCCGC GGACGGAAGC GCGGGCTCCA CCACCTCCAC GGTGGCTATC
GAGAACACCG GCAAGAACGT GGCGTTCTCG ATCTCCGGCT TGAGCAACGG AACAACCATG
GTGTCGTTGT ACACGGCCGA TGACGGTTCG GAGTCGGTTG GCTTCACCAA CTTCGCCGCA
GCCATCAAGC AGCAGACGCG CGACAACGCG AACGTGAAGG GCAATGTGTG GGCCATCGAC
CTCAACGCAC TTTCGCAGCT GGTGGCGAAG GGGAATGACG GCAAGCCGGC GGCGGATGAC
TCGCAGAAAG CGAAGCTCAA GCAGGTGTTC GACGCGTGTG TGGCGGGCGA CGCCCTGACC
GTGTCGGTGG AGACGAAGGA CGCCTCGCGC AGCTGCGTGC CGGGCACGGC TGCCGCCCAT
GTGGAGTGGC CGAGCCCTGC GGGCAAGCTG ACGATGCTAA TAACCAACCC GTACTCCGTC
GTCGTGGCCG GCGAGAAGAA CGAAGCTTAC ATCGAGCCGC AGGTGCGGTC GGCGGTCGCG
GACAGCGCGC ATCCTTCCAT TGGGGCGGGC TTCGGCAACG GCGACGGCGT GGTGAAGGAC
GGGCTGACGG TCAACCCCTT CTATGAAGAT AAAGATAATC ACTTCCGCGT TTCCAACGCG
AACGCGCAGT TGAAGCAGGA GAACCCGCAG GCCGGCTACC AGTCGTACGC GGGAGGCTGG
ATAGCCTCGT CGTCCGTGCG CGACGACGCC ACGTACCGGC TCGAGGGAAC TGTGGGCGCG
TACAACAATC ATGCGTACCA GATTTGGGAG CTGTGGATCA AGCGAGCCGA CACCGGTGAA
TGCATGCGCG TGGGCTACTT GAACGATGGC AAGTGGGAAT GGGGCGTGTT CAACCAGAAG
GGGGTCAACT ACGACTACCG TTTCCTGAAC GACTGCTTCA CCTGGTACGG AACGAATGAA
ACGGGCGATG CTTCGGGTAC CCTTGCCGGA ACCGATACCG ATACGAACAA CGTCATATCC
CTGCGCCTCG ATGTGCAGAA GTTCTACGCC GAGGCCGAGA ACCATAAGAA CCACGGCTTG
GCCGACGAGG ACGGCAATGC AACGGTGTAC GTGCGCACGG CTCCGAAGGC ATCCGAAGTC
CAGGCGTACT TCAACAAGCT GGCGGTTCCT GAACCGGCTT CGGTTGAGAA TCCGTTGAAG
GCTGCCTATC TGAGCGGCAG CGCCCAGGAG ACCGGCTCGC GCACCGCGGA TACCCCGTCG
GTCACGGCAC GCGCCGCGTT CGAAGGCGAG TTCGGCGCCT CATCGTCCGA CGTGTCGTGG
GCGGTGTCCC AAACCACGAC CGCCGGGTTT TCCCAGGGGA GCGAATACCT GAGCACCGCC
GCCACCGTGC CGGTGCGGGT GTACTACTCC ATCGCCCCCG GGGTTGGTTT CGCGAACATC
AGATCTTACG ATAACGGCAA CGGCAGCGGG TACCTAAGCG GCGTGCTCAG CACGCGCTTG
ACCAACGTCT CGCTTTGGCT GTATCGCGGT CCCTCTATCA ACGATCTTGC CGTTATGCCA
CCCGCCCTTC TGAAGAACTA CAAGGGGTTG GAGTTCTCGT GCCGCCAAGG AAGTACGTAC
GACTTCAAGA TAACCACCCA AGAAGACTAC CGCTTCTACC GGGCGCTCGC ATACACCGTC
GAGAATGGAA GTGCGCCGCC ATCTCAATAC GTCCCGCACG CATCGGCAAG CGATGAGAGC
GTCGCGAAAA TCGCCGCCGC CGAAAACTAC GAAACGGACG ATAAGCTGTA CACGTTCAAA
GGGTGGACAA CCAAGGATAC CATGTCGGGT GCAGAACTTC TCGTCGAAGC AGACAAGCTG
GTTTCCGACT ACGACGGACT CAGCTACCAA GGCACGACGC TTGTCGCAAG CTACGACGAG
CGGAAGAAGG TGCAGCCGTC TTTGGGCATG ATGTACATAG AAACCGGCAC CGACAAAACG
GGTTCGCCGG CGTACGGTTA CTATGGGTAT ATCGAGAACA ACGCGGCTGC GGTCGAAAAC
TTGCTGTCCA ACGAGTACTC CGTCACCGAC GGTGGCTATT ACGTGGTGGT TCCCACTGGC
AGCAATCAGC CGAAAATGAC GGGCAGAGGC GACATCCCGA ACTATCTCAA GGACTTCTCC
GACGTGCTGC AGGGTTTGTC CATCGAAGGA GGCTTATACG ATTGCTATCG CATCACGGTG
AGCGATAAAG ACGTGGGGCA AAACCGCCCG TACGACAAAT ATAAACTGAA GAATCAGACA
CTGTCGTTCA AGGCTGATAT TCAGACAGGT GCGAGCACGG TCTCCGTCGA GGGCACGTAC
ACGGTGAACT TCGCATTCGC TTGCGCTGTC GAAACGAATG AAGAGGCCGC CCTTTCATGG
GGAACGAGCG TTTCGCCCTG GAACGTTCGA CTGGGACGGC AGTTCGTCGG CAATTTGACA
GCTGGAAGCA ACAATGTGCA GGGGAAGTAT GCTCAATATA ACTGCTTTAT CCAAACGCGG
AACATCGATT TGGCCGATGC GCCTGTCGTG AGCACGTCTA AATTCACTCA GCCGTTCTTA
GGCTCGACGT ACGACGGAGG CGGCAACAGC ATATTCGGAA TACCGTACCG ACTGGCACAT
GAAGGGATTA CCGGTGAGAA CCAGGGAAGA AACGGTCTGT TTCTTGATGT TGGCGGGGAG
AGCCTGATTA AAAACGTGAA CATCGTGCTG GATCCCACCG ATGAAGGCTC AGAGTATGTG
TTCGTTTCCA CCCATGGAAG CAAACTTCGA TTCGGACTCC TCGTCGGGTC GATTGACGGA
GACGGAGCCG ATATTCAAAA CTGCACGGTC GGCGTTGCCG GCAAGGAGAC GGCTACGCTT
AGGATCATCA AAAAGGGTAC AGGTGGCGAG GCGTATATCG GCGGTTTGGT GGGCTATGCA
AGTCAAGCCA ATGTGAGCGA TCTATCGGTC AAGAATCTTC GTATTGAGGT TGTCGCCGAT
GTCGAAGCAT GGAAAACCTC TCCCGCGATC GGGGGGATCT TCGGCTACGG CTCGCAGTTG
AACATGTCGA GGAGCTCCGT CGATGGCTTC GCATGCGCTC TTTTCCAGCC TACGTTCGTT
AACGAGAAAT ATCCCAGCCA AAACACGCGC ATCCATTTCG GCGGTTTGGT CGGCAGCGCT
CAAATGGCAG TCATTAGCCA GAACGTGAAG AGCAACGCGG TTCTGCAGGT TCCTGCCGGC
CAGTTGCGAG ATTCTGTTGT TGCTGGGCGG TACGTTGGGC AAGCATCAAT GAGCGCGGTT
GCTCAGAACG AATCCGGGCA GGTGTTCGTC AAATACGGCG ACCCGAGTGA AGGCGGGGAA
ACCGTTGAGG TGACTGCCGA CGTCGGGGTG CAGTAA
 
Protein sequence
MAKRGRHTEG FTLAELLMSV AIILILAAIA FPSIVSAQNN MRMLELNNAA QSIANAAQAQ 
MTAKKVSGTW VDAVKDGDSY RACFPAALAG SSHSQGETED VSRETLGSDE VAPPSASAAE
SSKRYMTADT AREQGIVPAL AVEEAVRDAD YIIEFDADTA QVTGVFYADG RSGFFGSTPA
STNAAKTYYE TEGASTDQAA RMGHDPMIGY YGGTPAGATP EKALANPVIW VDEATGCLMV
QDPNIAADGS AGSTTSTVAI ENTGKNVAFS ISGLSNGTTM VSLYTADDGS ESVGFTNFAA
AIKQQTRDNA NVKGNVWAID LNALSQLVAK GNDGKPAADD SQKAKLKQVF DACVAGDALT
VSVETKDASR SCVPGTAAAH VEWPSPAGKL TMLITNPYSV VVAGEKNEAY IEPQVRSAVA
DSAHPSIGAG FGNGDGVVKD GLTVNPFYED KDNHFRVSNA NAQLKQENPQ AGYQSYAGGW
IASSSVRDDA TYRLEGTVGA YNNHAYQIWE LWIKRADTGE CMRVGYLNDG KWEWGVFNQK
GVNYDYRFLN DCFTWYGTNE TGDASGTLAG TDTDTNNVIS LRLDVQKFYA EAENHKNHGL
ADEDGNATVY VRTAPKASEV QAYFNKLAVP EPASVENPLK AAYLSGSAQE TGSRTADTPS
VTARAAFEGE FGASSSDVSW AVSQTTTAGF SQGSEYLSTA ATVPVRVYYS IAPGVGFANI
RSYDNGNGSG YLSGVLSTRL TNVSLWLYRG PSINDLAVMP PALLKNYKGL EFSCRQGSTY
DFKITTQEDY RFYRALAYTV ENGSAPPSQY VPHASASDES VAKIAAAENY ETDDKLYTFK
GWTTKDTMSG AELLVEADKL VSDYDGLSYQ GTTLVASYDE RKKVQPSLGM MYIETGTDKT
GSPAYGYYGY IENNAAAVEN LLSNEYSVTD GGYYVVVPTG SNQPKMTGRG DIPNYLKDFS
DVLQGLSIEG GLYDCYRITV SDKDVGQNRP YDKYKLKNQT LSFKADIQTG ASTVSVEGTY
TVNFAFACAV ETNEEAALSW GTSVSPWNVR LGRQFVGNLT AGSNNVQGKY AQYNCFIQTR
NIDLADAPVV STSKFTQPFL GSTYDGGGNS IFGIPYRLAH EGITGENQGR NGLFLDVGGE
SLIKNVNIVL DPTDEGSEYV FVSTHGSKLR FGLLVGSIDG DGADIQNCTV GVAGKETATL
RIIKKGTGGE AYIGGLVGYA SQANVSDLSV KNLRIEVVAD VEAWKTSPAI GGIFGYGSQL
NMSRSSVDGF ACALFQPTFV NEKYPSQNTR IHFGGLVGSA QMAVISQNVK SNAVLQVPAG
QLRDSVVAGR YVGQASMSAV AQNESGQVFV KYGDPSEGGE TVEVTADVGV Q