Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0038 |
Symbol | |
ID | 8414317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 49982 |
End bp | 54097 |
Gene Length | 4116 bp |
Protein Length | 1371 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 645023013 |
Product | GLUG domain protein |
Protein accession | YP_003180421 |
Protein GI | 257789815 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4970] Tfp pilus assembly protein FimT |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGA GGGGACGGCA TACGGAGGGC TTTACGCTCG CCGAACTTCT GATGAGCGTT GCCATCATAC TCATTCTTGC TGCCATCGCC TTTCCTTCCA TCGTGTCGGC GCAGAACAAC ATGCGCATGT TGGAGCTGAA CAACGCCGCC CAATCCATCG CGAACGCCGC CCAAGCCCAG ATGACGGCCA AGAAAGTATC GGGCACGTGG GTGGATGCCG TGAAGGACGG CGACTCTTAT CGCGCCTGCT TCCCCGCCGC GCTTGCGGGC TCATCCCATT CGCAAGGCGA AACGGAAGAT GTTTCACGTG AAACTCTCGG GTCCGACGAG GTTGCTCCTC CTTCCGCTTC TGCTGCGGAG TCTTCCAAAC GTTACATGAC GGCGGACACC GCCCGCGAGC AGGGCATCGT CCCGGCGCTC GCCGTGGAGG AGGCTGTGCG CGACGCCGAC TATATCATCG AGTTCGATGC CGACACGGCG CAGGTGACCG GCGTGTTCTA CGCCGACGGA AGGTCCGGCT TCTTCGGATC GACCCCTGCC TCCACGAACG CCGCGAAGAC GTATTACGAA ACCGAGGGCG CCTCCACCGA TCAGGCGGCG CGCATGGGCC ACGACCCCAT GATCGGCTAC TATGGCGGCA CCCCTGCCGG CGCCACGCCC GAGAAAGCGC TCGCGAACCC GGTGATCTGG GTGGACGAAG CAACGGGCTG TCTGATGGTG CAGGACCCCA ACATTGCCGC GGACGGAAGC GCGGGCTCCA CCACCTCCAC GGTGGCTATC GAGAACACCG GCAAGAACGT GGCGTTCTCG ATCTCCGGCT TGAGCAACGG AACAACCATG GTGTCGTTGT ACACGGCCGA TGACGGTTCG GAGTCGGTTG GCTTCACCAA CTTCGCCGCA GCCATCAAGC AGCAGACGCG CGACAACGCG AACGTGAAGG GCAATGTGTG GGCCATCGAC CTCAACGCAC TTTCGCAGCT GGTGGCGAAG GGGAATGACG GCAAGCCGGC GGCGGATGAC TCGCAGAAAG CGAAGCTCAA GCAGGTGTTC GACGCGTGTG TGGCGGGCGA CGCCCTGACC GTGTCGGTGG AGACGAAGGA CGCCTCGCGC AGCTGCGTGC CGGGCACGGC TGCCGCCCAT GTGGAGTGGC CGAGCCCTGC GGGCAAGCTG ACGATGCTAA TAACCAACCC GTACTCCGTC GTCGTGGCCG GCGAGAAGAA CGAAGCTTAC ATCGAGCCGC AGGTGCGGTC GGCGGTCGCG GACAGCGCGC ATCCTTCCAT TGGGGCGGGC TTCGGCAACG GCGACGGCGT GGTGAAGGAC GGGCTGACGG TCAACCCCTT CTATGAAGAT AAAGATAATC ACTTCCGCGT TTCCAACGCG AACGCGCAGT TGAAGCAGGA GAACCCGCAG GCCGGCTACC AGTCGTACGC GGGAGGCTGG ATAGCCTCGT CGTCCGTGCG CGACGACGCC ACGTACCGGC TCGAGGGAAC TGTGGGCGCG TACAACAATC ATGCGTACCA GATTTGGGAG CTGTGGATCA AGCGAGCCGA CACCGGTGAA TGCATGCGCG TGGGCTACTT GAACGATGGC AAGTGGGAAT GGGGCGTGTT CAACCAGAAG GGGGTCAACT ACGACTACCG TTTCCTGAAC GACTGCTTCA CCTGGTACGG AACGAATGAA ACGGGCGATG CTTCGGGTAC CCTTGCCGGA ACCGATACCG ATACGAACAA CGTCATATCC CTGCGCCTCG ATGTGCAGAA GTTCTACGCC GAGGCCGAGA ACCATAAGAA CCACGGCTTG GCCGACGAGG ACGGCAATGC AACGGTGTAC GTGCGCACGG CTCCGAAGGC ATCCGAAGTC CAGGCGTACT TCAACAAGCT GGCGGTTCCT GAACCGGCTT CGGTTGAGAA TCCGTTGAAG GCTGCCTATC TGAGCGGCAG CGCCCAGGAG ACCGGCTCGC GCACCGCGGA TACCCCGTCG GTCACGGCAC GCGCCGCGTT CGAAGGCGAG TTCGGCGCCT CATCGTCCGA CGTGTCGTGG GCGGTGTCCC AAACCACGAC CGCCGGGTTT TCCCAGGGGA GCGAATACCT GAGCACCGCC GCCACCGTGC CGGTGCGGGT GTACTACTCC ATCGCCCCCG GGGTTGGTTT CGCGAACATC AGATCTTACG ATAACGGCAA CGGCAGCGGG TACCTAAGCG GCGTGCTCAG CACGCGCTTG ACCAACGTCT CGCTTTGGCT GTATCGCGGT CCCTCTATCA ACGATCTTGC CGTTATGCCA CCCGCCCTTC TGAAGAACTA CAAGGGGTTG GAGTTCTCGT GCCGCCAAGG AAGTACGTAC GACTTCAAGA TAACCACCCA AGAAGACTAC CGCTTCTACC GGGCGCTCGC ATACACCGTC GAGAATGGAA GTGCGCCGCC ATCTCAATAC GTCCCGCACG CATCGGCAAG CGATGAGAGC GTCGCGAAAA TCGCCGCCGC CGAAAACTAC GAAACGGACG ATAAGCTGTA CACGTTCAAA GGGTGGACAA CCAAGGATAC CATGTCGGGT GCAGAACTTC TCGTCGAAGC AGACAAGCTG GTTTCCGACT ACGACGGACT CAGCTACCAA GGCACGACGC TTGTCGCAAG CTACGACGAG CGGAAGAAGG TGCAGCCGTC TTTGGGCATG ATGTACATAG AAACCGGCAC CGACAAAACG GGTTCGCCGG CGTACGGTTA CTATGGGTAT ATCGAGAACA ACGCGGCTGC GGTCGAAAAC TTGCTGTCCA ACGAGTACTC CGTCACCGAC GGTGGCTATT ACGTGGTGGT TCCCACTGGC AGCAATCAGC CGAAAATGAC GGGCAGAGGC GACATCCCGA ACTATCTCAA GGACTTCTCC GACGTGCTGC AGGGTTTGTC CATCGAAGGA GGCTTATACG ATTGCTATCG CATCACGGTG AGCGATAAAG ACGTGGGGCA AAACCGCCCG TACGACAAAT ATAAACTGAA GAATCAGACA CTGTCGTTCA AGGCTGATAT TCAGACAGGT GCGAGCACGG TCTCCGTCGA GGGCACGTAC ACGGTGAACT TCGCATTCGC TTGCGCTGTC GAAACGAATG AAGAGGCCGC CCTTTCATGG GGAACGAGCG TTTCGCCCTG GAACGTTCGA CTGGGACGGC AGTTCGTCGG CAATTTGACA GCTGGAAGCA ACAATGTGCA GGGGAAGTAT GCTCAATATA ACTGCTTTAT CCAAACGCGG AACATCGATT TGGCCGATGC GCCTGTCGTG AGCACGTCTA AATTCACTCA GCCGTTCTTA GGCTCGACGT ACGACGGAGG CGGCAACAGC ATATTCGGAA TACCGTACCG ACTGGCACAT GAAGGGATTA CCGGTGAGAA CCAGGGAAGA AACGGTCTGT TTCTTGATGT TGGCGGGGAG AGCCTGATTA AAAACGTGAA CATCGTGCTG GATCCCACCG ATGAAGGCTC AGAGTATGTG TTCGTTTCCA CCCATGGAAG CAAACTTCGA TTCGGACTCC TCGTCGGGTC GATTGACGGA GACGGAGCCG ATATTCAAAA CTGCACGGTC GGCGTTGCCG GCAAGGAGAC GGCTACGCTT AGGATCATCA AAAAGGGTAC AGGTGGCGAG GCGTATATCG GCGGTTTGGT GGGCTATGCA AGTCAAGCCA ATGTGAGCGA TCTATCGGTC AAGAATCTTC GTATTGAGGT TGTCGCCGAT GTCGAAGCAT GGAAAACCTC TCCCGCGATC GGGGGGATCT TCGGCTACGG CTCGCAGTTG AACATGTCGA GGAGCTCCGT CGATGGCTTC GCATGCGCTC TTTTCCAGCC TACGTTCGTT AACGAGAAAT ATCCCAGCCA AAACACGCGC ATCCATTTCG GCGGTTTGGT CGGCAGCGCT CAAATGGCAG TCATTAGCCA GAACGTGAAG AGCAACGCGG TTCTGCAGGT TCCTGCCGGC CAGTTGCGAG ATTCTGTTGT TGCTGGGCGG TACGTTGGGC AAGCATCAAT GAGCGCGGTT GCTCAGAACG AATCCGGGCA GGTGTTCGTC AAATACGGCG ACCCGAGTGA AGGCGGGGAA ACCGTTGAGG TGACTGCCGA CGTCGGGGTG CAGTAA
|
Protein sequence | MAKRGRHTEG FTLAELLMSV AIILILAAIA FPSIVSAQNN MRMLELNNAA QSIANAAQAQ MTAKKVSGTW VDAVKDGDSY RACFPAALAG SSHSQGETED VSRETLGSDE VAPPSASAAE SSKRYMTADT AREQGIVPAL AVEEAVRDAD YIIEFDADTA QVTGVFYADG RSGFFGSTPA STNAAKTYYE TEGASTDQAA RMGHDPMIGY YGGTPAGATP EKALANPVIW VDEATGCLMV QDPNIAADGS AGSTTSTVAI ENTGKNVAFS ISGLSNGTTM VSLYTADDGS ESVGFTNFAA AIKQQTRDNA NVKGNVWAID LNALSQLVAK GNDGKPAADD SQKAKLKQVF DACVAGDALT VSVETKDASR SCVPGTAAAH VEWPSPAGKL TMLITNPYSV VVAGEKNEAY IEPQVRSAVA DSAHPSIGAG FGNGDGVVKD GLTVNPFYED KDNHFRVSNA NAQLKQENPQ AGYQSYAGGW IASSSVRDDA TYRLEGTVGA YNNHAYQIWE LWIKRADTGE CMRVGYLNDG KWEWGVFNQK GVNYDYRFLN DCFTWYGTNE TGDASGTLAG TDTDTNNVIS LRLDVQKFYA EAENHKNHGL ADEDGNATVY VRTAPKASEV QAYFNKLAVP EPASVENPLK AAYLSGSAQE TGSRTADTPS VTARAAFEGE FGASSSDVSW AVSQTTTAGF SQGSEYLSTA ATVPVRVYYS IAPGVGFANI RSYDNGNGSG YLSGVLSTRL TNVSLWLYRG PSINDLAVMP PALLKNYKGL EFSCRQGSTY DFKITTQEDY RFYRALAYTV ENGSAPPSQY VPHASASDES VAKIAAAENY ETDDKLYTFK GWTTKDTMSG AELLVEADKL VSDYDGLSYQ GTTLVASYDE RKKVQPSLGM MYIETGTDKT GSPAYGYYGY IENNAAAVEN LLSNEYSVTD GGYYVVVPTG SNQPKMTGRG DIPNYLKDFS DVLQGLSIEG GLYDCYRITV SDKDVGQNRP YDKYKLKNQT LSFKADIQTG ASTVSVEGTY TVNFAFACAV ETNEEAALSW GTSVSPWNVR LGRQFVGNLT AGSNNVQGKY AQYNCFIQTR NIDLADAPVV STSKFTQPFL GSTYDGGGNS IFGIPYRLAH EGITGENQGR NGLFLDVGGE SLIKNVNIVL DPTDEGSEYV FVSTHGSKLR FGLLVGSIDG DGADIQNCTV GVAGKETATL RIIKKGTGGE AYIGGLVGYA SQANVSDLSV KNLRIEVVAD VEAWKTSPAI GGIFGYGSQL NMSRSSVDGF ACALFQPTFV NEKYPSQNTR IHFGGLVGSA QMAVISQNVK SNAVLQVPAG QLRDSVVAGR YVGQASMSAV AQNESGQVFV KYGDPSEGGE TVEVTADVGV Q
|
| |