Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0788 |
Symbol | |
ID | 8415078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 981631 |
End bp | 983403 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645023754 |
Product | TRAG family protein |
Protein accession | YP_003181151 |
Protein GI | 257790545 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3505] Type IV secretory pathway, VirD4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCCG CGCTCGTGGC TGCAATAGTG GCAACGCCTG CGGCAACTGC CACCGCGAAC CTCTACGCCG CCAACCTCAC GTCGCTTCCA GGAACGCCCA TCGAGAACCT GGCGGAAGCG GCGGCGACCC TGCCCTCGTA CTTTTTGGCC GGAGGTGGCT TCTCCGCAGA CCCCATCGCC CTCTGCGCTG CGGCCTTGGG CGCCTGCGCG GTGTGGGTCG CCTATGCCAA CCATCTCATG GGCAGCGCCG GCACGCGCAA GGGCGAGGAG CATGGAAGCT CGCGTTGGGC GACTAAAGCC GAGATGCGTG CGTTCGCTGA CGCCAAGGAC TCCGACAACA ACATCATCCT GACGCAGAAC TGTGCCCTCG CCCTCGTGCC GAGGAGGTTC GACCAGAAAA CGGACCGCAA CAAGAACGTC ATGGTGGTCG GTGGATCGGG GTCTGGCAAG ACCCGCTATT TTGTCAAGCC TAACCTCCTT CAGATGAACT GCAGCTACTT CGTCACCGAC CCGAAGGCCA CGCTCGCGGC CGAGCTGCGT GAGGCTTTGG AATCCGCGGG ATACCGGGTC GTCGAGTTCA GCACCATCGA CATGAATGGC TCGGCGCACT ACAACCCGAT CGCGTACGTG AAAAACGAGG CCGACGTGCT CACCTTTGTG GAATGCCTGA TAAGGAACAC CTCGGGCGAC GGCGAGCACT CCGGCGACCC CTTCTGGGAA GAGGCCGAGC GGCTTCTGTA CGTCGCACTC GTCTCCTACC TGGTGTTTCA TTGCCCAGAG GCGGACCGCA ACGTGCCGGG TCTGCTCACC CTTTTGGGCT TGGCCGAGGC GCGTGAGGAG GACGAGGACT TCAAAAGCCC GCTCGACATC GTCTTCGAGG AGCTCGAACG TGGGGCGTGC TTCACCCAAG TCGGCGCTCG CAATGCGTTC GACGCGGAGA GCCGCGGCTT CGATGACGCC GCCGGCGCTT GGCGCTGGAT CGGGGTCGCG AACCCCGTGA GCGCGCAAGA AGACTTCGCG CTGTCCAACT ACAAGGCTTT CAAGACGGCC GCCGGCAAGA CCCTCAAGAG CATCATCATC TCATGCAACG TGCGGCTGAA GCCGCTGTCG GTCAAGGAAG TTTCCGCGCT GCTGAGCCGC GACGAGATGA GCCTCGGATC AATGGGAGAC GCCGGGCAGA AGACATGCGT CTTCGCATCC ATGAGCGACA CCAACCCGAC CTTCAACTTC CTGTTCACGC TGCTCATGTG GCAGACGGTC GACGCCCTTT GCAATACGGC GCTTGAGCGC CACGGCGGAA GCCTGCCGAC CCACGTGCAC TTCGTCTTCG ACGAGTTCGC GAACATCGGC GAGCTCCCGA ACTTCGAGCA GACTATATCG ACTGTGCGAA GCAGGAACAT CTCCTGCTCG GTGATAGTTC AGTCGTTCGC GCAGCTTGAG AAGCGCTACC GCGAGGGCGC CGAGATCATC AAGGACAACT GCGACACGAC GCTGTTCCTT GGCGGCAAGG CGGTGAAGAC GAACAAGGAG ATCTCCGAGG CGATCGGCAA GGAGACCGTT TCGGCCATGA CCGTGAACGA CTCGCGCGGC CAAGGCTCGT CCACCACGCG CAACCGGCAG ATCATCGAGC GCGACCTCAT GCAGGCAAGC GAGGTTGGCC GCCTGCCAAG GGACGAGGCG ATCGTGCTCA TCGCGGGCGC CAACCCCATC CGCGACAAGA AGTACAGGCT CGAAGGGCAC CCGCGCTACG GGATGCTGGC GAAAGGCAGG TGA
|
Protein sequence | MDAALVAAIV ATPAATATAN LYAANLTSLP GTPIENLAEA AATLPSYFLA GGGFSADPIA LCAAALGACA VWVAYANHLM GSAGTRKGEE HGSSRWATKA EMRAFADAKD SDNNIILTQN CALALVPRRF DQKTDRNKNV MVVGGSGSGK TRYFVKPNLL QMNCSYFVTD PKATLAAELR EALESAGYRV VEFSTIDMNG SAHYNPIAYV KNEADVLTFV ECLIRNTSGD GEHSGDPFWE EAERLLYVAL VSYLVFHCPE ADRNVPGLLT LLGLAEAREE DEDFKSPLDI VFEELERGAC FTQVGARNAF DAESRGFDDA AGAWRWIGVA NPVSAQEDFA LSNYKAFKTA AGKTLKSIII SCNVRLKPLS VKEVSALLSR DEMSLGSMGD AGQKTCVFAS MSDTNPTFNF LFTLLMWQTV DALCNTALER HGGSLPTHVH FVFDEFANIG ELPNFEQTIS TVRSRNISCS VIVQSFAQLE KRYREGAEII KDNCDTTLFL GGKAVKTNKE ISEAIGKETV SAMTVNDSRG QGSSTTRNRQ IIERDLMQAS EVGRLPRDEA IVLIAGANPI RDKKYRLEGH PRYGMLAKGR
|
| |