Gene Information |
Locus tag | Elen_1755 |
Symbol | |
ID | 8416059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2061035 |
End bp | 2061955 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645024726 |
Product | type II secretion system protein |
Protein accession | YP_003182109 |
Protein GI | 257791503 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4965] Flp pilus assembly protein TadB |
TIGRFAM ID | |
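The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon; the record does not state either, but both are consistent with the listed values. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2061035, 2061955)
print(length)                     # 921
print(protein_length_aa(length))  # 306
```

Both results agree with the Gene Length and Protein Length fields above.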
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0468876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.156837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATT TGATCGGATA CGTGATCGTG ATCCTGGCGC TTGTGTGCGG GATGGCCGGT TTCGTGTCGG CCCGTCGCTT GTCGGTGATG GAGCGCGACC GCACGTTCGA GCAAGTGGAG ATATACAGAG GGGGGAGGCA GCGGAGACGG CAGCTTCAAA GAGCGCAAAC CAAATCCGGA AGCAGGCTCG AGGCGATGCT CCTGTCGGCC GGGATGGCGA TCTCGCCGGC GATGTTCTTG GTGGGAGTCG TGATCGCCGG CGCGCTCCTC GGCGCCTTCT TGCATGGGCC CCTCGGAGGC TTCGGCTTGC TTGTCGGCGT TGCCGGGGCT GCGGCCGCGG CATGGGCGCT CCTGCTGACG GCTGCGAAAA AGCGCTCGAT GCAGTTCGAA ACACAACTGG CTTCGGCGCT TCCGATGATC TCCGAGAACC TGCGTGGCGG GTCGAGCTTC GAAATGTCCA TATCCGCAGT CGCCCAGTTC ATGCCGGAGC CCATACATTC CGAGCTGCAG CGCGTCATCG AGGACGTTTC CAATGCGTCC ATCTCGCTCC CCGATGCTTT CGAGCGGCTT GCGATCCGCA TCCAAAGTTC CGAGGCGCAC CTGCTTGCGA CCGCCGTCAT GATCCAGAAA GAAGGAGGCG GGAACCTGGC TTCCGTTGTC GACACGATAG CCGAAACCAT CGCGCGCAGA ATCGAGCTGA AGAACAAAGT GCGTTCTTCG ACCGCCAACG CGCGCTTCTC CGCCGTGTTC GTGGCCGCCG TGCCTTTCGC GGTGCTCGCG TTCTTCACGT ACGGCTCTCC CGGCTATATG GATAATTTCT ACGCATGGCC TTTTTGGCCC GCCGTGATCA TCGCCGCCGC AGTGCTCGAT GGCATAGGAC TGTTTCTCAT CAGCCGTATG TACAAGTTCA AAGCAGAGTA G
|
Protein sequence | MNDLIGYVIV ILALVCGMAG FVSARRLSVM ERDRTFEQVE IYRGGRQRRR QLQRAQTKSG SRLEAMLLSA GMAISPAMFL VGVVIAGALL GAFLHGPLGG FGLLVGVAGA AAAAWALLLT AAKKRSMQFE TQLASALPMI SENLRGGSSF EMSISAVAQF MPEPIHSELQ RVIEDVSNAS ISLPDAFERL AIRIQSSEAH LLATAVMIQK EGGGNLASVV DTIAETIARR IELKNKVRSS TANARFSAVF VAAVPFAVLA FFTYGSPGYM DNFYAWPFWP AVIIAAAVLD GIGLFLISRM YKFKAE
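The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned, its GC fraction computed, and the gene sequence translated is shown below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments are shared by translation table 11; the tables differ in which start codons are permitted, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence field above.
prefix = "ATGAACGATT TGATCGGATA CGTGATCGTG"
print(translate(prefix))  # MNDLIGYVIV
```

The translated prefix matches the first block of the protein sequence field. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.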