Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_3046 |
Symbol | |
ID | 8417381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3542404 |
End bp | 3543711 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645026026 |
Product | hypothetical protein |
Protein accession | YP_003183378 |
Protein GI | 257792772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCCG TTATTGTGAT CCCCACTTTC GTGTCCGCGC GTCGCCGCAA AGAAGGCGGC AGCGTGCTCA CGACCTATGA TCACGCGACT CCCATCTCGC AACCCGGCGA GCTCCCGCGT TTGCTTGCGT CGTTGCAGAA GGTGCGCGGC GCCGGCCAGA TCATCGTGCT CGTGGTCAGC GAGCCGTCGA TCGAGAACCA GGCGGCCGAG AAGGTTCAAA GCGTTGTCTC GCGCTTTCCC TCGCTGAACA CCGTGGTCAT CGGCGCTCCC GAGCTGGCAC TTATCCAGCA GCGCATGGAG CAGCTGGGTT TGGGCAAGCT GCAGAAGGAG ATCGGCCTGT CCGGCTACGG TGCGGTGCGC AACCTGGGTC TGGTGATGGC CGACGTGCTG GGCTTCGACT CGGTGGTGTT CCTCGACGAC GACGAAGTGG TGGATGACGC CGACTTCCTG CAGAAGGCCA TGTACGGCCT GGGCAAGCTC ACGAAGAAGG GCATTCCCAT CCTGGCCAAG ACCGGCTTCT ACTTCAATTC CGAAGGCTCC TACCTGTCGA AGAGCCAGGA CAAGTGGTAC AACCATTTCT GGCAGCAGGG AAAGGCCTTC AACAAATGGA TCTCGAAGGC CATGCGCGGC CCTCGTCTTT CCCGATCGAA CCATACGTGC GGCGGCTGCC TTGCTTTGCA TAAAGAGGCG TTCAAGCGTC TGTCGTTCGA TCCTTGGATC GCGCGCGGCG AAGATCTCGA TTACATGCTT GACCTGCGTA TGTACGGTTC GGACATCTGG TTCGACAATC AGTGGAGCCT GCGCCACCTT CCTCCCGAAA CCGAGAGCGA GGGCACGCGC TTCCGTCAGG ATATCTTCCG ATGGCTCTAC GAATACCGGA AGATGGAGTA CAGCCGCACG CAGATCGACC TTTTGCAGGT GAAGCCGTCT TCGCTGGAGC CGTATCCGGG CCCGTTCCTT GAGCCAGGCA TCACGAAGCG CATTCGTTTG ACCGCCTTTC TGAGGAGCTT GGCGCGCCCC GACAAGAAGG CGTACCGGAA AGCGGCGAAG GCGGCCACCG GCGAAGCGAC GACGTATGCC CAGCGCAACT GCTCGAAGTA CTTCGAGTTC CAGTTCGTGT GGCCGGAGCT GATGGCGCGC ATGGAGAACG ATCAGATCCT GCGTACGGCG CTTATGCAGT CGGCCGCGCA GCGCCAGGCC AGCGCCGGCA ACGGAGCCGA TCGACTTGCT TCGGCGCAGG CGGCCATCGC GGCGGCCGGC ATCGATCCGG GTGTGACGAG CGAGATTCGC CTGAACGTCG CGGAATAA
|
Protein sequence | MNPVIVIPTF VSARRRKEGG SVLTTYDHAT PISQPGELPR LLASLQKVRG AGQIIVLVVS EPSIENQAAE KVQSVVSRFP SLNTVVIGAP ELALIQQRME QLGLGKLQKE IGLSGYGAVR NLGLVMADVL GFDSVVFLDD DEVVDDADFL QKAMYGLGKL TKKGIPILAK TGFYFNSEGS YLSKSQDKWY NHFWQQGKAF NKWISKAMRG PRLSRSNHTC GGCLALHKEA FKRLSFDPWI ARGEDLDYML DLRMYGSDIW FDNQWSLRHL PPETESEGTR FRQDIFRWLY EYRKMEYSRT QIDLLQVKPS SLEPYPGPFL EPGITKRIRL TAFLRSLARP DKKAYRKAAK AATGEATTYA QRNCSKYFEF QFVWPELMAR MENDQILRTA LMQSAAQRQA SAGNGADRLA SAQAAIAAAG IDPGVTSEIR LNVAE
|
| |