Gene Information |
Locus tag | Elen_0103 |
Symbol | |
ID | 8414386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 131765 |
End bp | 132796 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645023082 |
Product | transglutaminase domain protein |
Protein accession | YP_003180486 |
Protein GI | 257789880 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG5279] Uncharacterized protein involved in cytokinesis, contains TGc (transglutaminase/protease-like) domain |
TIGRFAM ID | |
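
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Elen_0103.
length = gene_length(131765, 132796)
print(length)                  # 1032
print(protein_length(length))  # 343
```

Under these assumptions both results agree with the Gene Length (1032 bp) and Protein Length (343 aa) fields.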
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCACCTGC TCGGCATCAC AGCCGCTGGA CGGAAACGAC GCGACGGAGC TTGCACGCCG CAACATCGGC GCGCCCGCAC GCTGCGTGCG ATCGTCCGCA GCGCGTGCGC GGCGGTTCTG GCGACGACGC TCGTCGCGCT GGGAGGATGT AGCGAAAACG ACGCGGTCGG CAAAACCTCG AACCCCTCAG GCGGCCAGAG CCAGACGAGC GGCCCGGCCT ACGAGCGGCC GGAGCTTGCG CTCTCCCCGT TCGACCAAGC GGCGGCCGGC GGCGAAAACG GCGTGAGCAT CGACGCTTCA AACCTGGCGA AAGGCTACGT GGCCGTCTCG GCAACTGCAT CAACCCGCTT GAAGCTCCAA GTTTCGTTTG AGGGTTCGGA AACGCAGTAT TATTTCGATC TGCCCAGCGA CGGCACGCCC ATCTCGTGCC CTCTCGTTCA AGGCAGCGGC GCGTACACGT TCACCGTCTG GGAAAACACT ACCGGACAAC GGTATTCGGA GCTGTACTCG CTCGCCGACC AGCCTGTGAC GCTGGCTGAC GAATTCCAAC CGTTCATCCG ACCGAGCGTC TACTGCGACT ACGACGCATC CAGCAAAAGC ACGCAGCTGG CGAACGACCT GTCCGCCGAC GCCCAGAACG AGGGCGACGT GGTGCGCGGG ATCTACGACT GGATCGTTGA AAATATCGCA TACGACGAGG ACAAAGCCGC TCGGCTTGCC GATGCGACGG GCTACCTTCC CAACCCCGAT TCGTGCATCG CCGACGGCGC GGGCATCTGC TTCGACTACG CATCGTTGGC GGCCGCCATG CTGCGCAGCC AAGGGATCCC CTGCAAGATA ATCACCGGGT ACGTATCGCC CGATAATATA TACCACGCTT GGAACATGGT GTATATTGAC GGCACATGGG TCGATGCCCA TATCGATATC AAGCAGAACA CGTGGACGCG TATCGATACG ACGTTCGCGG CCGGAAGCGG GTCGAGCTAC GTGGGAGACG GAATCGCGTA CACCGACCTC TACACGTACT GA
Protein sequence | MHLLGITAAG RKRRDGACTP QHRRARTLRA IVRSACAAVL ATTLVALGGC SENDAVGKTS NPSGGQSQTS GPAYERPELA LSPFDQAAAG GENGVSIDAS NLAKGYVAVS ATASTRLKLQ VSFEGSETQY YFDLPSDGTP ISCPLVQGSG AYTFTVWENT TGQRYSELYS LADQPVTLAD EFQPFIRPSV YCDYDASSKS TQLANDLSAD AQNEGDVVRG IYDWIVENIA YDEDKAARLA DATGYLPNPD SCIADGAGIC FDYASLAAAM LRSQGIPCKI ITGYVSPDNI YHAWNMVYID GTWVDAHIDI KQNTWTRIDT TFAAGSGSSY VGDGIAYTDL YTY
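
The sequences above are displayed in space-separated blocks of ten characters. Before computing anything such as GC content, the blocks need to be joined. The following is a minimal sketch of that step; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence shown in the record.
first_blocks = "ATGCACCTGC TCGGCATCAC"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 60.0
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field. The displayed gene sequence starts with ATG and ends with TGA, so it is already in coding orientation even though the gene lies on the minus strand.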