Gene Information |
Locus tag | Tneu_0957 |
Symbol | |
ID | 6164519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 846867 |
End bp | 847877 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641668112 |
Product | delta-aminolevulinic acid dehydratase |
Protein accession | YP_001794338 |
Protein GI | 171185419 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0113] Delta-aminolevulinic acid dehydratase |
TIGRFAM ID | |
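The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based inclusive positions and the last codon is a stop codon. Both are assumptions here; the record does not state them. A minimal sketch of that consistency check follows. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above (Tneu_0957).
length = gene_length(846867, 847877)
print(length, protein_length(length))
```

For the coordinates above this gives 1011 bp and 336 aa, matching the Gene Length and Protein Length fields.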
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.283226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000405036 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGGTACC CCGCCGCCAG GCCCCGCCGC CTAAGGACAA GCAAAATCGT TAGAGACGCG GTGGCGGAGA CAGCCCTGGA CCCCGGCGAC TTCATCTACC CCATCTTCGT CAAGGAGGGC CCCGGCCCCG AGGCCATACC CACCATGCCC GGCCAACACC GGTGGCCCGT CGGCGAGGAG CTCGTCAAAC ACGTGGAGGA GGCCCTCGCC CTGGGAGTCA ACAAGTTCAT CCTATTCGGC GTGGTGCCCG AGGAGCAGAA AGACCCCCAC GGCTCCAGGG GCTACGACCC GGAGGGCCCC GTCCCCAAGG CCCTCCGCCT CCTAAAGGAG ACCTTCGGCG ACAAGGCGCT CCTCTTCGCA GACGTCTGCC TCTGCGAATA CACAGACCAC GGACACTGCG GCGTGGTAGA GACCGCCGGC GGCAGGTGGC ACGTCGACAA CGACAAGACC ATAAAGCTCT ACGCCAAGGA GGCCCTCGTG TACGCAGACG CCGGCGCCGA CTTCGTCGCC CCAAGCGGCA TGATGGACGG CCAGGTGGCC GAGATCAGAA AAGCCCTAGA CGCCCACGGC TTCCACCACG TGGGCATCAT GGCATACAGC GCCAAATACG CCTCAGCCTT CTACGGCCCC TTCCGCACAG CCGCCGCCTC AGCCCCCAAA TTCGGCGACA GGAGGACATA CCAGATGGAC CCCAGAAACG CCCACGAAGC CCTCAAAGAA GTCGCCATGG ACCTGGAGGA GGGCGCAGAC ATAGTCATGG TAAAGCCAGC CCTCGCATAC CTAGACGTAA TCCGCCTAGT AAAACAGCAC TACCCCTGGG CCCCCCTAGC CGCCTACAAC GTCTCCGGAG AATACGCCAT GGTAAAAGCC GCCGCAGCCG CCGGATACAT AGACGAGAAG GTCACCACGC TGGAGATCCT AACCGCAATA AAAAGGGCAG GCGCAGACTT AATCCTCACC TACCACGCCC CAGAAGCCGC AAAATGGCTA AAAGACGGCA CCCCCTTCTA G
|
Protein sequence | MRYPAARPRR LRTSKIVRDA VAETALDPGD FIYPIFVKEG PGPEAIPTMP GQHRWPVGEE LVKHVEEALA LGVNKFILFG VVPEEQKDPH GSRGYDPEGP VPKALRLLKE TFGDKALLFA DVCLCEYTDH GHCGVVETAG GRWHVDNDKT IKLYAKEALV YADAGADFVA PSGMMDGQVA EIRKALDAHG FHHVGIMAYS AKYASAFYGP FRTAAASAPK FGDRRTYQMD PRNAHEALKE VAMDLEEGAD IVMVKPALAY LDVIRLVKQH YPWAPLAAYN VSGEYAMVKA AAAAGYIDEK VTTLEILTAI KRAGADLILT YHAPEAAKWL KDGTPF
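The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The sketch below strips the whitespace, computes a GC fraction, and translates a CDS. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG. All names in the block are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and newlines, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAGGTACC CCGCCGCCAG GCCCCGCCGC"
print(translate(prefix))
```

Pasting the full Gene sequence field into `translate` and `gc_fraction` is one way to compare the result against the Protein sequence and GC content fields of the record.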
|
| |