Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1919 |
Symbol | |
ID | 8535077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2052356 |
End bp | 2053447 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646384300 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_003263788 |
Protein GI | 261856505 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.402452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATGGC TGAATACTAT GAATAGGGCG ATCGGTTTTT TGTGGGTCTG GCTGCTCACG ATGGGGCTTT TATTCGCCAC CGTATGGATG GTCGGCCGGA TTGGTGTTTT TCTCACCCAG CCCTTACTGC CCTCAGGCGC TGCGGCGATC ACGATTGAAA TCCCCAGTGG TGCGGACGCG CGGCAAATCG CTAAAATCGC TGATGCCTCC GGTGCGAGGG TCAATCCGAC CGTATTCGTG TGGGCGGCTC GATTGAGTGG TAAAGCGCGC TCAATTCAGG CGGGTGCGTA CCAGATTACC GATCAGGATC GGTTGCTCGG TTTTCTCGAT CGACTGGTCG AAGGCGATGT GGTGCGTTAC CGCATCACCA TCCCCGAAGG GGATACCGCC CAGGATTTCC TGAACAAGCT CGCGGCGCAA AAGGAAATAA AACACACGCT GAACGGGCTG GATCAGGCCC AGATCATCGC CGAGATGAAT TGGCCGATCA CCCATCTCGA AGGTTGGTTG TTCCCCGATA CGTATGTATT CACACGCGGC ACCACGGACA AAAAGATTCT GCAGGAAGCC TACCGCTCGA TGCGGTCTCA TCTGGACGCG GCATGGGCGG ATCGCGCACC CGGGCTGCCC TTGAAAACGC CCTACGATGC ACTGATTCTG GCTTCTATCG TGGAAAAGGA AACCGGCTTG CCCGATGAAC GCGCCATGGT GGCGGGTGTA TTCATCAACC GATTGAACAT CGGAATGCGG TTGCAGACGG ATCCGGCTGT CATCTACGGC GTGGCGGAGG CAACTCAGGG ACAGGTTGAC GAGGACAGTT CGCCACGAAG TCTGACGCTA AGCCAGCTGC GCGCCGATAC GCCGTACAAT ACCTACACCC GCACCGGTTT GCCGCCGACG CCGATTGCCC TGCCATCCGC AGCTGCATTG CAGGCTGTGA CGCATCCCGA TAAAACGGAT GCCCTGTATT TTGTTGCCAA TGGCACGGGC GGACACACCT TTTCGCGCAC ACTGAAAGGA CACAATCAAG CCGTGCAGAC CTGGCGTAAA ATTGAAGATA CGCGGGCATC CGAACCGAAA AAAAAGCAAT GA
|
Protein sequence | MKWLNTMNRA IGFLWVWLLT MGLLFATVWM VGRIGVFLTQ PLLPSGAAAI TIEIPSGADA RQIAKIADAS GARVNPTVFV WAARLSGKAR SIQAGAYQIT DQDRLLGFLD RLVEGDVVRY RITIPEGDTA QDFLNKLAAQ KEIKHTLNGL DQAQIIAEMN WPITHLEGWL FPDTYVFTRG TTDKKILQEA YRSMRSHLDA AWADRAPGLP LKTPYDALIL ASIVEKETGL PDERAMVAGV FINRLNIGMR LQTDPAVIYG VAEATQGQVD EDSSPRSLTL SQLRADTPYN TYTRTGLPPT PIALPSAAAL QAVTHPDKTD ALYFVANGTG GHTFSRTLKG HNQAVQTWRK IEDTRASEPK KKQ
|
| |