Gene Information |
Locus tag | Hneap_2109 |
Symbol | |
ID | 8535268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2261083 |
End bp | 2262591 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646384486 |
Product | anthranilate synthase component I |
Protein accession | YP_003263973 |
Protein GI | 261856690 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
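The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2261083
end_bp = 2262591

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
protein_length = (gene_length - 3) // 3

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1509 bp) and Protein Length (502 aa).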
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.490453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTTG CTCCAACATC TGCTCAATTG AAGGCACTTG CCGCCGAAGG CTATAACCGC GTCCCGCTCA CCCGCGCCAT ATCCGGCGAT TACGACACGC CGTTGTCGGT CTATCGCAAA TTGGCCGACG CGCCCAACAG TTATTTGTTC GAATCGGTTA TGGGTGGCGA ACGCTGGGGG CGTTATTCGA TCATCGGTCT CGCGGCCCGT ACGGTGCTGC GTGTTTATGG TCACAAGATC GAAGTGCGCC GCGACAATGA ACTGATTGAA ACCACCGAAG CGGATGATCC ACTTGCCTGG GTCGAGTCGT TCAAGGGCCG TTTCCGTGTG TTCGAGCCCG AAGGTATGCC GCGTTTTCAT GGCGGGCTGG TGGGTTATTT CGGCTTCGAA ACCATTCGCT ATATCGAGCC ACGGCTGGCA GCCAGCCCGC CCAAGCCCGA CCCGTTGGGC ACGCCCGATA TTTTGCTCAT GGTGTCCGAA CAGGTCGTGG TGGTCGATAA CTTGTCCAGC CAGCTTCTGC TGGTGACACT GGTTGATCCG GCCCAAGCCG ATGCCCTTGA AGCGGGCGCG CATCATCTGG ATGCCCTGAC CGAGCGCCTG CGCAGCGAGC AGGTGCATTA CGCGCCACTG GCCCAGCCTA GGCATATCGA CGAAAACGAC TTTACAGCCA GTTTCACCCG CGAAGGTTAT GAATCGGCCG TGCTGAAAAT CCGAGAATAC ATCGCGGCAG GCGACGTGAT GCAGGTGGTG CCCAGCCAAC GGATGACCAT TGGCTACGAT GCGCCACCGA TCGACCTGTA TCGCGCCCTG CGCAGCCTGA ACCCCTCGCC TTACATGTAT TACATCGATT GCGGCGATCA TCAGGTCATC GGCTCAAGCC CCGAGATTCT CGCCCGTCTC GAAGACAACG AAATCACCGT GCGCCCGATT GCAGGCACAC GCCCGCGCGG CAAAACCCAT GCGGAAGATC TCGCGCTCGA ACAGGAACTG CTGAGTGACC CCAAAGAAAT CGCCGAGCAC GTCATGCTCA TTGATCTGGG GCGCAGCGAT ACCGGTCGTG TCGCTGAAAT AGGTTCCGTT AAACTCGAAG AACGCATGAT CATCGAACGT TATTCGCATG TGATGCACAT CGTCTCGCAA GTGACCGGTC AGCTTAAAGC AGGTCTCAAT GCCATCGACG TACTCCGCGC CACCTTCCCG GCAGGCACAG TAAGCGGCGC GCCGAAGATC CGCGCACTGG AAATCATCGA TGAACTCGAA CCCGTCAAGC GCGGCGTCTA TGCCGGTGCC GTCGGTTACT GGGCATGGAA CGGCAACATG GACACCGCCA TCGCCATCCG CACCGGCGTC CTCAAAGATG GTGAACTCCA TATCCAAGCC GGTGGCGGCA TCGTCGCCGA CTCCATTCCT GCCAACGAGT GGGAAGAAAC CCTTAACAAA CGCCGAGCCC TGTTCCGCGC CGCCGCGGTC GCGCAGGCGG GGGTGGATGG CGCAGTAAGG CGCGGTTGA
|
Protein sequence | MSFAPTSAQL KALAAEGYNR VPLTRAISGD YDTPLSVYRK LADAPNSYLF ESVMGGERWG RYSIIGLAAR TVLRVYGHKI EVRRDNELIE TTEADDPLAW VESFKGRFRV FEPEGMPRFH GGLVGYFGFE TIRYIEPRLA ASPPKPDPLG TPDILLMVSE QVVVVDNLSS QLLLVTLVDP AQADALEAGA HHLDALTERL RSEQVHYAPL AQPRHIDEND FTASFTREGY ESAVLKIREY IAAGDVMQVV PSQRMTIGYD APPIDLYRAL RSLNPSPYMY YIDCGDHQVI GSSPEILARL EDNEITVRPI AGTRPRGKTH AEDLALEQEL LSDPKEIAEH VMLIDLGRSD TGRVAEIGSV KLEERMIIER YSHVMHIVSQ VTGQLKAGLN AIDVLRATFP AGTVSGAPKI RALEIIDELE PVKRGVYAGA VGYWAWNGNM DTAIAIRTGV LKDGELHIQA GGGIVADSIP ANEWEETLNK RRALFRAAAV AQAGVDGAVR RG
|
| |