Gene Hneap_2109 details


Gene Information

Locus tag: Hneap_2109
Symbol:
ID: 8535268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 2261083
End bp: 2262591
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 59%
IMG OID: 646384486
Product: anthranilate synthase component I
Protein accession: YP_003263973
Protein GI: 261856690
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
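
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon (the function names are illustrative and not part of the database):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2261083, 2262591)
print(length)                     # 1509
print(protein_length_aa(length))  # 502
```

Both results agree with the Gene Length and Protein Length fields of this record.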


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.490453
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTTTG CTCCAACATC TGCTCAATTG AAGGCACTTG CCGCCGAAGG CTATAACCGC 
GTCCCGCTCA CCCGCGCCAT ATCCGGCGAT TACGACACGC CGTTGTCGGT CTATCGCAAA
TTGGCCGACG CGCCCAACAG TTATTTGTTC GAATCGGTTA TGGGTGGCGA ACGCTGGGGG
CGTTATTCGA TCATCGGTCT CGCGGCCCGT ACGGTGCTGC GTGTTTATGG TCACAAGATC
GAAGTGCGCC GCGACAATGA ACTGATTGAA ACCACCGAAG CGGATGATCC ACTTGCCTGG
GTCGAGTCGT TCAAGGGCCG TTTCCGTGTG TTCGAGCCCG AAGGTATGCC GCGTTTTCAT
GGCGGGCTGG TGGGTTATTT CGGCTTCGAA ACCATTCGCT ATATCGAGCC ACGGCTGGCA
GCCAGCCCGC CCAAGCCCGA CCCGTTGGGC ACGCCCGATA TTTTGCTCAT GGTGTCCGAA
CAGGTCGTGG TGGTCGATAA CTTGTCCAGC CAGCTTCTGC TGGTGACACT GGTTGATCCG
GCCCAAGCCG ATGCCCTTGA AGCGGGCGCG CATCATCTGG ATGCCCTGAC CGAGCGCCTG
CGCAGCGAGC AGGTGCATTA CGCGCCACTG GCCCAGCCTA GGCATATCGA CGAAAACGAC
TTTACAGCCA GTTTCACCCG CGAAGGTTAT GAATCGGCCG TGCTGAAAAT CCGAGAATAC
ATCGCGGCAG GCGACGTGAT GCAGGTGGTG CCCAGCCAAC GGATGACCAT TGGCTACGAT
GCGCCACCGA TCGACCTGTA TCGCGCCCTG CGCAGCCTGA ACCCCTCGCC TTACATGTAT
TACATCGATT GCGGCGATCA TCAGGTCATC GGCTCAAGCC CCGAGATTCT CGCCCGTCTC
GAAGACAACG AAATCACCGT GCGCCCGATT GCAGGCACAC GCCCGCGCGG CAAAACCCAT
GCGGAAGATC TCGCGCTCGA ACAGGAACTG CTGAGTGACC CCAAAGAAAT CGCCGAGCAC
GTCATGCTCA TTGATCTGGG GCGCAGCGAT ACCGGTCGTG TCGCTGAAAT AGGTTCCGTT
AAACTCGAAG AACGCATGAT CATCGAACGT TATTCGCATG TGATGCACAT CGTCTCGCAA
GTGACCGGTC AGCTTAAAGC AGGTCTCAAT GCCATCGACG TACTCCGCGC CACCTTCCCG
GCAGGCACAG TAAGCGGCGC GCCGAAGATC CGCGCACTGG AAATCATCGA TGAACTCGAA
CCCGTCAAGC GCGGCGTCTA TGCCGGTGCC GTCGGTTACT GGGCATGGAA CGGCAACATG
GACACCGCCA TCGCCATCCG CACCGGCGTC CTCAAAGATG GTGAACTCCA TATCCAAGCC
GGTGGCGGCA TCGTCGCCGA CTCCATTCCT GCCAACGAGT GGGAAGAAAC CCTTAACAAA
CGCCGAGCCC TGTTCCGCGC CGCCGCGGTC GCGCAGGCGG GGGTGGATGG CGCAGTAAGG
CGCGGTTGA
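
The GC content field in Gene Information can be recomputed from the gene sequence. The sketch below is one way to do it, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "ATGTCGTTTG CTCCAACATC TGCTCAATTG AAGGCACTTG CCGCCGAAGG CTATAACCGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing all the sequence lines to `clean_sequence` instead of just the first gives the value for the whole gene.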
 
Protein sequence
MSFAPTSAQL KALAAEGYNR VPLTRAISGD YDTPLSVYRK LADAPNSYLF ESVMGGERWG 
RYSIIGLAAR TVLRVYGHKI EVRRDNELIE TTEADDPLAW VESFKGRFRV FEPEGMPRFH
GGLVGYFGFE TIRYIEPRLA ASPPKPDPLG TPDILLMVSE QVVVVDNLSS QLLLVTLVDP
AQADALEAGA HHLDALTERL RSEQVHYAPL AQPRHIDEND FTASFTREGY ESAVLKIREY
IAAGDVMQVV PSQRMTIGYD APPIDLYRAL RSLNPSPYMY YIDCGDHQVI GSSPEILARL
EDNEITVRPI AGTRPRGKTH AEDLALEQEL LSDPKEIAEH VMLIDLGRSD TGRVAEIGSV
KLEERMIIER YSHVMHIVSQ VTGQLKAGLN AIDVLRATFP AGTVSGAPKI RALEIIDELE
PVKRGVYAGA VGYWAWNGNM DTAIAIRTGV LKDGELHIQA GGGIVADSIP ANEWEETLNK
RRALFRAAAV AQAGVDGAVR RG
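
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the codon-to-amino-acid assignments of table 11, which match those of the standard code. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCGTTTG CTCCAACATC TGCTCAATTG"))  # MSFAPTSAQL
print(translate("CGCGGTTGA"))                         # RG
```

The two sample inputs are the first thirty and the last nine bases of the gene sequence. Their translations match the first ten and last two residues of the protein sequence.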