Gene Hneap_2020 details


Gene Information

Locus tag: Hneap_2020
Symbol:
ID: 8535179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 2165889
End bp: 2166899
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 56%
IMG OID: 646384402
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_003263889
Protein GI: 261856606
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
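
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The record states neither convention. Variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2165889
end_bp = 2166899
reported_gene_length = 1011     # bp
reported_protein_length = 336   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions both comparisons hold: 1011 bp gives 337 codons, and 336 amino acids remain once the stop codon is excluded.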


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000713321
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTCTG CTGCCAAACG ACCCATTGTC CTTTCGGGGA TTCAGCCTTC CGGCCAATTA 
TTGATCAGTC ATTATGTGGG GGCGATGCGC AACTGGGTGG CGATGCAGGA TACCCATGAT
TCACTTTTCA TGTTGGTGGA TCTGCACGCC ATTACCGTTC GGCAGGACCC AGCGGTCTTT
CGTGCGCGCT GTTACGACTT CGTGGCGCTT TATCTGGCTT GCGGGCTCGA TCCCCAGAAA
AATACGATTT TCGTGCAGTC CCATGTTTCT GCTCATGCGG AACTCGGTTG GTTGCTCAAC
TGCTATACGA ATATGGGCGA GTTGGAACGG ATGACCCAGT TCAAGGATAA GTCCACGCGC
GCAGGGGCGG TGATCAATGT GGGCTTGTTC GATTACCCTG TGTTGATGGC GGCGGATATT
TTGCTCTATC AGGCGACGCA CGTACCCGTG GGCGCCGATC AAAAGCAACA TCTGGAGCTT
ACCCGCGATC TGGCGATTCG ATTCAATCAC ATTTATGGGG GCGTATTTAC GGTGCCTGAA
CCGTTCATTC CCGACACGGG CGCACGCATC ATGTCGTTGC AGGAGCCAAC CAAAAAGATG
TCCAAGTCGG ACCCGAGCGA ATTGAGTTAT GTCGGCCTGT TGGACGAACC CAAAACGATT
CTGAAAAAAT TCAAACGCGC GGTGACTGAC TCGGACACGG CCATTCGTTT TGATGTGGAA
AACAAGCCCG GTGTTTCTAA CCTGCTGACC TTGTTTTCTA TTTTCTCCGG CGAATCCATT
CCCGATCTGG AAACCCGGTT GGACGGGCAG GGTTACGGCA CGTTGAAGGT TCAGACTGCC
GAGGCGGTGA TCGCCTTTCT GGAGCCGATA CAGGCTCGTT TCCATGAACT GCGTGCCGAT
GAAGCGGCAC TCGATCGCAT CCTCGCCGAT GGCGCCGCCC GTGCGCGGGC CCGTGCGGAG
CCGACCTTGC AGCGGGCCTT TGACGTGCAC GGCTTTCTGC CGCGCCGCTG A
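
The GC content reported in the Gene Information section can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the fraction of G and C bases among all bases, a definition the record does not spell out. The function name `gc_content` is hypothetical, and only the first ten bases are used to keep the example short.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First ten bases of the gene sequence; paste the full sequence
# (spaces and line breaks are ignored) to compare with the record.
first_block = "ATGACTTCTG"
print(f"{gc_content(first_block):.0%}")
```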
 
Protein sequence
MTSAAKRPIV LSGIQPSGQL LISHYVGAMR NWVAMQDTHD SLFMLVDLHA ITVRQDPAVF 
RARCYDFVAL YLACGLDPQK NTIFVQSHVS AHAELGWLLN CYTNMGELER MTQFKDKSTR
AGAVINVGLF DYPVLMAADI LLYQATHVPV GADQKQHLEL TRDLAIRFNH IYGGVFTVPE
PFIPDTGARI MSLQEPTKKM SKSDPSELSY VGLLDEPKTI LKKFKRAVTD SDTAIRFDVE
NKPGVSNLLT LFSIFSGESI PDLETRLDGQ GYGTLKVQTA EAVIAFLEPI QARFHELRAD
EAALDRILAD GAARARARAE PTLQRAFDVH GFLPRR
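
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the standard NCBI amino-acid string. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits. Because this gene starts with ATG, alternative start codons are not handled here. The function name `translate` is hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest)
# paired with the amino-acid string shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and the final line of the gene sequence from this record.
first_bases = "ATGACTTCTG CTGCCAAACG ACCCATTGTC"
last_line = "CCGACCTTGC AGCGGGCCTT TGACGTGCAC GGCTTTCTGC CGCGCCGCTG A"

print(translate(first_bases))  # MTSAAKRPIV
print(translate(last_line))    # PTLQRAFDVHGFLPRR
```

Both outputs match the start and end of the listed protein sequence. The final line of the gene sequence begins at base 961, so it is in frame, and its last codon, TGA, is the stop codon.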