Gene Hneap_1377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1377 
Symbol 
ID8534533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1485655 
End bp1488513 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content56% 
IMG OID646383768 
Productvalyl-tRNA synthetase 
Protein accessionYP_003263258 
Protein GI261855975 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAA CCTACAATCC TGCCGAGATC GAAGCCCCTT GTTATGCGCG CTGGCAAGCG 
GGAGGCTACT TCTCTCCCGA TGCCAGTCTG CCCGCCGATG CACCGAATTA TTGCATCATG
CTGCCGCCGC CGAATGTGAC CGGCCGCTTG CACATGGGTC ATGCCTTTCA GGATACCTTG
ATGGACATGC TCACCCGTGT CCACAGGATG CAGGGCGAAC GTACGCTCTG GCAACCCGGC
ACGGACCATG CGGGCATTGC CACGCAAATG GTGGTGGAGC GTCAGCTCGA GGCCGAGGGC
AAAACGCGGC ATGATCTGGG CCGCGAGGCA TTCACCGAGC GCGTCTGGCA ATGGAAAAGC
GAATCGGGCG GGTTTATCAC TGAACAGATG AAACGCCTGG GCGCATCCTG CGACTGGTCA
CGCGAGCGCT TCACCATGGA TGATGGTTTG TCCGATGCGG TGCGCGAAGT TTTTGTGCGC
TTGTTTGAAG ATGGGCTGAT TTATCGCGGT AAACGGCTGG TGAACTGGGA CCCGGTCTTG
CATACCGCCG TATCCGATCT TGAAGTCATC AGCGAAGAAG AAACCGGCCA CCTGTGGCAT
CTGCGTTATC CGCTCACCGA TGGCGGTGGT CATTTGATCG TCGCCACTAC GCGGCCAGAA
ACCATGCTGG GCGATACCGC CGTGGCCGTT CATCCGGAAG ATGAGCGGTA TAAACATCTG
ATTGGCAAAA CTATTACCTT GCCGCTGGTG GGGCGAGAGA TTCCGATCAT CGGTGATGAT
TATGTCGATC CGGCCTTCGG CTCGGGCTGC GTGAAAATCA CGCCTGCGCA TGATTTCAAC
GATTATGCCG TCGGACAACG ACACAACCTG CCCAAAATCA ACGTGTTGAC CATCGATGCG
CGGATTCGCG AACTGCCGGA AATTATCGGC GGTGAAGAGG AGGGCGCCTT GCCTGCTCAC
TACGCAGGTC TGGATCGCTA TGAAGCCCGT GATCGCATCA TCCATGATTT CAAAGAACTC
GATTTATTGG AAAAGATCGA CGATCACAAG CTCATGGTGC CGCGCGGCGA CCGCAGTGGC
GCGGTGATCG AGCCGATGCT GACCGACCAA TGGTTCGTCG ATTTGACCCG CGAAACTCAG
GACGATGGCC GTCCCGGTGG GCTGGCCGCC ATTACGCGCC CAGCGCTTGA GGCCGTGCGC
GGCGGCGATA TCAAGTTCGT GCCGGAAAAC TGGTCGAACA CCTATTATCA ATGGCTTGAG
AATATTCAGG ACTGGTGCAT CAGCCGCCAG ATCTGGTGGG GGCACCGGAT TCCTGCGTGG
TATGACGCGT CGGGCAGGGT GTATGTCGGG CGAGACGAAG CCGAAGTTCG GGCGAAATAC
GATCTGGAAA ACACAGTGGT TTTAACGCAG GAAAATGACG TACTCGATAC CTGGTTCTCA
TCTGCACTCT GGCCGTTTTC CACCTTGGGC TGGCCGCAGA ACACACAGGA ACTGGCGTAT
TTTTACCCTA CTAGTGTGCT GGTCACCGGC TTTGACATCA TCTTTTTCTG GGTCGCGCGG
ATGGTGATGA TGGGCAAGTA CTTCATGGGC GATGTGCCGT TTCGTGAGGT GTATGTGCAT
GGCCTGATTC GAGACGCGCA AGGGCAGAAA ATGTCCAAAT CCAAGGGTAA CGTGCTCGAC
CCGATTGACC TGATCGATGG CATTGATCTC GAATCGCTGG TTGCCAAGCG CACGGCTGGC
CTGATGCAGC CCAAAATGGC AGCGAAGATC GAAAAAGACA CGCGCAAGGA GTTTGCCGAT
GGCATTCCCG CTTTTGGTAC CGATGCCATG CGTTTTACCT TTGCTGCGCT GGCAACCACA
GGGCGCGATA TTCGCTTCGA TTTGGGGCGC ATCGAAGGTT ATCGAAATTT CTGCAACAAA
CTGTGGAATG CCAGCCGTTT TGTGATGATG CAATGCGAAG ATCAAGACAC GGGCCTCACC
GATGCACCGG TGACCTTGAG CGATGCGGAC GAGTGGATTA TCGGCCGTCT GCAACAGGTC
GAGGCGGAAG TTGCCAAGCA TTTTGCCGAC TATCGCTTCG ATCTGGCAGC CCAGACGCTG
TATGAATTCA CCTGGAACGA ATACTGCGAC TGGTATCTTG AGTTCACCAA ACCAGCGCTC
AAGGCAGACG ATGAAGCCGC GCAGCGCGGC ACCCGCCGCA CCTTGGTGCG TGTGCTCGAA
GCGCTTTTAC GTTTGCTGCA TCCGATTATC CCGTTCATCA CCGAAACCAT CTGGCAGCGC
TTGGCGCCGA TGGCATTGGT TGATGTGCAA TCAACCGATA GCATCCTTGG CCGCCCTTAT
CCCGCATTTG ACGAAAGCAA GATCAATACG CAGGCTATCG AGTCGGTCGA ATGGCTGAAA
CAGGTCATTT TGGGTGTGCG CCGTATTCGT GCCGAAATGG ACATTGCGCC CAGCAAGTCG
CTCGACGTGC TGATAACGCA TGCCACCGTT GAAGAGATCG CACGATTCGA GCGGTTTAGT
GCGCTGCTGA ATTCTGTCGG TCGGATTGGA AGTGTTACCG CATTGACCGC CCAAGAGGCC
GTGCCCGAAG CGGCCATGGC ACTGGTAGGT GAGTTGCAAA TCCACATCCC GCTGGCTGGT
TTGATCGACA AGCAGGCAGA ACTTGCGCGA CTCGATAGAG AAATCGAGCG GCTAACCAAG
GAGCTGGAAA AAGCCAAAGC GAAACTCGCC AATCCGAAAT TCGCCGACAA AGCCCCGCCC
GCCGTGGTGC AGCAAGAACG CGAGCGGGAA ACCAGCTTTC AGACGCAACT CCATGATTTG
TCCGGTCAAC GCGCGCGTAT CAGCCAGATC AGCGGTTAA
 
Protein sequence
MEKTYNPAEI EAPCYARWQA GGYFSPDASL PADAPNYCIM LPPPNVTGRL HMGHAFQDTL 
MDMLTRVHRM QGERTLWQPG TDHAGIATQM VVERQLEAEG KTRHDLGREA FTERVWQWKS
ESGGFITEQM KRLGASCDWS RERFTMDDGL SDAVREVFVR LFEDGLIYRG KRLVNWDPVL
HTAVSDLEVI SEEETGHLWH LRYPLTDGGG HLIVATTRPE TMLGDTAVAV HPEDERYKHL
IGKTITLPLV GREIPIIGDD YVDPAFGSGC VKITPAHDFN DYAVGQRHNL PKINVLTIDA
RIRELPEIIG GEEEGALPAH YAGLDRYEAR DRIIHDFKEL DLLEKIDDHK LMVPRGDRSG
AVIEPMLTDQ WFVDLTRETQ DDGRPGGLAA ITRPALEAVR GGDIKFVPEN WSNTYYQWLE
NIQDWCISRQ IWWGHRIPAW YDASGRVYVG RDEAEVRAKY DLENTVVLTQ ENDVLDTWFS
SALWPFSTLG WPQNTQELAY FYPTSVLVTG FDIIFFWVAR MVMMGKYFMG DVPFREVYVH
GLIRDAQGQK MSKSKGNVLD PIDLIDGIDL ESLVAKRTAG LMQPKMAAKI EKDTRKEFAD
GIPAFGTDAM RFTFAALATT GRDIRFDLGR IEGYRNFCNK LWNASRFVMM QCEDQDTGLT
DAPVTLSDAD EWIIGRLQQV EAEVAKHFAD YRFDLAAQTL YEFTWNEYCD WYLEFTKPAL
KADDEAAQRG TRRTLVRVLE ALLRLLHPII PFITETIWQR LAPMALVDVQ STDSILGRPY
PAFDESKINT QAIESVEWLK QVILGVRRIR AEMDIAPSKS LDVLITHATV EEIARFERFS
ALLNSVGRIG SVTALTAQEA VPEAAMALVG ELQIHIPLAG LIDKQAELAR LDREIERLTK
ELEKAKAKLA NPKFADKAPP AVVQQERERE TSFQTQLHDL SGQRARISQI SG