Gene Hneap_1084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1084 
Symbol 
ID8534231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1170821 
End bp1172185 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID646383468 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_003262967 
Protein GI261855684 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.462524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGTTA TTTATGACTC CCACGCACGC GCCAAGCGTG AATTTGTGCC GATTAAACCG 
AAAACAGTGG GGATGTATGT TTGCGGCATG ACGGTTTATG ACCGCTGCCA CATCGGCCAT
GCGCGGGTAA TGGTCGTGTT CGATATGGTT GTCCGTTACT TCAGGGCAAG TGGCTATGAG
GTGCATTACG TGCGCAACAT CACGGATATC GACGACAAGA TCATTGCCCG TGCCGCCGAG
AACAAGGAAA GTATTCGCAG CCTGACCGAG CGTTATATCG AAGCTATGCA TGATGACGCG
GCCGCCTTGG GCTGCCTGTT ACCCACAGCT GAACCACGCG CTACCGAATC GGTAGGCGAT
ATGGTGGCGA TGATCGAAAG CCTGATCGAG CGCGGCCACG CCTACCGTGC AGCCAATGGC
GATGTGTACT TTGATGTCTC CACGTTTGAT CGATATGGCG AACTTTCCAA CCGTAATCCT
GATGACTTAC GTGCGGGCGC GCGCGTTGAG ATCGACGAGG CGAAAACCGA TCCGCTCGAT
TTTGTGCTCT GGAAGGCCGA AAAACCCGGA GATCCGAGTT GGGATGCGCC TTTTGGCCCC
GGTCGTCCGG GCTGGCATAT CGAATGCTCG GCCATGTCGA CCCGCGCGCT GGGGGCGACC
TTCGATATTC ACGGGGGAGG GCAGGATCTG CAGTTCCCGC ACCATGAAAA CGAAATCGCC
CAAAGCGAGT GTGCGACGGG TCATCATTAC GTCAACTACT GGATGCACAA CGGCTTTGTG
CGGATCAACG AAGAGAAGAT GTCCAAATCC CTGGGCAATT TTTTCACTGT GGCGGAGGTG
ATGCGGCAAT ATCACCCGGA AGTGATTCGG TTGTTCGTGT TGTCCAGCCA TTACCGCAGC
CCATTGAATT ATTCGGACCA GAATCTGGAT GCGGCGCGCG CCAGTCTGAC ACGCTGGTAC
ACGGCTATAA AGGACGCACC CCAAAATGGA ACACCTAACC CGGAGGTTAT GGCGCGTTTT
CGTGGGGTGA TGGACGACGA TTTCAATACA CCGGAAGCAT TGGCGATCGT GTTCGAGCAG
ATCAGCGAAT TGAACCGCAG CAAGGATGCC AATTGCGCCG CGACGATAAA GGCCATCGGT
GAAATCCTGA ATCTCGGACA GCACGATCCG GAAAGTTTTC TGCGTTGGGC ACCATCCTCT
TCGGATCAAT TGAGCGACGA GGCGATTGAA CAGAAAATCG CCGAACGTGC TAGCGCGCGC
GCCAACAAGG ATTTTGCGGC ATCGGACCGG ATTCGCGATG AGCTTCAGGC CGCAGGCATC
GTGCTCGAAG ACAAGGCCGG GCAAACCACT TGGCGGCGCG GCTGA
 
Protein sequence
MLVIYDSHAR AKREFVPIKP KTVGMYVCGM TVYDRCHIGH ARVMVVFDMV VRYFRASGYE 
VHYVRNITDI DDKIIARAAE NKESIRSLTE RYIEAMHDDA AALGCLLPTA EPRATESVGD
MVAMIESLIE RGHAYRAANG DVYFDVSTFD RYGELSNRNP DDLRAGARVE IDEAKTDPLD
FVLWKAEKPG DPSWDAPFGP GRPGWHIECS AMSTRALGAT FDIHGGGQDL QFPHHENEIA
QSECATGHHY VNYWMHNGFV RINEEKMSKS LGNFFTVAEV MRQYHPEVIR LFVLSSHYRS
PLNYSDQNLD AARASLTRWY TAIKDAPQNG TPNPEVMARF RGVMDDDFNT PEALAIVFEQ
ISELNRSKDA NCAATIKAIG EILNLGQHDP ESFLRWAPSS SDQLSDEAIE QKIAERASAR
ANKDFAASDR IRDELQAAGI VLEDKAGQTT WRRG