Gene Information |
Locus tag | Hneap_1945 |
Symbol | |
ID | 8535103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2080688 |
End bp | 2081911 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 646384326 |
Product | hypothetical protein |
Protein accession | YP_003263814 |
Protein GI | 261856531 |
COG category | [H] Coenzyme transport and metabolism [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) |
TIGRFAM ID | |
|
|
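The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (the convention that reproduces the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Hneap_1945.
length_bp = gene_length(2080688, 2081911)
print(length_bp)                  # 1224
print(protein_length(length_bp))  # 407
```

Both results match the Gene Length (1224 bp) and Protein Length (407 aa) fields listed in the record.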
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00538103 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAG AAGACCACGT GGACATTCCC TCCACCCCTT TCTTGGGGTT GGCTCCCTTT CTGAGAGCCA GCATCGAAGG GCAGGATTTG CAGGCATTTG CCAGAGCGGC TCTAGGCAAC CTGAACCAAT TCCCGCAGGA TGCAAACCTC TGGATGAACC TGTCCACTTT GATGTTCAGT CTGGGGCAAC GTGAATCCGC ATTTGCCACG CTGAATCAGG GATTATCGCT ACAACGCAGC TTTGAGATTC CTGCATTGGC GCAGCCGAGC AGCTTCAGCG TTCTTATGCT GATGGTTCCG GGTGACATCG CGGCCAATAC ACCGCTGGAT TGCTTGCTCG AAGGCAGTGA TATTGATCTG ATTTGCCACT ACTGCGCACT CGACGCGCTC CTACCCGACC CGTTACCGGC TCATGATGCT GTTTTCGTTG CGATAGGCGA TGCCCCACAG CACCGCGCAC TTCTGACCGA ACTGGCCGCC GCGCTCCAAG CTTGGCCCGT GCGCGTGTTG AATTCCCCGC AAGCCATCCC GAATACCCAA CGGGACACGG CCTGCCAGAT ACTGCAGGAC ATCCCGGGCT TACTCATACC GACGACGTAC CGCACGTCCC GAGCACAACT GACAGCCATT GCCGAAACCT CACACACCCT ATCGGTCATT GCCGCTGGCC TCGACTTCCC GCTGATCGTG CGCCCGCTCG ACTCCCACGC CGGTCGTGAT CTTGAACGCG TGGCAGATAG AGCTGCATTA CAGCGCTATC TGGATCAAGT TTCCTCAACA GAGTTCTTCA TCGCGCCCTT CATCGACTAC AGCGGATCCG ATGGCCTGTT TCGCAAGTTC CGCATCGCAC TGGTCGATGG AAAACCCTTT GCCGTTCACA TGGCTGTTTC CGCGCACTGG ATGATCCATT ACGTCAACGC GGGCATGTAT GAAGATGCGG AAAAACGGCA GGAAGAAGCA CGCTTCATGA TCAATTTTGA CGCATTCATC GCCCGACATG GCCAGGCGCT CGACATGATC GCGGAACGAA TGGGGTTGGA CTATGTCTTG TTTGACGGCG CCGAAACGCA AGCAGGAGAT CTGCTTATTT TCGAAATCGA TCATGTGATG GTCGTGCACG CCATGGACCC GGTTGAGTTG TTCCCTTATA AACGAGAGCC GATCCAGCAA ATTCAGACCG CCTTTCGTCA GTTATTGGCA ACATCCACCG CTGAAACGGG ATAA
|
Protein sequence | MTEEDHVDIP STPFLGLAPF LRASIEGQDL QAFARAALGN LNQFPQDANL WMNLSTLMFS LGQRESAFAT LNQGLSLQRS FEIPALAQPS SFSVLMLMVP GDIAANTPLD CLLEGSDIDL ICHYCALDAL LPDPLPAHDA VFVAIGDAPQ HRALLTELAA ALQAWPVRVL NSPQAIPNTQ RDTACQILQD IPGLLIPTTY RTSRAQLTAI AETSHTLSVI AAGLDFPLIV RPLDSHAGRD LERVADRAAL QRYLDQVSST EFFIAPFIDY SGSDGLFRKF RIALVDGKPF AVHMAVSAHW MIHYVNAGMY EDAEKRQEEA RFMINFDAFI ARHGQALDMI AERMGLDYVL FDGAETQAGD LLIFEIDHVM VVHAMDPVEL FPYKREPIQQ IQTAFRQLLA TSTAETG
|
| |
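The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as the GC content field. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_percent` are hypothetical helper names, and the rounding used by the database to report "55%" is not known from this record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence, used here only as a small example.
fragment = clean_sequence("ATGACTGAAG AAGACCACGT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 45.0
```

Applying `gc_percent` to the full cleaned gene sequence would give the value to compare against the GC content field.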