Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1998 |
Symbol | |
ID | 8535157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2141741 |
End bp | 2143078 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 646384380 |
Product | protein of unknown function DUF945 |
Protein accession | YP_003263867 |
Protein GI | 261856584 |
COG category | [S] Function unknown |
COG ID | [COG5339] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC TGGCGATAGG ACTGGGTGTG GTAATTGTCG CCGGCATCGC GGCCGCACCG TACGTTTCCG GCATGATGAT CGAAAAACGT ATGCAATCGG TCAGTGCGCT TCCGGGGCTC TCTGATGGCA TCACCTGGTC GCTTGACTCA TATCAGCGAG GCTATTTGAG TTCGACAGCA ACCTCTCATG TCACACTGGC TGGCGCAGAT GGCACACGCT ATCTCATTCA TTTCAAGCAC ACCATCAATC AGATTCCCGG GGTGGATGGT CGGTATGCCA CGATCCAAAC CGTTTGGGTG CCGGATACGG AAATCAAGCC TCAGGTTGAG CAACTCTTTT CAGGGAAAGA GCCGGTCGTG CTGAACACGG CGCTGAGCGT GTTCGGTGGT GCGCACACCG AGGGTGAATT TGCGCCCATC AACCAACCTC AAGTCACCTT TAGTGGCGGC ACGATGACAA TCGATGCCGC GGCCAGCGGC AAGTTCGCCT ATACCGGCGC GTTTGATTCG CTGAATATCA CCGGTCAGAA AGACGATAAG GGCATGCCTC AATCGGCTGC CTTCAAGGGT ATTACTCTTG ATGCCGATGG GGTGATGGAC AAGAAAAGCC ACATTGCCTG GAACAGCCAG TTTGCAATGA AAGTTGCTTC GTTGACGGTG GGCAACGAGG GCGCTCTTTC CGGATTGGCT CTGACGAGTC ACTCTTTGCG GACCGGCGAT GATTTCGCCG TTGATGTCGG CTTGGATGTG GCTAATGCTG ATTTTTCTGC CGCGCCGCCT GCCTTCCGCA CGATGAAGGA TCTGAAATTC AAATACGGCA TTTCGCGGGT CGACGCCCCC GCGCTTGAAG ACATTGTTAA GCAGGCACAG CTTGCGCAAA AGCAGACCAT GGGTGATCCG GACAAAATCA AACAGGCGGT ATCCATGAGC GTGATGACGC ATCTGCCCGC CTTGCTGAAT GCGGGGCCGA AATTTGAGAT TGATCCGATC AGCTTCAAAT TGCCGGATGG TACCGTGGCG CTGCATTTCT CCGCGGAGTT GCCACCGGGG CATGGCAAGG AAGGGATGAA TAATCCGATG TCGCTGCTTA ATCTTCTCGA CATGAAAGGG GATTTCAGCG TTCCCGAAGC GGTTTATCAG GCGGCTCAGA CTGAAGCCGG GCCAGATCGC CAAGCGGTGA ATGAGCAGCA ACTGCAGCAA ATGGTGCAAA AAGGGTACAT CACGCAGTCC AATGGCATGT TGTCGACCAA TTTTGCCTTC AAGGCCGGAC AGCTCACCAT CAACGGCTTG CCAGCCAATG ATTTGCTGGG CGTGATGTCA GCCATGTCTG CACGGTAA
|
Protein sequence | MNKLAIGLGV VIVAGIAAAP YVSGMMIEKR MQSVSALPGL SDGITWSLDS YQRGYLSSTA TSHVTLAGAD GTRYLIHFKH TINQIPGVDG RYATIQTVWV PDTEIKPQVE QLFSGKEPVV LNTALSVFGG AHTEGEFAPI NQPQVTFSGG TMTIDAAASG KFAYTGAFDS LNITGQKDDK GMPQSAAFKG ITLDADGVMD KKSHIAWNSQ FAMKVASLTV GNEGALSGLA LTSHSLRTGD DFAVDVGLDV ANADFSAAPP AFRTMKDLKF KYGISRVDAP ALEDIVKQAQ LAQKQTMGDP DKIKQAVSMS VMTHLPALLN AGPKFEIDPI SFKLPDGTVA LHFSAELPPG HGKEGMNNPM SLLNLLDMKG DFSVPEAVYQ AAQTEAGPDR QAVNEQQLQQ MVQKGYITQS NGMLSTNFAF KAGQLTINGL PANDLLGVMS AMSAR
|
| |