Gene Information |
Locus tag | Hneap_0882 |
Symbol | |
ID | 8534023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 951612 |
End bp | 953570 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646383267 |
Product | peptidase U32 |
Protein accession | YP_003262772 |
Protein GI | 261855489 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
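The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length fits that convention) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 951612
end_bp = 953570
gene_length_bp = 1959
protein_length_aa = 652

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
encoded_residues = codons - 1

print(span_bp, codons, encoded_residues)  # 1959 653 652
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.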
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGCG GCCAACTTGA ATTGCTCGCC CCCGCACGGG ATGCGGACAT CGGCATCGAA GCCATTAACC ACGGCGCCGA TGCGGTCTAT ATCGGCGGGC CCGGTTTTGG TGCGCGCAAA AGTGCGGATA ACGACGTGGC CGATATCGCC CGTTTGGTCG ACTATGCGCA CCGTTTTCAT GCGCGCGTCC ACGTCACGCT CAACACCATC CTGCGGGATG ACGAGCTTGA ACCAGCCCGC AAGCTGGCGC ACCAACTCTA CGATGCGGGC GTCGATTCGC TCATTATTCA AGACATGGGC TTGCTCGAAC TCGACCTGCC GCCCATCCAG TTGCATGCCA GTACCCAATG CGACATCCGT ACTCCCGAGA AGGCGCGCTT TTTGCAAGAT GCGGGGCTGT CGCAACTGGT GCTGGCGCGC GAGCTGAGCC TGGAGCAGAT CAAGGCCATT CGTTCTGCTA CCACCGTGCC CTTGGAGTTT TTCATCCACG GCGCGTTGTG TGTGGCGTAT TCGGGGCAGT GCTTCATTTC AGAGGCGCAT ACTGGTCGCA GCGCCAATCG CGGTGAGTGC AATCAGGCTT GTCGCCTGCC GTATGAAGTG CGCGACGCCG AGGGTCGCAT TATCGCCCAC GACAAGCATG TGCTGTCGAT GAAAGACAAC GATCAGAGCG CCAACCTCGC CGCATTGATC GATGCGGGCA TCCAGAGCTT CAAGATCGAA GGCCGTTACA AAGACATGGG CTATGTGAAA AACATCACCG CCCACTATCG CGGCTTGCTG GACGGCATTC TCGAACAGCG CCCTGAGTTG AGCCACACGT CCAGTGGGCA CAGTACATTC AACTTCACGC CCGATCCCGC GCAGAACTTC AACCGCGACG CAACCGACTA TTTCGTGCAG GGTCGCAGAG AAGACATTGG TGCGTTCGAC TCGCCCAAGA ACCCCGGCCG TCGGATTGGC TGGGTGAACA AAATCGGCAA AGATACCGTC GAGATCGAAA CCGATGCCGA CATGTTGCTC AATAACGGCG ACGGCCTGTG CTATTTCGAC CTGCACAAGG AACTGGTCGG CATGGCGATC AATACCGCCG AAAAAATAGC GGCTGGCCGC TGGCGGGTTG TGCCCAAAGA TCCGATTGCA GAGCTGCGCC ACCTGAAAAT CGGCACCGAA CTCAATCGCA ACCGCGATAT GAACTGGCAG CGGCTCATGG ACAAAAAATC CGCCGAGCGC CGTATCGATG TGCGCATGCG GCTGGATGAA ACCGCCGATG GCCTCGCGCT GAACCTGACC GACGAAGGCG GTTGCAGCGC CACAGCAACG CTTGCCATCG CCAAGGAGCC CGCCAAGGAC GCATCCCGCA GCGAGAGCAG CCTGCGCGAG AACCTTGGCA AGCTGGGCAC CACGATTTTC AATCCCGTAG AAATCGTGCT CAACCTTAGC CAGCCCTGGT TCGTGCCCGC CTCGCTCGTC AATAACCTGC GCCGAGAAGG CATCGAGGCC CTTGAGGCCG CGCGCAAGGC CGCGTTGCAA CGCCTGCCCC GTGCCAAACC GGTGGACCCG CCGGTGCCGT ACCCCGAAGA CACCCTCAGC TACCTTGCCA ACGTGTTCAA CCACAAGGCG CGTGATTTCT ACGCCAAGCA TGGTGTGAAG TTGATTGAAG CCGCCTTCGA AGCACACGAG GAAGCGGGCG ACGTCTCGCT GATGATCACC AAGCACTGCG TGCGCTATTC ACTCAGCCTC TGCCCCAAAC AGACCCGCGG CGTGACTGGC GTGCATGGCA CCATCAAGGC CGAACCACTC 
GAACTGATCA ACGGCAGCGA AAAGCTCAAA CTGGTGTTCG ATTGCAAACC CTGCGAAATG CACGTCGTCG GTAAACTCAA GCGGAGCGTG GCGCAATCGA AGGTTAAAGA AGTCCAGACT GCGCCTGTTC GGTTTTACAA GGCGCGGCCG TCCGTTTAA
|
Protein sequence | MNSGQLELLA PARDADIGIE AINHGADAVY IGGPGFGARK SADNDVADIA RLVDYAHRFH ARVHVTLNTI LRDDELEPAR KLAHQLYDAG VDSLIIQDMG LLELDLPPIQ LHASTQCDIR TPEKARFLQD AGLSQLVLAR ELSLEQIKAI RSATTVPLEF FIHGALCVAY SGQCFISEAH TGRSANRGEC NQACRLPYEV RDAEGRIIAH DKHVLSMKDN DQSANLAALI DAGIQSFKIE GRYKDMGYVK NITAHYRGLL DGILEQRPEL SHTSSGHSTF NFTPDPAQNF NRDATDYFVQ GRREDIGAFD SPKNPGRRIG WVNKIGKDTV EIETDADMLL NNGDGLCYFD LHKELVGMAI NTAEKIAAGR WRVVPKDPIA ELRHLKIGTE LNRNRDMNWQ RLMDKKSAER RIDVRMRLDE TADGLALNLT DEGGCSATAT LAIAKEPAKD ASRSESSLRE NLGKLGTTIF NPVEIVLNLS QPWFVPASLV NNLRREGIEA LEAARKAALQ RLPRAKPVDP PVPYPEDTLS YLANVFNHKA RDFYAKHGVK LIEAAFEAHE EAGDVSLMIT KHCVRYSLSL CPKQTRGVTG VHGTIKAEPL ELINGSEKLK LVFDCKPCEM HVVGKLKRSV AQSKVKEVQT APVRFYKARP SV
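As a rough consistency check between the two sequences above, the sketch below translates the first 30 and the last 9 nucleotides of the gene sequence and compares them with the ends of the protein sequence. It builds the codon table from the compact 64-letter amino acid string used for the standard code; translation table 11 assigns the same amino acids to codons and differs only in the permitted start codons. The names `translate` and `CODON_TABLE` are illustrative, not part of any library.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
# Amino acid for each codon in that order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final 9 bases, copied from the gene sequence.
first_30 = "ATGAATAGCG GCCAACTTGA ATTGCTCGCC"
last_9 = "TCCGTTTAA"

print(translate(first_30))  # MNSGQLELLA
print(translate(last_9))    # SV*
```

The first ten residues match the start of the protein sequence, and the gene ends with the codons for S and V followed by a TAA stop, matching the protein's final residues.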