Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0411 |
Symbol | gcp |
ID | 4239887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 436731 |
End bp | 437759 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638103954 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_718621 |
Protein GI | 113460557 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTATTC TAGGCATAGA AACATCTTGT GATGAAACAG GCGTTGCCAT TTACGATGAA AAAAAAGGAT TGATTGCCAA TCAACTGTAT ACTCAAATTG CTTTACATGC TGATTATGGC GGTGTTGTCC CTGAATTAGC GTCTCGTGAT CATATCCGTA AAACCGCACC ATTAATTCAG GCGGCTTTAC AACAAGCGGG ATTAGAAGCA AAAGACATTG ATGGCATAGC TTACACTTGT GGACCGGGAT TGGTGGGTGC ACTTTTAGTA GGTTCAACAA TTGCACGCTC GCTTGCCTAT GCTTGGAACA TTAAAGCTAT TGGTGTACAT CATATGGAAG GGCATTTACT TGCCCCTATG TTAGAAAATA ATCCCCCAAA ATTTCCTTTT GTAGCGTTAT TAGTATCCGG CGGACATACA CAGCTTGTTC GTGTTAATGC TGTAGGGCAA TATGAATTAC TGGGAGAAAG TATTGACGAT GCTGCCGGTG AAGCATTTGA TAAAACGGCA AAATTATTAG GATTGGATTA CCCTGGAGGG AGTGCACTTT CACGTTTGGC AGAAAAAGGG AATCCCGAAC GTTTTTTCTT TCCTCGTCCT ATGACAGATC GCCCCGGCTT AGATTTTAGT TTTTCAGGGT TAAAAACTTT TGCCGCCAAT ACAATTAATC AAGCAATTAA ACAAGAAGGT GAACTCACTG AACAAACTAA AGCAGATATC GCCTACGCAT TCCAACAAGC AGTAGTAGAT ACGTTAGCAA TTAAATGTCG TCGAGCTTTA AAAGAAACAG GCTTTAAACG CTTAGTCATT GCAGGCGGTG TGAGTGCAAA TAAACAATTA CGTCAATCTT TAGCGGATAT GATGAAACAA TTAAAAGGAG AGGTCTTTTA CCCTCAACCT CAATTTTGTA CAGATAACGG TGCAATGATT GCTTACGTCG GTTTTTTGCG TCTAAAACAA GGTGAATATT CACCTTTGGA AATTGACGTT AAACCCCGCT GGGCAATGAC TGAACTGAAA GCAATTTAA
|
Protein sequence | MRILGIETSC DETGVAIYDE KKGLIANQLY TQIALHADYG GVVPELASRD HIRKTAPLIQ AALQQAGLEA KDIDGIAYTC GPGLVGALLV GSTIARSLAY AWNIKAIGVH HMEGHLLAPM LENNPPKFPF VALLVSGGHT QLVRVNAVGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG SALSRLAEKG NPERFFFPRP MTDRPGLDFS FSGLKTFAAN TINQAIKQEG ELTEQTKADI AYAFQQAVVD TLAIKCRRAL KETGFKRLVI AGGVSANKQL RQSLADMMKQ LKGEVFYPQP QFCTDNGAMI AYVGFLRLKQ GEYSPLEIDV KPRWAMTELK AI
|
| |