Gene HS_0411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0411 
Symbolgcp 
ID4239887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp436731 
End bp437759 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content41% 
IMG OID638103954 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_718621 
Protein GI113460557 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATTC TAGGCATAGA AACATCTTGT GATGAAACAG GCGTTGCCAT TTACGATGAA 
AAAAAAGGAT TGATTGCCAA TCAACTGTAT ACTCAAATTG CTTTACATGC TGATTATGGC
GGTGTTGTCC CTGAATTAGC GTCTCGTGAT CATATCCGTA AAACCGCACC ATTAATTCAG
GCGGCTTTAC AACAAGCGGG ATTAGAAGCA AAAGACATTG ATGGCATAGC TTACACTTGT
GGACCGGGAT TGGTGGGTGC ACTTTTAGTA GGTTCAACAA TTGCACGCTC GCTTGCCTAT
GCTTGGAACA TTAAAGCTAT TGGTGTACAT CATATGGAAG GGCATTTACT TGCCCCTATG
TTAGAAAATA ATCCCCCAAA ATTTCCTTTT GTAGCGTTAT TAGTATCCGG CGGACATACA
CAGCTTGTTC GTGTTAATGC TGTAGGGCAA TATGAATTAC TGGGAGAAAG TATTGACGAT
GCTGCCGGTG AAGCATTTGA TAAAACGGCA AAATTATTAG GATTGGATTA CCCTGGAGGG
AGTGCACTTT CACGTTTGGC AGAAAAAGGG AATCCCGAAC GTTTTTTCTT TCCTCGTCCT
ATGACAGATC GCCCCGGCTT AGATTTTAGT TTTTCAGGGT TAAAAACTTT TGCCGCCAAT
ACAATTAATC AAGCAATTAA ACAAGAAGGT GAACTCACTG AACAAACTAA AGCAGATATC
GCCTACGCAT TCCAACAAGC AGTAGTAGAT ACGTTAGCAA TTAAATGTCG TCGAGCTTTA
AAAGAAACAG GCTTTAAACG CTTAGTCATT GCAGGCGGTG TGAGTGCAAA TAAACAATTA
CGTCAATCTT TAGCGGATAT GATGAAACAA TTAAAAGGAG AGGTCTTTTA CCCTCAACCT
CAATTTTGTA CAGATAACGG TGCAATGATT GCTTACGTCG GTTTTTTGCG TCTAAAACAA
GGTGAATATT CACCTTTGGA AATTGACGTT AAACCCCGCT GGGCAATGAC TGAACTGAAA
GCAATTTAA
 
Protein sequence
MRILGIETSC DETGVAIYDE KKGLIANQLY TQIALHADYG GVVPELASRD HIRKTAPLIQ 
AALQQAGLEA KDIDGIAYTC GPGLVGALLV GSTIARSLAY AWNIKAIGVH HMEGHLLAPM
LENNPPKFPF VALLVSGGHT QLVRVNAVGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG
SALSRLAEKG NPERFFFPRP MTDRPGLDFS FSGLKTFAAN TINQAIKQEG ELTEQTKADI
AYAFQQAVVD TLAIKCRRAL KETGFKRLVI AGGVSANKQL RQSLADMMKQ LKGEVFYPQP
QFCTDNGAMI AYVGFLRLKQ GEYSPLEIDV KPRWAMTELK AI