Gene HS_0427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0427 
SymbolvirE 
ID4239903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp454967 
End bp456397 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content33% 
IMG OID638103970 
Productvirulence-associated protein E 
Protein accessionYP_718637 
Protein GI113460573 
COG category[R] General function prediction only 
COG ID[COG5545] Predicted P-loop ATPase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTATCA CAGAAAAAAG AAAGGATTTT ATCTTGAATA GTTGGATTAA GATTTCCGAT 
GATTGCTTAA AAGAATACCT ATCAAGAGAG TTAGAACTGA CAGATAAGGG ACAAGTGAAA
AGTACAACTA CAAATATTAT TACAGCGATT GTAAATCCAG ATTATTGTGT TAGTCATAAG
ATTTTAAACG GTAGCGTGTT TTTTGACACA TGCTCACAGA CGATTAGGCT TATAGGTTCA
ATTAAAGGGG AAAGTGAAGT TAATTTAAAA GCTCCAAGAA AATGGACGGA TCAACTTACT
AATCTACTCG GTGTAGAAAT TGAAATGAAC TTTGGTATTA AATATTCTAA AGGCAGAATG
GAAGAAGCGG TAATCTTCAT TGCAAATAAA AGAAGAGTTA ATTTACCAAA ATTATATATG
AAATCTCTTA AGTATGATGG TGAAGATTAT ATTTCAAAGC TGCTTCCTAA ATATCTTGGT
GTAGATGATA CAGCTCTTAA TCGTTGGATT ATGGAACATA TGCTGATTGG AATGGTAAAT
AGAGTGTTTT ATCCTGGATG TAAATTTGAT GAGATTATGG TTCTTACTGG TGAGCAAGGC
GTTGGTAAAA CGTCTTTTAT AGAAAAATTG GCACTACTTC CTGATTGGTA TTGTTCCCTA
AATAATATCA AAGGTAAGGA CGCTGTAAGT AATCTAGTAG GTAAAATTGT AGTAGAGCTT
GAAGAGTTTG TTGCCCTTAA AAATGCCAAG ACAGCAGATG AAGCAAAGCT ATTTATTTCT
ACGAGAACTA GCACAGTAAG ATTGTCTTAT GAGAGATTTT CGGCTGATGT AGATAGAACA
TGTATCATGA TTGCTACAAC AAACGACATG ACTTTCTTAG GAGATTTTTC TGGAGAAAGA
AGATATTTAC CTGTGCAAGT TCATAAAGAA AAAGTTGGAT TGCCTGTAAT GTATGACCAA
GAGAAATTTC CACAATTAAA AGGTGTAAGC AGAGAAGAAT ATTCAAAAAT AGTAAAGAAA
GACTTTGAAG GAGCAGTAGC TCAAGCGGTG TATCTTTTTG AAAATAAACT ATATAGTCCA
GTGCTTCCGG TAGAGCTAAG AAAAGATTTA AATCAAGTAA TACAAATGCA CAAGAACGAA
AACCGACATG TGCAAAATTT CTTTGAGTTT ATGGATTGGA AAGATACAAA ATCAGATACA
CCAAATCGTG TTTGTTCTGG AGAGTTTTTA TCCCAGTATC CACAAACTAA TGAAAAAGTA
TTTGCAGAAT TGATGGCAAA TGAAATGGCT GATAAATGGG AATTAGAGCC GACAGATAAA
AGCAGGAAGT TTAAGATTGA TGGCAGAGTA AGGGTGAGTA AGAAGTTTTA TGTAAGAAAG
AATATGCCTG ATTTTATAGA AGTTACAGAT GATATTGAAA TACCATTTTA G
 
Protein sequence
MTITEKRKDF ILNSWIKISD DCLKEYLSRE LELTDKGQVK STTTNIITAI VNPDYCVSHK 
ILNGSVFFDT CSQTIRLIGS IKGESEVNLK APRKWTDQLT NLLGVEIEMN FGIKYSKGRM
EEAVIFIANK RRVNLPKLYM KSLKYDGEDY ISKLLPKYLG VDDTALNRWI MEHMLIGMVN
RVFYPGCKFD EIMVLTGEQG VGKTSFIEKL ALLPDWYCSL NNIKGKDAVS NLVGKIVVEL
EEFVALKNAK TADEAKLFIS TRTSTVRLSY ERFSADVDRT CIMIATTNDM TFLGDFSGER
RYLPVQVHKE KVGLPVMYDQ EKFPQLKGVS REEYSKIVKK DFEGAVAQAV YLFENKLYSP
VLPVELRKDL NQVIQMHKNE NRHVQNFFEF MDWKDTKSDT PNRVCSGEFL SQYPQTNEKV
FAELMANEMA DKWELEPTDK SRKFKIDGRV RVSKKFYVRK NMPDFIEVTD DIEIPF