Gene HS_0271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0271 
Symbol 
ID4239472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp269211 
End bp270416 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content39% 
IMG OID638103811 
Producthypothetical protein 
Protein accessionYP_718479 
Protein GI113460417 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATT CCCAAACAAT TATTATCGGT GCAGGTGCGG CAGGCTTGTT TTGTGCTGGT 
CAACTAGGTA AAGCCAAGCA CCAAGTAACG ATTTTAGATA ATGGAAAAAA AGTGGGACGT
AAAATTTTAA TGTCCGGTGG CGGATTTTGT AATTTTACCA ACCTTAATGT TACCCCTCAG
CATTACCTTT CGCAAAATCC TCATTTCGTC AAGTCTGCTC TAGCTCGCTT TACCCAATGG
GATTTTATTG CCCTCATAAC TGACTATGGT ATTAGCTATT ATGAAAAAGA ATCGGGGCAA
CTTTTTTGTC ATAACAGTGC TGAAGATATT GTCAATATGC TACGAGCAGA ATGCGATAAA
TTTCAAGTTA ATACCCAATT ACGTCAACAG ATCGAATACA TTGAAAAATT AGACGGTAAT
GAAAAAGCTC GTTTTCAATT GCAATCTGAC GGAAAAATTT GGCAATGTGA AAATTTAGTT
ATTGCTACCG GCGGATTGTC TATGCCAGCC TTGAGTTCGA CACCTTTCGG TTACCAAGTT
GCCGAACAAT TCGGACTCAA TGTTATTGCT CCACGAGCTG GGCTTGTACC CTTCACATGG
CGTGAAATTG ATAAATTTTA CACCGCACTT TCAGGCATCT CCCTTCCTGT TTGTGTAACC
GCAAAGTGCG GTCAATCTTT CAGCAATAAT TTACTCTTCA CTCATCGGGG GATATCGGGT
CCCGCTATTT TACAAATTTC AAATTATTGG CAAGTGAATG AAAATATAGA AATTGATCTG
TTACCGATGG AAAATATTCA ATTTTTCCTT AACAAGTTAC GTCAGACCTC ACCTAAATTA
CATTTAAAAA CCGCCTTATC ACGCCTATTA CCGCAAAAAC TAGTGGATTT ATGGTTAAAT
CACAACATTA TTCAAGATGA AGTGATTGCC AACCTCAGTA AAGTGCAGTT AAAAAATCTT
GATGATTTAA TTCATCATTG GCAAATTCAA CCTAACGGCA CCGAAGGTTA TCGCACCGCA
GAAGTCACTA TTGGAGGCGT AGATACGCAA GAAATCTCAT CAAAAACGAT GGAAAGTCAT
AAAATCAAAG GTCTTTATTT TATTGGTGAA GTACTTGATG TTACCGGTCG GCTTGGCGGT
TATAACTTCC AATGGGCATG GAGCTCAGCC TATGTGTGTG CAAATGGGAT TTTGAGCAAT
AAATAG
 
Protein sequence
MNYSQTIIIG AGAAGLFCAG QLGKAKHQVT ILDNGKKVGR KILMSGGGFC NFTNLNVTPQ 
HYLSQNPHFV KSALARFTQW DFIALITDYG ISYYEKESGQ LFCHNSAEDI VNMLRAECDK
FQVNTQLRQQ IEYIEKLDGN EKARFQLQSD GKIWQCENLV IATGGLSMPA LSSTPFGYQV
AEQFGLNVIA PRAGLVPFTW REIDKFYTAL SGISLPVCVT AKCGQSFSNN LLFTHRGISG
PAILQISNYW QVNENIEIDL LPMENIQFFL NKLRQTSPKL HLKTALSRLL PQKLVDLWLN
HNIIQDEVIA NLSKVQLKNL DDLIHHWQIQ PNGTEGYRTA EVTIGGVDTQ EISSKTMESH
KIKGLYFIGE VLDVTGRLGG YNFQWAWSSA YVCANGILSN K