Gene HS_0461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0461 
Symbol 
ID4239943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp491711 
End bp493087 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content36% 
IMG OID638104009 
ProductD-alanine glycine permease 
Protein accessionYP_718672 
Protein GI113460606 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1115] Na+/alanine symporter 
TIGRFAM ID[TIGR00835] amino acid carrier protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0104177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTAG AGATAATTTT ATCGTCTATC AATACTTTTG TTTGGGGACC TCCCTTACTT 
CTTCTTTTAT TTGGTACAGG TCTTTACTTA ACCTTACGCT TAGGATTTAT TCAAATTCGT
TACTTACCTC GTGCATTATA TTATTTATTT GATAAAGAAC GTCATGTAAG TAAAAAAGGG
GATATTTCTG CTTTCGCAGC CCTTTGTACG GCTTTAGCGG CAACAATAGG TACGGGAAAT
ATTGTTGGTG TTGCAACTGC TTTGCAAGCC GGTGGGCCAG GGGCTATTTT TTGGATGTGG
TTAGTTGCTT TACTTGGAAT GGCAACTAAA TATGCTGAGT GTTTATTAGC GGTGAAATAT
CGGGTGCGAG ATAAACAAGG TTTTATGAGT GGCGGACCAA TGTACTATAT TGAGAAAGGC
TTAGGAATAA AATGGCTTGC CAAATTTTTT GCCGTCTGTG GTGTATTAGT TGCATTTTTG
GGCATTGGCA CTTTTCCTCA AATTAATGCA ATCACGCATG CTATGCAAGA TACCTTTAAC
GTATCTATTG AAATTACAGC AACCATTATT ACCGCACTTA TTGCACTTAT TATCTTAGGC
GGTGTTAAAC GCATTTCAAC AGTTGCATCT ATTATTGTTC CTTTTATGGC TGTTTTATAT
GTACTTACGT CTTTGTTAAT TCTTATATTA AATTGGCAAC AAGTACCAAC CGCTATTGGA
CTAATTATTT ACAGTGCATT TAATCCTCAA GCAGCATTAG GCGGTGTATT TGGATATACT
GTTCTGAAAG CTATTCAATC AGGATTTGCA AGGGGTATTT TCTCTAATGA ATCCGGATTA
GGTAGTGCAC CAATTGCGGC GGCAGCAGCT CAAACTAAAG AACCTGTTCG TCAAGGATTA
ATTTCGATGA TAGGTACTTT TTTAGATACT ATCGTTGTGT GTACTATGAC AGGTATCGTG
CTTGTTTTAA CAGGTGCTTG GCAATCACAA GAATTGGCTG GTGCCGCTTT AACTAACTAT
GCATTTTCAC AAGGGTTAGG AAATAATATC GGTGCAACAA TTGTAACTGT AGGGTTACTG
TTTTTTGCTT TTACCACTAT TTTAGGTTGG TGTTATTACG GAGAACGTTG TTTTGTTTAT
CTGGCAGGTG TCAAAGGTAT CAAAGTTTAT CGTTCAATTT TTATACTTTT AGTTGCCTGT
GGTGCTTTTA TTAAACTGGA ATTAATTTGG ATTTTGGCTG ATATTGTGAA TGGTTTAATG
GCATTTCCAA ATTTAATTGC TTTAGTTGGA TTGCGACATA TTGTGATCAA TGAAACTAAA
GATTATTTTA TTCGCAGGAA ACAAATATTT GTTGAGCCAA ATAATGTTGC TAATTGA
 
Protein sequence
MSLEIILSSI NTFVWGPPLL LLLFGTGLYL TLRLGFIQIR YLPRALYYLF DKERHVSKKG 
DISAFAALCT ALAATIGTGN IVGVATALQA GGPGAIFWMW LVALLGMATK YAECLLAVKY
RVRDKQGFMS GGPMYYIEKG LGIKWLAKFF AVCGVLVAFL GIGTFPQINA ITHAMQDTFN
VSIEITATII TALIALIILG GVKRISTVAS IIVPFMAVLY VLTSLLILIL NWQQVPTAIG
LIIYSAFNPQ AALGGVFGYT VLKAIQSGFA RGIFSNESGL GSAPIAAAAA QTKEPVRQGL
ISMIGTFLDT IVVCTMTGIV LVLTGAWQSQ ELAGAALTNY AFSQGLGNNI GATIVTVGLL
FFAFTTILGW CYYGERCFVY LAGVKGIKVY RSIFILLVAC GAFIKLELIW ILADIVNGLM
AFPNLIALVG LRHIVINETK DYFIRRKQIF VEPNNVAN