Gene HS_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1039 
Symbol 
ID4240537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1145754 
End bp1147193 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content38% 
IMG OID638104600 
Productamino acid carrier protein 
Protein accessionYP_719251 
Protein GI113461182 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1115] Na+/alanine symporter 
TIGRFAM ID[TIGR00835] amino acid carrier protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTCAA CTATAATAAT AAACATTGAA CAAGCACTAA GTTGGTTTGT TGATAAATTT 
GATGGACCAC TATGGGATTT AGCGACTATT ACATTACTTG GTGTAGGTGT GTTTTTTACG
TTAGCAACCG GCTTTATTCA ATTACGTTTA TTTCCACAAA GTTTGCGTGA AATGTGGTTT
GGACGGGCAG TTCAAGGACA ATCTTTAACG CCATTTCAGG CATTTACTAC TGGTTTAGCC
AGCCGTGTAG GTGTGGGTAA TATCGGAGGC GTTGCAACGG CAATAGCACT TGGCGGGGAA
GGTGCTGTCT TTTGGATGTG GGTAACGGCG TTGATTGGAA TGTCCAGTGC ATTTGCTGAA
TCTTCTTTGG CTCAGTTATT TAAAGTTAAA GATAAAAACG GGTTATTTCG TGGCGGACCT
GCCTATTATA TTACAAGAGG ATTGAAAGCT CCTTGGTTGG CAGTATGTTT TGCAATTGCT
TTAATATTTA CGTTCGGATT TGCGTTTAAT GCAGTTCAAT CCAATGCTAT TGTTGAGGCA
ACAAAAAATG CTTGGAAATG GCAACCGCAC TATGTCGGGG TAACTCTCGT TATCGTAACG
GGGTTAATTA TTTTTGGTGG AGTGAAACGT ATTGGTAAAG TTTCTGCACA AATTGTTCCT
ATGATGGCGT TGTTTTATCT AATCATTGCG GTGATTATTT TAGGTATGAA TATTGAAATG
GTACCGACAG TGATAAGCCG TATTATTCAA AGTGCTTTTA ATTTTGACGC AATGGCTGGT
GGTATGTTCG GTGCAATTTT TTCTAAAGCT ATGTTAATGG GAATTAAACG AGGACTTTTC
TCTAATGAGG CAGGTATGGG GTCTGCACCA AATGCGGCAG CATCGGCAGA TGTTAAGCAC
CCTGTAAGTC AAGGGTTGAT CCAAATGCTG GGTGTGTTTG TAGATACTAT TATTGTGTGT
ACTTGTACTG CGGTGATTAT TTTATTATCA GATAATTATG GTGGCGAACA ACTGAAAAAC
ATTTCATTAA CCCAATATGC TTTACAGTAC CATGTTGGTG AATTTGGCTT ACATTTCTTA
GCTTTTATCC TATTGTTATT TGCATTTTCT TCTATTATCG GAAACTACGC TTATGCAGAA
AGTAACATTC GTTTTATTCG CAATAAACCA TTGTTTATTC TCACTTTCCG TTTAATTGTG
TTGTTCTTTG TGTATTTTGG TGCAGTCAAT TCAGGAAATA TTGTATGGAA CTTTGCAGAT
ACGGTGATGG CGATTATGGC ATTAATTAAC CTTGTGTCTA TTGTTTTATT GGCACCGATA
GTTATGTTGT TGCTAAAAGA TTACCGCCAA CAGCTCAAAG CGGGTAAAGA TCCTGAATTT
AAAATTGAAC AATACCCTCA ATTACTTCGT AAAGGCGTTG ATCCTACTCT TTGGAAATAA
 
Protein sequence
MFSTIIINIE QALSWFVDKF DGPLWDLATI TLLGVGVFFT LATGFIQLRL FPQSLREMWF 
GRAVQGQSLT PFQAFTTGLA SRVGVGNIGG VATAIALGGE GAVFWMWVTA LIGMSSAFAE
SSLAQLFKVK DKNGLFRGGP AYYITRGLKA PWLAVCFAIA LIFTFGFAFN AVQSNAIVEA
TKNAWKWQPH YVGVTLVIVT GLIIFGGVKR IGKVSAQIVP MMALFYLIIA VIILGMNIEM
VPTVISRIIQ SAFNFDAMAG GMFGAIFSKA MLMGIKRGLF SNEAGMGSAP NAAASADVKH
PVSQGLIQML GVFVDTIIVC TCTAVIILLS DNYGGEQLKN ISLTQYALQY HVGEFGLHFL
AFILLLFAFS SIIGNYAYAE SNIRFIRNKP LFILTFRLIV LFFVYFGAVN SGNIVWNFAD
TVMAIMALIN LVSIVLLAPI VMLLLKDYRQ QLKAGKDPEF KIEQYPQLLR KGVDPTLWK