Gene HS_1271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1271 
Symbol 
ID4240782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1456960 
End bp1458546 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content40% 
IMG OID638104844 
ProductABC transporter, periplasmic binding protein 
Protein accessionYP_719483 
Protein GI113461414 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000259921 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAATT ACATTGAAAA TGAATCTCGC CGTAATTTCA TGAAAGTATT GGCTGGTGTC 
GGTGCAGGAA TTGCCTTTAG CGGTACATTA GGTAGTTTTG TTTCTAAGGC GAAGGCCACT
TCAACAACAG GTAGTAGCAT TGAAGCAGGA ATAGCTTATC CCCTATCAAC CGGTTTTGAT
CCGTCCACTT CCAGTGGAGC CTCTTCCTTT GCCGCCAACA TGCACTTTTT TGAAGGATTA
GTGGATTTAC ACCCTGCCAC CCGTGAACCT TATCTCGCCT TAGCTGCAAA AGAACCTGAA
CAAGTTGATG ATATAACATG GCGTGTTACA TTACGTGACG GTGCGACTTT CCACGATGGT
ACACCTGTTA CCACGGAGGA TATAGTATAT TCTTATCAGC GTATTTTAGA TCCAAAAAAT
GCTTCACTAT TTATACAATT CATTCCGTTT ATTGATTCAG TCAACGCTTT AGATGATAAA
GTGGTAGAAT TTAAGTTGAA GTACCCGTTT TCTCTTTTTA AATTGCGATT AGGTATTGTA
AAAATTGTAC CTAAACATGT GATTGAGGCT GTAGGACAAA CGGTATTTGA TGCAAATCCG
GTAGGATCCG GACCATATAA ATTTGTTTCA GCCGTAAAAG ACGATCGCAT CGTATTTGCC
GCACACAATG CTTACAACGG CCCATATCCT GCTCGTGTAG AAAAAATGAC ATGGTTTTTA
CTATCCGATG ATGCCGCTCG CACAGCGGCA CAAGAGTCCG GACGTACTCA AGCAATGGAA
AGTGTTCCTT ATTTAGACGT ATCTCGCTTA AAACGTAAAA GTGCGGTTGA ATCCGTGCAA
TCTTTCGGGT TGTTATTCTT AATGTTTAAT TGTAAGAAAG CCCCATTTAA CAATCCCAAA
GTACGTCAAG CTTTACACTA TGCACTGGAT ACACAAAAAT TAATTGATAT TGTATTCTTA
GGTAATGCAA AAGCCGCCAC TTCTTACACT CAAGATACGC ATCCTGATTA TGTCAAGGCT
TCAACCCAGT ATGATTATAA TCCTGAAAAA GCCACCGCGC TTTTAAAAGA AGCAGGTCTG
AATAAATTAG AGTTTCAATT GCTTTCTACA GATCATTCTT GGATAAAAGA ATGTGCCCCT
TTAGTTTTAG AATCTTGGAA TAAAATTCCG GGTGTGAAAG TAACTTTACA GCATCTACAA
TCTGGAGCAT TGTATGGTGG TTACGTAGAT AAAGGAAATT ATGAAGTCGT AATGGCACCG
GGGGATCCAT CAGTATTTAG CAACGATTTG GATCTCTTAT TGAGCTGGTG GTATCGTGGT
GATGTTTGGC CGAAAAAACG CTTCGGTTGG TCAGATACCC CTGAATATGC GAAATTGCAA
TTATTACTTG ATGATGCAAT TAAAGCAAAA CAGCCATCAG ATGCAAAAGC GGCTTGGACT
CAGGCTATTA ATCTTATTGC TGAACAAGTA CCGCTTTATC CAATTTTGCA CCGTAAATTA
CCAACCGCTT GGAATAACAA ATCACTAGAT GGTTTCCAAC CAATTCCAAC TACCGGACTT
TCATTTATTG GTGTAGGTCG CAAATAA
 
Protein sequence
MTNYIENESR RNFMKVLAGV GAGIAFSGTL GSFVSKAKAT STTGSSIEAG IAYPLSTGFD 
PSTSSGASSF AANMHFFEGL VDLHPATREP YLALAAKEPE QVDDITWRVT LRDGATFHDG
TPVTTEDIVY SYQRILDPKN ASLFIQFIPF IDSVNALDDK VVEFKLKYPF SLFKLRLGIV
KIVPKHVIEA VGQTVFDANP VGSGPYKFVS AVKDDRIVFA AHNAYNGPYP ARVEKMTWFL
LSDDAARTAA QESGRTQAME SVPYLDVSRL KRKSAVESVQ SFGLLFLMFN CKKAPFNNPK
VRQALHYALD TQKLIDIVFL GNAKAATSYT QDTHPDYVKA STQYDYNPEK ATALLKEAGL
NKLEFQLLST DHSWIKECAP LVLESWNKIP GVKVTLQHLQ SGALYGGYVD KGNYEVVMAP
GDPSVFSNDL DLLLSWWYRG DVWPKKRFGW SDTPEYAKLQ LLLDDAIKAK QPSDAKAAWT
QAINLIAEQV PLYPILHRKL PTAWNNKSLD GFQPIPTTGL SFIGVGRK