Gene HS_1751 details


Gene Information

Locus tag: HS_1751
Symbol:
ID: 4241285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1976664
End bp: 1978895
Gene Length: 2232 bp
Protein Length: 743 aa
Translation table: 11
GC content: 43%
IMG OID: 638105344
Product: hypothetical protein
Protein accession: YP_719956
Protein GI: 113461887
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
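
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1976664
end_bp = 1978895

# Inclusive span: both the start and end positions count as bases.
gene_length = end_bp - start_bp + 1

# One codon per three bases; the final codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2232
print(protein_length)  # 743
```

Both values agree with the Gene Length (2232 bp) and Protein Length (743 aa) fields.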


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCATCGA AGTATTTATT AGAAGGGTTA TTGCGTCCAC CTGTGGAGTT TTATCCTGCT 
GCAGCTCACG CTGCCTGTGC CTATATTTGT GCATTCGCAC CTTGGTCCCT TGCCTTAAAT
CCAATGATTG GCTATGGTCT GGCAGCAAGT TTTAGCGGGA TAAGCTATCT GCGTTTTCGC
CAAGGGTGGC GTATTGTACG CTATCATCGT AATTTAAAGC GTCTGCCTCA TTATGCCCTA
ACCAGCCGTC AAATTCCGGT AAATAAAAAA TTCTTATTTT TAGGTCGCGG ATTTGAATGG
CAACCGGAAC ACACCCAGCG GCTCTATGAT TGTTTTTCCG CCGATGGCAA AAAATACTTA
CGACCCTCCA AGCTATTTAA TATGGCAAGA GAATATGAAA AACATCACGA AAATCTAATT
ACTAAGTTAA CAACATCAGA CTCATGCTTT AATCCGGTAA GACCCTTACC GCCGGTTGAA
GGTATTCCTG CGATTCACGG CATTGAGTTA AATGAAATTG ATGTGATGCA ACCGTTAAAT
TCACGCGGTG GACATACAGC CGTTTTTGGA ACGACCGGGG TAGGGAAAAC CCGCTTTGCT
GAGGTGCTTG TTACGCAAGA TATTCATCGG GGAAAAACGC AAGCTGAGCG TGAAGTGGTT
ATCTTTTTTG ATCCCAAAGG CGATCCTGAT ATGTTAAAAC GTATGTATGC CGAAGCTAAA
CGAGCAGGGC GGGATAAGGA ATTTTACATG TTCCATTTAG GGTATCCTGA ACGCTCAGCA
CGTTATAACC CTATTGGTCG CTTTGGTCGT GTTTCAGAGG TCGCTGGGAG AATTTCAGGG
CAATTAAGTG GGGCGGGGAA TTCTGCCGCA TTTAAGGAAT TTGCTTGGCG TTTTGTGAAT
ATTGTCGCAA GAGCTTTGGT TGAAATGGGT GAGCGTCCGA ATTATTCACA AATTTCTCGT
TTTGTACAAA ATATTGATTC TCTTTTTCTT AATTACGCCA AAACCTACTT TGATGCTCGG
GATACCGCAC TTTGGCCACA GCTATTAACC ATTGCAGCAG GCGTCGATGT AAAAAATTTA
TCCTTTGGTA TGAAAGACCG TCCGCTGATT TGGGTGATTA ATCAATATAT TTTAGAAAAT
AAAATTTTCG ATCCCGTCTT GGAGGGGTTG GCGACAGCAG TGCGTTATGA TAAAACTTAT
TTTGATAAAA TTGTGGCGTC GTTATTACCG CTCTTGGAAA AATTGACGAC AGGGAAAATC
GAAGAGTTAC TATCACCAGA TTATTCAGAT GTGAATGATG AACGACCTAT TTTTGATTGG
GAGGAAGTGA TTCGTAAACG GGGGATTGTC TATGTGGGGC TGGACGCCTT ATCTGATGCT
ACGGTTGCAG CTGCTGTGGG CAACAGTATG TTTGCCGATT TGGTTTCAAT GGCGGGGCAT
ATTTATAAGC ACGGAGTGGA TGAAGGATTG CCAGATGCAT TTAAAGGTCA GTCAAAGCCG
GTTAAAATCA ATTTACATTG TGATGAATTT AACGAGTTGA TGGGTGATGA ATTCATTCCG
TTGATTAATA AGGGGCGAGG AGCCGGAATG CAGGTTACGG CTTATACTCA AACCATTGCT
GATATTGAAG CCCGTCTTGG CAATCGGGCT AAAGCGGAAC AAACCATTGG TAACTTTAAT
ACGTTGATAA TGTTTCGGGT AAAATCACCG GCAACGGCAA AACTATTAAC CGATCAGCTA
CATAAAGTGA CGATTTTACA AAGTATGGTG ACTTCCAGTT TTACCGACTC CAGTAATCCT
GATGACGACA AAGCCTTTAC CTCAAATACC GGGGAACGCA TCTCGCAAAA AGAAGTGCCG
TTATTAGATG TGGCGAATGT AACCAATCTC CCTAAAGGGC AGGCGTTTAT TTTGATGAAC
GGGAGTACGT TATATAAAGT GCGGATGCCA TTACCTGCAA TTGATAAAGA AGATGTGATC
CCAGCTTCGA TGCAAGAACT CTTGGATAAA ATGCACAAAC ATTATCATGT AGCAACAAAT
TGGTGGGAGC CGGCATTTAA AGACTATACG CCGTCAAAGG ATATTGCCGA TTCATTTGAA
AATATGGTGA CAGAGGAAAA AATGGCAAAA GCCGATTTAA ATTGGGGTGC TGATATTGAC
CTGCCTGAAG ATGATGAAGC TGAGTCGATA AGTCAAGCCG ATACGGATGA ACAAGATATG
GAGGAAGAAT GA
 
Protein sequence
MASKYLLEGL LRPPVEFYPA AAHAACAYIC AFAPWSLALN PMIGYGLAAS FSGISYLRFR 
QGWRIVRYHR NLKRLPHYAL TSRQIPVNKK FLFLGRGFEW QPEHTQRLYD CFSADGKKYL
RPSKLFNMAR EYEKHHENLI TKLTTSDSCF NPVRPLPPVE GIPAIHGIEL NEIDVMQPLN
SRGGHTAVFG TTGVGKTRFA EVLVTQDIHR GKTQAEREVV IFFDPKGDPD MLKRMYAEAK
RAGRDKEFYM FHLGYPERSA RYNPIGRFGR VSEVAGRISG QLSGAGNSAA FKEFAWRFVN
IVARALVEMG ERPNYSQISR FVQNIDSLFL NYAKTYFDAR DTALWPQLLT IAAGVDVKNL
SFGMKDRPLI WVINQYILEN KIFDPVLEGL ATAVRYDKTY FDKIVASLLP LLEKLTTGKI
EELLSPDYSD VNDERPIFDW EEVIRKRGIV YVGLDALSDA TVAAAVGNSM FADLVSMAGH
IYKHGVDEGL PDAFKGQSKP VKINLHCDEF NELMGDEFIP LINKGRGAGM QVTAYTQTIA
DIEARLGNRA KAEQTIGNFN TLIMFRVKSP ATAKLLTDQL HKVTILQSMV TSSFTDSSNP
DDDKAFTSNT GERISQKEVP LLDVANVTNL PKGQAFILMN GSTLYKVRMP LPAIDKEDVI
PASMQELLDK MHKHYHVATN WWEPAFKDYT PSKDIADSFE NMVTEEKMAK ADLNWGADID
LPEDDEAESI SQADTDEQDM EEE
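
The protein sequence starts with M although the gene sequence starts with GTG, which is consistent with translation table 11, where GTG can act as an initiation codon. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the standard codon assignments used by table 11 and the set of alternative start codons listed for that table; `translate` and its `first_is_start` parameter are hypothetical names introduced here for illustration.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons assumed for table 11; at the first position they are
# read as methionine regardless of their usual meaning.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, first_is_start=True):
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGGCATCGA AGTATTTATT AGAAGGGTTA TTGCGTCCAC CTGTGGAGTT TTATCCTGCT"
print(translate(first_line))  # MASKYLLEGLLRPPVEFYPA

# Last line of the gene sequence; TGA is the stop codon.
print(translate("GAGGAAGAAT GA", first_is_start=False))  # EEE
```

The two outputs match the first 20 and the last 3 residues of the protein sequence shown above.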