Gene HS_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0449 
SymboltbpA 
ID4239926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp475187 
End bp478165 
Gene Length2979 bp 
Protein Length992 aa 
Translation table11 
GC content47% 
IMG OID638103992 
Producttransferrin-binding protein A 
Protein accessionYP_718659 
Protein GI113460594 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01776] TonB-dependent lactoferrin and transferrin receptors
[TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0459359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAAATC GCTCGGCTCT GTTGAGAAAG CATGACCCAT CCATCTTTTA CTCACACGGA 
ATGAAAAAAA TGTCTACAAA ACCTTTGTTT AAACTTAAGC CAATAACATT GGCTATCAGC
ACGATTTTTT TACCTTTTAC TGAGGCGGTT GCCGATACTG AATCACCGAG TAGCAATACA
GAAGCAGTGC TGGAGTTAGA AGCTATCCAG GTGCAAGCCA AACACGAGAT CAGCAGACAT
GACAATGAAG TCACCGGTTT GGGTAAGGTG GTCAAAAGCA GTGAAGACAT TGATAAAGAA
CTGATTTTGA ATATTCGCGA TTTGACCCGT TATGATCCCG GTATTTCGGT GGTGGAGCAG
GGACGTGGTG CAACGTCAGG CTATGCAATG CGTGGTGTTG ACAGAAACCG CGTGGCTATG
TTGGTGGACG GCTTGGGACA GGCGCAGTCC TATTCTACCT TGAAATCCGA TGCAAACGGC
GGGGCGATTA ATGAAATTGA ATATGAGAAT ATTAAGTCAA TTGAATTGAG CAAGGGGTCC
AGTTCGGCAG AATACGGTAG CGGTGCCTTG GGCGGTGCGG TAGGGTTTCG TACCAAAGAA
GCTGATGATG TGATTAAAGA GGGGCAAAAC TGGGGCTTGG ACAGTAAAAC GGCTTACAGC
AGCAAAAACA GCCAGTTTAT CCAATCCGTT GCCGGTGCGT TCCGTGTCGG CGGTTTTGAC
AGTTTGGCGA TTTTTACTCA TCGTAAAGGT AAGGAAACCC GCGTGCATCC TGCCGCCGAA
GAAATACAAC ATACTTACCA ACCATTGGAA GGGTATTTTA ATCGGTATGA GGTTGACCAA
AACAGCAACG GAACGCCTGT TCGGGCGAAT GCGTATTATA TACTTGCCGA TGAATGTTCT
AATTTAAGTG ATCCGAGTTG TCGTCATGCC AAGGCCAAGA CGAATAGGGT GGGTGCCCCG
GAGAACAATC CTAATTGGAC GCCCGAAGAG CAGGCACAGG CTGCTAAAAT GCCGTATCCG
ACACGTACCG CCTCTGCCAA AGATTATACG GGTCCTGACC GCATCAGCCC TAATCCGATG
GACTACCAAA GTCACTCTTT CTTCTGGAAA GGTGGTTACC GCTTGTCGCC TAACCATTAT
GTCGGCGGGG TGTTGGAACA TACGAAGCAG CGTTACGATA TCCGTGATAT GACGCAACGG
GCGTATTACA CGAAAGAGGA TATCTGCCAC AGCGGATCCA GTTGCCAAAC GTTGGATAAA
AATGAGACGG ACAAAGGTAA TTTCGGTATC ACGTTGACTG ATAATCCTTT GGACGGTTTG
GTATATGATG CCGGCAATCA AGCTCGTGGC GTGCGGTACG GACGGGGTAA ATTTTTTGAT
GAACGCCATA CGAAAAATCG CTCGGGTATT TTTTACCGCT ATGAGAATCC CGATAAAAAT
TCTTGGGCAG ATAGCTTGAC CTTGAGTATT GACCGCCAAG ATCTCAAACT GTCGAGCCGT
ATCCATTGGA CGTATTGCAC CGATTATCCT CATGTGGCAC GTTGCCGTGC CAGCTTGGAC
AAACCTTGGT CTAATTACCG TACCGAGAAA AACGATTATC AAGAACGACT CAATCTGGGA
CAATTCAATT GGGAAAAAAC TTTTAATCTG GGCTTTACCA CGCATAAGGT GAATATCGCC
GCCGGCTTTG GTACACATCG CTCCACCTTA CAACATGGCG ACTTATATGC TGAATATGTC
ACCTTGCCGC CGTATGAGGA AATAAAAGCG TATGATGATA AAGGTGCCTT TAAAACAGAT
ACGACCCCTG AGGATAAATT ACAATACGGC AATGGTTCTT ATGACAAACC TCGCGTATAT
AGACGTAAAA ACACGCCGGA ATTAAAAACT GTCAATGGGT GCAATGAGAC AGCAGGCGAT
AACCGTGACT GCTCGCCACG TGTGATTACG GGCAGACAGT ATTACCTTGC CTTGCGTAAC
CATATTGCCT TTGGTGAATG GGCAGACTTG GGGTTGGGCG TGCGGTACGA CAACCATACC
TTCCGCTCGA ATGACCCGTG GACCAAAGGT GGCAACTACC ACAACTGGTC GTGGAATGCG
GGCGTGAGCC TCAAACCAAC CCGCCACTTT GTCGTGTCTT ACCGTGTGTC CAGCGGTTTC
CGTGTCCCCG CTTTTTATGA GCTGTACGGC GTGCGTACGG GGGCTTCTGG TAAAGACAAT
CCACTCACAC AAAAAGAGTT CTTGAGCCGT AAACCGTTGA AAAGCGAAAA AGCCTTTAAC
CAAGAAATTG GTTTGGCCGT TCAGGGCGAT TTTGGTGTGA TAGAGACCAG TTTCTTCCAA
AACAACTATA AAAACCTGCT TGCCCGTGCA GATAAACATG TCGAGGGATT GGGTTATGTA
ACCGATTTTT ACAACACCCA AGATGTCAAA CTCAACGGTA TCAATATCTT GGGTAGAATC
TACTGGGAAG GCATCAGCGA TAGGCTGCCT GAAGGCTTGT ATTCCACACT TGCTTACAAC
CGTATCAATA TCAAAGCACG CAAATTGCAC GACAATTTTA CCAATGTGTC TGAGCCGACA
TTGGAAGCCG TGCAACCGGG ACGCATTATT GCAAGTATCG GCTATGATGA CCCTGAGGGC
AGATGGGGCC TTAATTTAAG CGGCACCTAC TCTCAAGCCA AACAACGTGA CGAAGTGGTC
GGCGAAAAAG TGTTCGGCAA GGGTGGCAGC ATTAAACGGA CGATCAACAG CAAACGCACT
CGCGCTTGGT ATATTTATGA TTTGACGGCA TACTACACTT GGAAAGAAAA ATTCACGTTG
AGAGCCGGTA TCTATAATTT AACCAATCGT AAATATAGTA CATGGGAAAG TGTGCGTCAG
TCCGCTGCCA ATGCGGTCAA TCAAGACCTA GGTACACGTT CAGCACGTTT TGCCGCACCG
GGCAGAAACT TTACCGTGAG TATGGAAATG AAGTTTTAA
 
Protein sequence
MANRSALLRK HDPSIFYSHG MKKMSTKPLF KLKPITLAIS TIFLPFTEAV ADTESPSSNT 
EAVLELEAIQ VQAKHEISRH DNEVTGLGKV VKSSEDIDKE LILNIRDLTR YDPGISVVEQ
GRGATSGYAM RGVDRNRVAM LVDGLGQAQS YSTLKSDANG GAINEIEYEN IKSIELSKGS
SSAEYGSGAL GGAVGFRTKE ADDVIKEGQN WGLDSKTAYS SKNSQFIQSV AGAFRVGGFD
SLAIFTHRKG KETRVHPAAE EIQHTYQPLE GYFNRYEVDQ NSNGTPVRAN AYYILADECS
NLSDPSCRHA KAKTNRVGAP ENNPNWTPEE QAQAAKMPYP TRTASAKDYT GPDRISPNPM
DYQSHSFFWK GGYRLSPNHY VGGVLEHTKQ RYDIRDMTQR AYYTKEDICH SGSSCQTLDK
NETDKGNFGI TLTDNPLDGL VYDAGNQARG VRYGRGKFFD ERHTKNRSGI FYRYENPDKN
SWADSLTLSI DRQDLKLSSR IHWTYCTDYP HVARCRASLD KPWSNYRTEK NDYQERLNLG
QFNWEKTFNL GFTTHKVNIA AGFGTHRSTL QHGDLYAEYV TLPPYEEIKA YDDKGAFKTD
TTPEDKLQYG NGSYDKPRVY RRKNTPELKT VNGCNETAGD NRDCSPRVIT GRQYYLALRN
HIAFGEWADL GLGVRYDNHT FRSNDPWTKG GNYHNWSWNA GVSLKPTRHF VVSYRVSSGF
RVPAFYELYG VRTGASGKDN PLTQKEFLSR KPLKSEKAFN QEIGLAVQGD FGVIETSFFQ
NNYKNLLARA DKHVEGLGYV TDFYNTQDVK LNGINILGRI YWEGISDRLP EGLYSTLAYN
RINIKARKLH DNFTNVSEPT LEAVQPGRII ASIGYDDPEG RWGLNLSGTY SQAKQRDEVV
GEKVFGKGGS IKRTINSKRT RAWYIYDLTA YYTWKEKFTL RAGIYNLTNR KYSTWESVRQ
SAANAVNQDL GTRSARFAAP GRNFTVSMEM KF