Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0449 |
Symbol | tbpA |
ID | 4239926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 475187 |
End bp | 478165 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 638103992 |
Product | transferrin-binding protein A |
Protein accession | YP_718659 |
Protein GI | 113460594 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01776] TonB-dependent lactoferrin and transferrin receptors [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0459359 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAAATC GCTCGGCTCT GTTGAGAAAG CATGACCCAT CCATCTTTTA CTCACACGGA ATGAAAAAAA TGTCTACAAA ACCTTTGTTT AAACTTAAGC CAATAACATT GGCTATCAGC ACGATTTTTT TACCTTTTAC TGAGGCGGTT GCCGATACTG AATCACCGAG TAGCAATACA GAAGCAGTGC TGGAGTTAGA AGCTATCCAG GTGCAAGCCA AACACGAGAT CAGCAGACAT GACAATGAAG TCACCGGTTT GGGTAAGGTG GTCAAAAGCA GTGAAGACAT TGATAAAGAA CTGATTTTGA ATATTCGCGA TTTGACCCGT TATGATCCCG GTATTTCGGT GGTGGAGCAG GGACGTGGTG CAACGTCAGG CTATGCAATG CGTGGTGTTG ACAGAAACCG CGTGGCTATG TTGGTGGACG GCTTGGGACA GGCGCAGTCC TATTCTACCT TGAAATCCGA TGCAAACGGC GGGGCGATTA ATGAAATTGA ATATGAGAAT ATTAAGTCAA TTGAATTGAG CAAGGGGTCC AGTTCGGCAG AATACGGTAG CGGTGCCTTG GGCGGTGCGG TAGGGTTTCG TACCAAAGAA GCTGATGATG TGATTAAAGA GGGGCAAAAC TGGGGCTTGG ACAGTAAAAC GGCTTACAGC AGCAAAAACA GCCAGTTTAT CCAATCCGTT GCCGGTGCGT TCCGTGTCGG CGGTTTTGAC AGTTTGGCGA TTTTTACTCA TCGTAAAGGT AAGGAAACCC GCGTGCATCC TGCCGCCGAA GAAATACAAC ATACTTACCA ACCATTGGAA GGGTATTTTA ATCGGTATGA GGTTGACCAA AACAGCAACG GAACGCCTGT TCGGGCGAAT GCGTATTATA TACTTGCCGA TGAATGTTCT AATTTAAGTG ATCCGAGTTG TCGTCATGCC AAGGCCAAGA CGAATAGGGT GGGTGCCCCG GAGAACAATC CTAATTGGAC GCCCGAAGAG CAGGCACAGG CTGCTAAAAT GCCGTATCCG ACACGTACCG CCTCTGCCAA AGATTATACG GGTCCTGACC GCATCAGCCC TAATCCGATG GACTACCAAA GTCACTCTTT CTTCTGGAAA GGTGGTTACC GCTTGTCGCC TAACCATTAT GTCGGCGGGG TGTTGGAACA TACGAAGCAG CGTTACGATA TCCGTGATAT GACGCAACGG GCGTATTACA CGAAAGAGGA TATCTGCCAC AGCGGATCCA GTTGCCAAAC GTTGGATAAA AATGAGACGG ACAAAGGTAA TTTCGGTATC ACGTTGACTG ATAATCCTTT GGACGGTTTG GTATATGATG CCGGCAATCA AGCTCGTGGC GTGCGGTACG GACGGGGTAA ATTTTTTGAT GAACGCCATA CGAAAAATCG CTCGGGTATT TTTTACCGCT ATGAGAATCC CGATAAAAAT TCTTGGGCAG ATAGCTTGAC CTTGAGTATT GACCGCCAAG ATCTCAAACT GTCGAGCCGT ATCCATTGGA CGTATTGCAC CGATTATCCT CATGTGGCAC GTTGCCGTGC CAGCTTGGAC AAACCTTGGT CTAATTACCG TACCGAGAAA AACGATTATC AAGAACGACT CAATCTGGGA CAATTCAATT GGGAAAAAAC TTTTAATCTG GGCTTTACCA CGCATAAGGT GAATATCGCC GCCGGCTTTG GTACACATCG CTCCACCTTA CAACATGGCG ACTTATATGC TGAATATGTC ACCTTGCCGC CGTATGAGGA AATAAAAGCG TATGATGATA AAGGTGCCTT TAAAACAGAT ACGACCCCTG AGGATAAATT ACAATACGGC AATGGTTCTT ATGACAAACC TCGCGTATAT AGACGTAAAA ACACGCCGGA ATTAAAAACT GTCAATGGGT GCAATGAGAC AGCAGGCGAT AACCGTGACT GCTCGCCACG TGTGATTACG GGCAGACAGT ATTACCTTGC CTTGCGTAAC CATATTGCCT TTGGTGAATG GGCAGACTTG GGGTTGGGCG TGCGGTACGA CAACCATACC TTCCGCTCGA ATGACCCGTG GACCAAAGGT GGCAACTACC ACAACTGGTC GTGGAATGCG GGCGTGAGCC TCAAACCAAC CCGCCACTTT GTCGTGTCTT ACCGTGTGTC CAGCGGTTTC CGTGTCCCCG CTTTTTATGA GCTGTACGGC GTGCGTACGG GGGCTTCTGG TAAAGACAAT CCACTCACAC AAAAAGAGTT CTTGAGCCGT AAACCGTTGA AAAGCGAAAA AGCCTTTAAC CAAGAAATTG GTTTGGCCGT TCAGGGCGAT TTTGGTGTGA TAGAGACCAG TTTCTTCCAA AACAACTATA AAAACCTGCT TGCCCGTGCA GATAAACATG TCGAGGGATT GGGTTATGTA ACCGATTTTT ACAACACCCA AGATGTCAAA CTCAACGGTA TCAATATCTT GGGTAGAATC TACTGGGAAG GCATCAGCGA TAGGCTGCCT GAAGGCTTGT ATTCCACACT TGCTTACAAC CGTATCAATA TCAAAGCACG CAAATTGCAC GACAATTTTA CCAATGTGTC TGAGCCGACA TTGGAAGCCG TGCAACCGGG ACGCATTATT GCAAGTATCG GCTATGATGA CCCTGAGGGC AGATGGGGCC TTAATTTAAG CGGCACCTAC TCTCAAGCCA AACAACGTGA CGAAGTGGTC GGCGAAAAAG TGTTCGGCAA GGGTGGCAGC ATTAAACGGA CGATCAACAG CAAACGCACT CGCGCTTGGT ATATTTATGA TTTGACGGCA TACTACACTT GGAAAGAAAA ATTCACGTTG AGAGCCGGTA TCTATAATTT AACCAATCGT AAATATAGTA CATGGGAAAG TGTGCGTCAG TCCGCTGCCA ATGCGGTCAA TCAAGACCTA GGTACACGTT CAGCACGTTT TGCCGCACCG GGCAGAAACT TTACCGTGAG TATGGAAATG AAGTTTTAA
|
Protein sequence | MANRSALLRK HDPSIFYSHG MKKMSTKPLF KLKPITLAIS TIFLPFTEAV ADTESPSSNT EAVLELEAIQ VQAKHEISRH DNEVTGLGKV VKSSEDIDKE LILNIRDLTR YDPGISVVEQ GRGATSGYAM RGVDRNRVAM LVDGLGQAQS YSTLKSDANG GAINEIEYEN IKSIELSKGS SSAEYGSGAL GGAVGFRTKE ADDVIKEGQN WGLDSKTAYS SKNSQFIQSV AGAFRVGGFD SLAIFTHRKG KETRVHPAAE EIQHTYQPLE GYFNRYEVDQ NSNGTPVRAN AYYILADECS NLSDPSCRHA KAKTNRVGAP ENNPNWTPEE QAQAAKMPYP TRTASAKDYT GPDRISPNPM DYQSHSFFWK GGYRLSPNHY VGGVLEHTKQ RYDIRDMTQR AYYTKEDICH SGSSCQTLDK NETDKGNFGI TLTDNPLDGL VYDAGNQARG VRYGRGKFFD ERHTKNRSGI FYRYENPDKN SWADSLTLSI DRQDLKLSSR IHWTYCTDYP HVARCRASLD KPWSNYRTEK NDYQERLNLG QFNWEKTFNL GFTTHKVNIA AGFGTHRSTL QHGDLYAEYV TLPPYEEIKA YDDKGAFKTD TTPEDKLQYG NGSYDKPRVY RRKNTPELKT VNGCNETAGD NRDCSPRVIT GRQYYLALRN HIAFGEWADL GLGVRYDNHT FRSNDPWTKG GNYHNWSWNA GVSLKPTRHF VVSYRVSSGF RVPAFYELYG VRTGASGKDN PLTQKEFLSR KPLKSEKAFN QEIGLAVQGD FGVIETSFFQ NNYKNLLARA DKHVEGLGYV TDFYNTQDVK LNGINILGRI YWEGISDRLP EGLYSTLAYN RINIKARKLH DNFTNVSEPT LEAVQPGRII ASIGYDDPEG RWGLNLSGTY SQAKQRDEVV GEKVFGKGGS IKRTINSKRT RAWYIYDLTA YYTWKEKFTL RAGIYNLTNR KYSTWESVRQ SAANAVNQDL GTRSARFAAP GRNFTVSMEM KF
|
| |