Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4906 |
Symbol | |
ID | 5736742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6243022 |
End bp | 6244095 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641282073 |
Product | phosphate binding protein |
Protein accession | YP_001547664 |
Protein GI | 159901417 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | [TIGR02136] phosphate binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.265302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTACAAC GATTTTTTTT GCTTTGCTGC CTCGTAACGT TGGCTGGATG TGGTGCAACT GCCGCCCAAC CGACGGCTAC AAGCCTCCCC GCGACAGCAA CACCCGCACC AACCACCCTC GCTCAAGCAA CTCCTGCAGT AACGGCGAAT CCAGCTTTGC GCGGCACAAT CGTGATCGAT GGCTCTAGCA CGGTGTTTCC AATTACTGAG GCAGTTGCCC GTGAGTTTGC CCTGACTGCG CCGAATGTTC AAGTGCAATT GGGCGTGAGC GGCACTGGTG GCGGGTTTAA GAAGTTTTGT GCTGGCGAAA CGGTGATCTC CGATGCTTCA CGCCCAATCA AGCAGAGCGA AGCCGCCGAG TGTGCCGCCA ACCAGATTGA TTTTGTGGAA ATTCCGGTGG CCTTCGATGG CCTCTCGTTG GTTGCCAACC CCAGCAATAC ATGGCTCGAA TGTATGACCG TGGCCGAATT GAACACGCTC TGGCAGCCGG ATGCGACCAA TATTATTACC AATTGGCGTA TGTTGCGACC CATTTGGCCA ACCAGTACCT TGCAATTGTA TGGCGCGGGT CAAGATTCAG GCACCTTCGA TTATTTCACC AGCGCAATTG TTGGAACTGA GGGTTCCAGC CGTAGCGACG TGATTAGCAG CGAAGACGAT TATCTGATTG CTCAGGATAT TGCGGGCGAC CCCAATGCTT TGGGCTATTT TGGTTATGCC TACTATCGCG AATATCAAGA ACGCCTGAAA CTAATTGCGG TCGATGCTGG CAATGGTTGC GTCCTGCCTT CTGAGCAAAC GATTGCTGAT GGCTCGTATC AACCACTTTC GCGGCCAATT TTCATTTATG TCCGCGCCGA TGCCTTAGAT CGGGCTGAAG TGGCGGCGTT TGTTGATTTT TATCTCAGTG ATTTGGCGCG GGTGGTGGCT GACGTGAAAT ATGTGCCCTT GCCAGCCCGG GCCTATCAAT TTGCCCAAGA GCGCGTGCAA CAACGCAAAC TTGGCTCGTT GTTCGAGGGT GGTTCGCAAA TCGGGGTTTC GATCGAACGC TTGCTTGAGC TAGAAGGACA ATAA
|
Protein sequence | MLQRFFLLCC LVTLAGCGAT AAQPTATSLP ATATPAPTTL AQATPAVTAN PALRGTIVID GSSTVFPITE AVAREFALTA PNVQVQLGVS GTGGGFKKFC AGETVISDAS RPIKQSEAAE CAANQIDFVE IPVAFDGLSL VANPSNTWLE CMTVAELNTL WQPDATNIIT NWRMLRPIWP TSTLQLYGAG QDSGTFDYFT SAIVGTEGSS RSDVISSEDD YLIAQDIAGD PNALGYFGYA YYREYQERLK LIAVDAGNGC VLPSEQTIAD GSYQPLSRPI FIYVRADALD RAEVAAFVDF YLSDLARVVA DVKYVPLPAR AYQFAQERVQ QRKLGSLFEG GSQIGVSIER LLELEGQ
|
| |