Gene Information |
Locus tag | Haur_3997 |
Symbol | |
ID | 5735858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5102626 |
End bp | 5104581 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281147 |
Product | phosphate binding protein |
Protein accession | YP_001546757 |
Protein GI | 159900510 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | [TIGR02136] phosphate binding protein |
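The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions: that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(5102626, 5104581)
print(length)                  # 1956
print(protein_length(length))  # 651
```

Under these assumptions both results agree with the Gene Length (1956 bp) and Protein Length (651 aa) fields.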
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGAAAC TGCTGAGTTT AATGCTTGCC TCAATTATGT TGCTGACAAT GCTCGCAGCA TGTGGCGGCG ACAGCACTCC AACCACTGCA CCAACTACGG CACCAGCTAC CGCCACTACT GGCCAAGCTG CTGCAACAAC TGAACCAACC GCTCCTGCTG CTGAAACTCC TACAGTTGAA GCAACCGCTG AAGTTCCTGC TGGTGGAGAA GTTGACCCAG CAATGGTAAA AGGCGATATC GTGAGCGCTG GTTCATCAAC GGTCTATCCG TTGAGCGAAG CCGTGGCCGA AATCTTTACC GAAGATGGCT ACACTGGCAA TATCACGATC GATAGCATCG GCACGGGCGC TGGGTTCGAG CGCTTCTGTA CCGCTGCCGA AACCGACATC GCCAACGCCA GCCGCGCAAT CAAAGACGAA GAAGCCAAAG CCTGTGCCGA TAAAGGCCGC GAAGTCGTCG AGTTCCGCGT CGGTACCGAT GCCTTGGCCG TCGTGGTCAG CAGCAAAAAT ACCTTCGTCA GCAACTTGAC CGAAGCCCAA GTCGCCGACA TCTTCTCAGG CACCTACAAA ACCTGGGATC AAGTTGATGC CAGCTACCCA GCCGAAGCGA TCAAACTCTA CAGCCCAGGC ACCGATAGCG GTACCTTCGA CTACTTCGTC GAACATTTCT ACGCGAAAGA AGGTAAGTTC ATCTTGGGTG CAAACCCACA GTTGAGCGAA GACGATAACG TGTTGGTGAC CGGGATCGAA GGCGATGCCA ATGCGATCGG CTACTTTGGC TATGCCTACT ACAACGAAAA CAAAGCTAAG CTCAAAGCCT TGACGATTGA CGGTGTGGAA CCAACCGAAG CCACCACCGA AGATGGCAGC TATCCGTTGG CTCGTCCGTT GTACATCTAC TCAGCCAAGA ATATTTTGAC TGAAAAGCCT CAAGTCGCGG CTTTCATCAA CTACTACTTG ACCAACGTCA ACGATGTTAT TCTTGAAGTA GGCTACTTCC CAGCCAGCGA CGAAGCGTTG GGCGAAGCTA AAGACGCTTT GGTTAATGCC TTGACGGGTG GCAGCAGCAG CAATACCAAC ACTGGTAGCG CCGTTGCCCT CGAAGAAGTT GATCCAGCAG CAGTTCAAGG TGATATCGTG AGCGCTGGTT CATCAACGGT CTATCCGTTG AGCGAAGCCG TGGCCGAAAT CTTCGGCGAA GATGGCTACA GTGGCAATAT CACGATCGAT AGCATTGGCA CGGGCGCTGG GTTCGAGCGC TTCTGTACCG CTGCCGAAAC CGACATCGCC AACGCCAGCC GCGCAATCAA AGACGAAGAA GCCAAAGCCT GTGCCGATAA AGGCCGCGAA GTCGTCGAGT TCCGCGTCGG TACCGATGCT TTGGCCGTCG TGGTCAGCAG CAAAAATACC TTCGTCACCA ATTTGACCGA AGCTCAAGTC GCTGACATCT TCTCAGGCAC CTACAAAACC TGGGACCAAG TTGATGCCAG CTACCCAGCC GAAGCGATCA AACTCTACAG CCCAGGCACC GATAGCGGTA CTTTCGACTA CTTCGTTGAA CATTTCTACG CGAAAGAAGA AAAATTCATG TTGGGTGCAA ACCCACAGTT GAGCGAAGAC GATAACGTGT TGGTGACCGG GATCGAAGGC GATGCCAATG CGATCGGCTA CTTTGGCTAT GCCTACTACA ACGAAAACAA GAGCAAACTC AAAGCGTTGA CGATTGATGG TGTGGAACCA ACCGAAGCCA CCACCGAAGA TGGCAGCTAT CCGTTGGCTC GTCCGTTGTA CATCTACTCG 
GCTAAGAACA TCTTGGCTGA AAAAGCCCAA GTCGCGGCCT TTATCAACTA CTACTTGACC AACGTCAATG AAGTTATCCT CGAAGTGGGC TACTTCCCAG CGAGTGAAGA AGCATTGAAC GAAGCCAAGC AAAACCTGCT CGACGCAACC AAGTAA
|
Protein sequence | MRKLLSLMLA SIMLLTMLAA CGGDSTPTTA PTTAPATATT GQAAATTEPT APAAETPTVE ATAEVPAGGE VDPAMVKGDI VSAGSSTVYP LSEAVAEIFT EDGYTGNITI DSIGTGAGFE RFCTAAETDI ANASRAIKDE EAKACADKGR EVVEFRVGTD ALAVVVSSKN TFVSNLTEAQ VADIFSGTYK TWDQVDASYP AEAIKLYSPG TDSGTFDYFV EHFYAKEGKF ILGANPQLSE DDNVLVTGIE GDANAIGYFG YAYYNENKAK LKALTIDGVE PTEATTEDGS YPLARPLYIY SAKNILTEKP QVAAFINYYL TNVNDVILEV GYFPASDEAL GEAKDALVNA LTGGSSSNTN TGSAVALEEV DPAAVQGDIV SAGSSTVYPL SEAVAEIFGE DGYSGNITID SIGTGAGFER FCTAAETDIA NASRAIKDEE AKACADKGRE VVEFRVGTDA LAVVVSSKNT FVTNLTEAQV ADIFSGTYKT WDQVDASYPA EAIKLYSPGT DSGTFDYFVE HFYAKEEKFM LGANPQLSED DNVLVTGIEG DANAIGYFGY AYYNENKSKL KALTIDGVEP TEATTEDGSY PLARPLYIYS AKNILAEKAQ VAAFINYYLT NVNEVILEVG YFPASEEALN EAKQNLLDAT K
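For readers who want to recompute a GC fraction from a sequence formatted like the one above (ten-base blocks separated by spaces), the following is a minimal sketch. The helper name `gc_fraction` is hypothetical, and the record does not say how its GC content value was calculated or rounded, so this is not presented as the database's method.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Demo on the first ten-base block of the gene sequence only.
print(f"{gc_fraction('ATGCGGAAAC'):.0%}")  # 50%
```

Passing the full gene sequence string gives the value for the whole gene.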