Gene Information |
Locus tag | Haur_1738 |
Symbol | |
ID | 5733625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2024118 |
End bp | 2025209 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278880 |
Product | periplasmic binding protein |
Protein accession | YP_001544509 |
Protein GI | 159898262 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4594] ABC-type Fe3+-citrate transport system, periplasmic component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.028401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTTCGTT TTACCCGTTT ACTCTCGTGT GGCTTGCTAC TGACGGTGTT AGCAGCCTGT GGCGGTGCCG CCAGCACCCC AACTAGCGCC CCAGCAGCGA CCGCGACCAC AGCTCCAACC GAGGCTGCTG CCGCTACGGC CACAACCGCG CCAACCGAAG CCGCCGTGGT CGAAGCAACT GCCACCGCTG CTAGCGATAC AACGACTACT TCTGGCAATC GCACCTTCAC CCATGCCTTG GGCGAAATCA GCATCCCCAA TGTTCCGCAA CGGGTGATCG CCCTTGATTG GATGTATTTG GAAGATGTGT TGGCCTTGGG TGTGCAACCA GTCGGTGCGA TCGACTTAGA AAATTACCCC AAGTGGGTCG ATTTGCCGTT GACAATTGAT CCCAGTGTCG TTTCAATCGG CGCAAATCCA GCACCCGATT TTGAATCAAT TGCTGCTTTG AAGCCTGACC TGATTTTGGT TGGCTCGTTG CGGGGCGAAA CGATTTACGA TCAATTGAAT GCCATCGCCC CAACCATGAT GTTCAATCCC TATTTGAAAG AGGGCGAAGG CCAGCCATAC GACGAAATGA CCACGACGTT CAGCACAATT GCCGCAGTGT TGGGCAAAGA AACCGAAGGC GCAGCCGTGT TGAGCAAAAT GGAACAAACC TTTGCTGATT CCAAAGCACA ACTGGCAGCA GCTGGTTTGG CCGATGCTAA CGTGTTGGTG GCGCAAACCT ATGCCAGCAA CAACGCTGCT GAAGTGCGTT TGTTCACCGA TAACGCCGCC GTCATGGAAA TTATCAAGCG TTTGGGCTTG AAAAACGCTT GGTCAGACCC AACCTATCAA GTTTGGGGCT TCTCATCGGT TGGCACTGAA GCGTTGGCTC AGTTTGGTGA TGTGCATATG CTTTACATAA CCGAAGAAGA TAATGATCCC TTCCAGAGCG CCGCGATCAA GCCCTATTGG GATAGCTTGG AGTTTGTCAA AGCTGGCCAA GCACATACCT TGGATAGCCA AACTTGGACC TTTGGTGGCC CAATTGCCGC CGAAGTGTTT GCCCAACGGG TCACCGAAGC GCTCTTGTCG GAAAGCAACT AA
Protein sequence | MVRFTRLLSC GLLLTVLAAC GGAASTPTSA PAATATTAPT EAAAATATTA PTEAAVVEAT ATAASDTTTT SGNRTFTHAL GEISIPNVPQ RVIALDWMYL EDVLALGVQP VGAIDLENYP KWVDLPLTID PSVVSIGANP APDFESIAAL KPDLILVGSL RGETIYDQLN AIAPTMMFNP YLKEGEGQPY DEMTTTFSTI AAVLGKETEG AAVLSKMEQT FADSKAQLAA AGLADANVLV AQTYASNNAA EVRLFTDNAA VMEIIKRLGL KNAWSDPTYQ VWGFSSVGTE ALAQFGDVHM LYITEEDNDP FQSAAIKPYW DSLEFVKAGQ AHTLDSQTWT FGGPIAAEVF AQRVTEALLS ESN
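
The coordinate and length fields in this record can be cross-checked with a short calculation. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2024118
END_BP = 2025209


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1092 bp) and Protein Length (363 aa) fields listed in the record.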