Gene Information |
Locus tag | Haur_4714 |
Symbol | |
ID | 5736557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6022295 |
End bp | 6023746 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281878 |
Product | PTS system, fructose subfamily, IIC subunit |
Protein accession | YP_001547473 |
Protein GI | 159901226 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component; [COG1445] Phosphotransferase system fructose-specific component IIB |
TIGRFAM ID | [TIGR00829] PTS system, fructose-specific, IIB component; [TIGR01427] PTS system, fructose subfamily, IIC component |
|
|
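The coordinate and length fields above can be checked against each other with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(6022295, 6023746)
print(length_bp)                  # 1452, matching "Gene Length"
print(protein_length(length_bp))  # 483, matching "Protein Length"
```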
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGCA TTGTTGCGAT CACCTCATGT CCAACTGGTA TTGCCCACAC ATTTATGGCC GCCGAAGGGG TTCAACGTGG CGCAGAAGCC CTTGGTCATC AGATTAAAGT TGAAACCCAA GGTTCAGTTG GTGCGCAAAA TGTGCTGACT GAGGCCGATA TTCGTGAAGC CGATTTGGTG ATTATCGCCG CTGATACCAA AGTTGATCTT GGTCGCTTTG TAGGCAAGCG GGTTTATGAA ACCTCGACCA AAGCAGCGAT TACCGATGGT CAGGGAATGG TCAAAACCGC CTTTGATCAA GCCAAAACCT ATAACGCTAG CGGTTCAGCC AACCTTGCTG ATACCGTCGA TGATCTCAAA GCTCAACGCT CGGCCAGCCG CTCCGGTCCC TACAAACACC TAATGACTGG GGTTTCGTAT ATGCTGCCGT TTGTGGTTGC AGGCGGCTTG CTGATTGCCT TGGCTTTTGC GATTGGCGGC ATTTATGTAT ATGAAGATCA ATATGCCAAC ACGCTTGGTT GGTCGTTATT CCAAATTGGG GCTAAATCGG GCTTTGCGCT GATGGTGCCA ATTTTGGCGG GCTTTATTGC CTTCTCAATC GCTGATCGAC CTGGCTTAGT GCCTGGAATG ATTGGCGGGA TGTTGGCCAG CAGCAATGGC TCCGGCTTCT TGGGTGGCAT CATTGCTGGT TTTATTGCGG GCTACGCTAC CGATTGGCTG AACACCAACA TTCGTTTGCC AAAAACCTTG GCTGGCCTCA AACCTGTGTT AATTCTACCG TTGCTCAGTA CTGCAATCGT CGGTTTGTTG ATGATTTATG TGATTGGCAA GCCCGTTAGT GCGGTTAACA CCAGTTTGAA TGAGTGGCTG ACCGGGTTGC AAGGAACCAA CGCGATTTTG CTGGGCTTGT TGCTTGGGGC AATGATGGCC TTCGATATGG GTGGGCCAGT CAACAAGGCC GCCTATACCT TTGCGGTTGG ATTATTGGCT AGCAATGTGT ATGCTCCAAT GGCTGCCGTA ATGGCTGCTG GCATGACTCC GCCGCTTGGT TTGGCCTTGG CCTCGTTACT CTTCAAAAAT CGCTTTACCG CCGAAGAACA AGAAGCCGGC AAGGCAGCGG CAGTGCTTGG CATTTCGTTT ATCACCGAAG GCGCAATTCC CTTCGCAGCA CGCGATCCAT TCCGCGTGAT TCCAGCGATT ATGCTTGGCT CAGCCGTTAC GGGAGCACTC TCGATGAGCT TTGGGGCAAC CCTGCAAGTG CCACACGGCG GAGTGTTCGT GTTGCCAATT CCCAATGCAG TTGGTAGTTT GGGCTTGTAC ATTGTGGCAA TTTTGGTGGG CACAGTGGTG ACGGCTGCCG CCTTGTATGC GCTCAAACGC CCGTTGCTGC AAACTCCCGT TACTACTAGC GAAACCAATG CTAGTGCAAC TTCAGCGGTC AGCGTGCGCT AA
|
Protein sequence | MARIVAITSC PTGIAHTFMA AEGVQRGAEA LGHQIKVETQ GSVGAQNVLT EADIREADLV IIAADTKVDL GRFVGKRVYE TSTKAAITDG QGMVKTAFDQ AKTYNASGSA NLADTVDDLK AQRSASRSGP YKHLMTGVSY MLPFVVAGGL LIALAFAIGG IYVYEDQYAN TLGWSLFQIG AKSGFALMVP ILAGFIAFSI ADRPGLVPGM IGGMLASSNG SGFLGGIIAG FIAGYATDWL NTNIRLPKTL AGLKPVLILP LLSTAIVGLL MIYVIGKPVS AVNTSLNEWL TGLQGTNAIL LGLLLGAMMA FDMGGPVNKA AYTFAVGLLA SNVYAPMAAV MAAGMTPPLG LALASLLFKN RFTAEEQEAG KAAAVLGISF ITEGAIPFAA RDPFRVIPAI MLGSAVTGAL SMSFGATLQV PHGGVFVLPI PNAVGSLGLY IVAILVGTVV TAAALYALKR PLLQTPVTTS ETNASATSAV SVR
|
| |
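The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch, not the pipeline used to produce the record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons), and it handles only the plain case where the CDS starts with ATG, as this one does. The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon-to-amino-acid mapping, listed in TCAG order
# (first base, then second, then third); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, comparable to the 'GC content' field."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCACGCA TTGTTGCGAT CACCTCATGT"))  # MARIVAITSC
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and GC content fields listed in the record.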