Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1525 |
Symbol | |
ID | 5733412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1778227 |
End bp | 1779297 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278665 |
Product | PTS system, mannitol-specific IIC subunit |
Protein accession | YP_001544297 |
Protein GI | 159898050 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2213] Phosphotransferase system, mannitol-specific IIBC component |
TIGRFAM ID | [TIGR00851] PTS system, mannitol-specific IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.003101 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACAGC ATATTCAGCG CTTCGGCAGC TTCTTGGCTG CAATGGTTGT TCCCAATATC GGTGCGTTCA TCGCTTGGGG CTTAATCACC GCTCTGTTCA TCCCAACTGG TTGGATTCCG AACGAAACTT TTGCCAAGCT CGTTGGCCCA ATGATTACTT ACCTTTTGCC CACCTTGATT GGTTATACTG GCGGTAAGAT GATTCACGAT ACCCGTGGCG GGGTGGTTGG GGCCGCCGCA ACCATGGGTG TAATCGTCGG AGCCGATATT CCAATGTTTA TCGGGGCCAT GGTCATGGGG CCACTTGGCG GCTACCTGAT GAAGAAAATT GACGAATTCC TCGAAGATCG CGTGCCAACT GGCTTTGAAA TGTTGGTCAA TAACTTCTCG GCAGGCTTCC TTGTGATGGG CTTGGTGCTA TTGGCGAATG CGGCGATTGG CCCTGTTGTC CAAGTGTTAA GTGAGGGCTT AGGCTCTGGC GTTCAATTCC TGATCGACTC ACGCTTGTTG CCTGTGGCCG ATATTATTAT TGAACCAGCC AAAGTACTCT TCCTCAATAA CGCCATCAAT CACGGGATTC TTGGCCCCTT AGGCGTGGCT GAAGTTGGCG AAAATGGTAA AGCGATTCAC TTTTTGCTCG AAAGTAACCC CGGCCCTGGC TTAGGCTTGC TCTTGGCCTT CTGGTTTGCT GGCAAAGGCA ACCTCAAAGA ATCAGCACCA GGTGCTGCAG TAATTCACTT CCTCGGTGGG ATTCACGAAA TCTATTTCCC CTACATTTTG GCTCGTCCTG TGATGATTGT GGCAATGTGG GCTGGCGGGA TCTTAGCCGA CCTGTGGTTT GTGATTTTTG ATGCAGGTTT GGTTGCTACG CCATCACCTG GCTCGATCTT TGCCTACTTG GCGGTTATTC CACGTGGCGG TCACTTTGCG GTCTTAGGTG GGGTTTTCGT CGGGATTGTT GGCTCGTTTG TGGCTGGCTC GATGTTGCTC AAACTCTTCC CGGGCGAAGA AACTGTGGCA GATGAGCAAA CCAGTTCAGA AATGACTGGC ACCGCTGCAA CCGCCCACTA A
|
Protein sequence | MRQHIQRFGS FLAAMVVPNI GAFIAWGLIT ALFIPTGWIP NETFAKLVGP MITYLLPTLI GYTGGKMIHD TRGGVVGAAA TMGVIVGADI PMFIGAMVMG PLGGYLMKKI DEFLEDRVPT GFEMLVNNFS AGFLVMGLVL LANAAIGPVV QVLSEGLGSG VQFLIDSRLL PVADIIIEPA KVLFLNNAIN HGILGPLGVA EVGENGKAIH FLLESNPGPG LGLLLAFWFA GKGNLKESAP GAAVIHFLGG IHEIYFPYIL ARPVMIVAMW AGGILADLWF VIFDAGLVAT PSPGSIFAYL AVIPRGGHFA VLGGVFVGIV GSFVAGSMLL KLFPGEETVA DEQTSSEMTG TAATAH
|
| |