Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0298 |
Symbol | |
ID | 5732193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 355734 |
End bp | 357194 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277422 |
Product | extracellular solute-binding protein |
Protein accession | YP_001543078 |
Protein GI | 159896831 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGCT CACGCCGTCC GTTTATTACT TGGCTAAGCC TCTTAACTCT GATTTCAATG ATCTTGGTTG CTTGTGGTGG AGCTGAGACC GCTCCCACTG CAACCACTGC TCCAGCTGCT ACCGCTACGA CTGCGGCTCA AGTTGAACCA ACTGCTGCTG ATGCTCCAAC CGAAGTTGCC GCAACCGCTG AACCAGCAAC CGCTGAGCCA GCTGGTGGCG ATGTTGTTAC CTTGAAGCTG TGGCACATGC CTAACGGTGC TGCTCCAGCC GATGCCATCC AAGCTGAAAT CGATGCCTTC GAAAAAGCTA ACCCCGGCAT TAACGTTGAG CCAGAAATGC TCGACTGGGG TGCAGCTTTC CAACGCATCC AAACCTCGGT TCAAGGTGGT GAAGGCCCAT GTATTACCCA ATTGGGTACG ACCTGGAACC CAACCTTTGC GGCAATGGGT GGCTTACGCC CATTCACCGA AGAAGAAATC ACCGCTATGG GTGGCAGCGA TAGCTTCGTC GCCGCTTCAT GGGCAACCTC ACAATTGCAA GGTATGGAAG GTACATTCTC GATTCCATGG TTTGCTGATG TTCGCGCCTT GGCCTATCGC AAAGACTTGT TGGAAAAAGC TGGCTTGAAG CCAGAAGAAG CCTTCAAGGA TTGGGCCAGC TTCAAAACGA CCTTGGCTAC CATCCAAGAA CAAAATCCTG ACGTTGCTGG GATCGCTTTC CCAGGCCGCA ACGACTGGAA CGTTTGGCAA AATAGCTCAA TGTGGATTTG GAACAGCGGT GGCGATTTGT TGAGCAGCGA TCTCAGCGAA GCAACCTTCA ACTCAGAAGC AGCTGTGGCT GGGGTTTCAG AATTTGCTAA CCTCTACAGC AGCAAATTGA CCGTTACCAA TACCTTGGAA TTGAACTCAG CCCAAGTTGA TGCCAGCTTT GGCGATGGCC GCACCTTTAG CGCAATCACT GGCCCATGGT TGATCAGCAA CGCTCGCACT GCTGCCGACG CTGGTGGCTG GGCTAACCGC ACCGTGGCCG ATAACTTGGC CTACGCCGAA TTCCCAGCTG GCCCTGGTGG CTCATACACC TTCGTTGGTG GTAGCAACTT GGCCATCTTG AAGAGCTGCG AAAACGCCGA TGCCGCTGTC AAGTTCGTGC AATTCTTGGC TGCTAACGAA TCACAATTGC GCTATAGCCA AGCCATCGGG ATGTTGCCAG CAACCAAGAC CGCTCAAGCC GATGCTAGCA TCGCCAGCGA TGCCTTGTAC AGCGTCTTCA TCGCCGCTGC TGCTAAGGGC AAAACCTCAG CTCCAATCGC TGAATGGGGT CAAGTCGAGA GCGTTCTCAA CGAACAACTT GGTTCACTCT GGGACGATGT AGCAACCGCT GGCGGTCCAG TTAGCGCTGA AGTCGTCAAG ACCCGCTTGG ATCAAGCTGC TCAAACTGTC AACGAATTGT TGGGTAACTA A
|
Protein sequence | MQRSRRPFIT WLSLLTLISM ILVACGGAET APTATTAPAA TATTAAQVEP TAADAPTEVA ATAEPATAEP AGGDVVTLKL WHMPNGAAPA DAIQAEIDAF EKANPGINVE PEMLDWGAAF QRIQTSVQGG EGPCITQLGT TWNPTFAAMG GLRPFTEEEI TAMGGSDSFV AASWATSQLQ GMEGTFSIPW FADVRALAYR KDLLEKAGLK PEEAFKDWAS FKTTLATIQE QNPDVAGIAF PGRNDWNVWQ NSSMWIWNSG GDLLSSDLSE ATFNSEAAVA GVSEFANLYS SKLTVTNTLE LNSAQVDASF GDGRTFSAIT GPWLISNART AADAGGWANR TVADNLAYAE FPAGPGGSYT FVGGSNLAIL KSCENADAAV KFVQFLAANE SQLRYSQAIG MLPATKTAQA DASIASDALY SVFIAAAAKG KTSAPIAEWG QVESVLNEQL GSLWDDVATA GGPVSAEVVK TRLDQAAQTV NELLGN
|
| |