Gene Information |
Locus tag | Haur_4681 |
Symbol | |
ID | 5736528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5978106 |
End bp | 5979182 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281845 |
Product | solute-binding protein |
Protein accession | YP_001547440 |
Protein GI | 159901193 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | |
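
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(5978106, 5979182)
print(length, protein_length_aa(length))  # 1077 358
```

Under these assumptions the computed values agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.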
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGCA GGTTTGTGCT ATTGGTACTT TTGCTGGTGC TAGTTGGATG TGGTGGAGCG GAAGCGCCCA AGGCCAAAAT TATTGCGCTG TTGTTGCCAG AGCAAACCAC CAAACGCTAT GAAACTGTGG ATCGGCCATG GTTCGAGCGC GAAATGAATT TGCTCTGCAA CGATTGTCAG GTGTTGTACT ACAACGCCCA AAATAACCCA AGCTTGCAAC AACAGCAGGC CGAAGAGGCG ATCAAGGCTG GGGCACGGGT GTTGGTGCTT GATCCAGTCG ATTCGGTCGA TGCTCGCACG ATTGCTGATC ATGCTGCTGA GCAATCGATT CCAGTTGTGG CCTATGATCG GCTGATTCTT AATTCGCCTG GCGTTACGGC CTATATCTCG TTTGATAACC AAAAAATCGG CGAATTACAG GCCGAAAGCT TGATTGCCGG CCTTGCCGCG CGTGGCCTGA GTAATCCCAA AATTCTGCTG CTTCACGGCT CGCTGAGCGA TAACAATGCC AGCGAGTACA AGCGTGGAGC CAAAAAGGTT TTTGATCCAT TAGTCGCTGC TGGCAAATTG ACGATTATTG GCGAGTTTGA TACGCCCGAT TGGAAGCCCG CCGAGGCGCA GCAATATGTT GAACGAATGC TGGCAGCTGG CGATCAGATC GACGGAATTT ATGCGGCCAA TGATGGTACT GCTGGTGGTG CATTAGTCGC TGTCCAAGCA GCCAAGCTTG AGCCATTGCC CTTGATTACT GGCCAAGATG CTGAACTCAC CGCCGTGCAG CGGATCATCA CTGGTGAGCA ACATATGACG GTGTATAAGG CAATTCGGCC TCAAGCCGAG GCTGCCGCCA AAATCGCTCA TGCCTTGATG GTTGGTCAGC CCATTCCAAC CAACTTGGTC AACAATCGCA CGGTGGCAAA CGGAATTATG AATGTTCCAG CGATTTTGCT CAATCCAGTT GTGGTCACCA AAGCCACGGT CAAAGATACC ATTGTGAGCG ATAATTTTTG GTCGCCGCAA CAACTTTGCC CAGTCAAATT AGTACCCGCT TGTGAAGCAG TAGGCATCAA GCCCTAA
|
Protein sequence | MRRRFVLLVL LLVLVGCGGA EAPKAKIIAL LLPEQTTKRY ETVDRPWFER EMNLLCNDCQ VLYYNAQNNP SLQQQQAEEA IKAGARVLVL DPVDSVDART IADHAAEQSI PVVAYDRLIL NSPGVTAYIS FDNQKIGELQ AESLIAGLAA RGLSNPKILL LHGSLSDNNA SEYKRGAKKV FDPLVAAGKL TIIGEFDTPD WKPAEAQQYV ERMLAAGDQI DGIYAANDGT AGGALVAVQA AKLEPLPLIT GQDAELTAVQ RIITGEQHMT VYKAIRPQAE AAAKIAHALM VGQPIPTNLV NNRTVANGIM NVPAILLNPV VVTKATVKDT IVSDNFWSPQ QLCPVKLVPA CEAVGIKP
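
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative example rather than the database's own method: it assumes the sequence is given 5' to 3' on the coding strand in space-separated blocks, as displayed above, and the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code, so the standard codon layout is used; alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Standard codon layout (bases ordered T, C, A, G); table 11 uses the same
# amino acid assignments and differs from table 1 only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon and stop at the first stop codon.

    Simplification: ambiguous bases (e.g. N) raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence shown above.
print(translate("ATGCGACGCA GGTTTGTGCT"))  # MRRRFV
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the result to be compared with the Protein sequence and GC content fields of this record.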