Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3641 |
Symbol | |
ID | 5735502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4579266 |
End bp | 4580600 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280790 |
Product | extracellular solute-binding protein |
Protein accession | YP_001546405 |
Protein GI | 159900158 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGTTC GTCGCAAGAT TCATGGTTCG TTAATTAGCC TGTTGGTATT GACGCTGATT TTGGCTGCCT GTGGCAGTGA TAGCACCGCA ACCACTACCA GCAACACCGG CTCGACCAAT CCCGCCGAAG TTAAGGGCGA AATTACGGTT TGGGCTTGGA ATGTGGCCGC CAAGAGCCTC GAAGCAACGG TTCCCTCGTT TAACCAAAAA TACCCCAACG TCAAAGTTAC TGTCCAAGAT ATTGGACGCA CAGATGTCTA CGATAAGCTA ACTTCGGGCT TACAAGCTGG CGGCGCGGGC TTACCCGATG TAGTCGCGAT CGAATCCGAC CGCATGGATG TCTATACGTC AACCTTCCCC GATGGCTTGG CCGACTTAAC CAGCCGCGCC AGCAAATACG AAAAAGATTT CGATCCTTCC AAATGGGCCC AATCCAAAAT TAGCGATAAA ATTCGCTCGA TTCCCTGGGA TTCCGGCCCA ACTGGTTTGT GGTATCGCGT TGATATTTTC GAGCAAGCTG GCGTAGATCC TAAATCGATC GAAACATGGG CCGATTTGAT CGCTGCTGGC GAAAAGATTT TGGCCGCGAC CGATGGCAAA ACCAAGCTCT TGCCAGTCGA TATTGTGGCC GATGATGCTG GCTTCCGCAT GATGACCAGC CAATTGGGCG TATGCTGTTA TTTCAATAAC GATGGCAAAA TCAACCTGAC CAATGACAAA TCGGTACAAG CGCTGACCTT GCTCAAAGAA ATCAACGATA AAGGCTTAGT TGCCAATATC AATGGTTGGG ATGGTACAGT TGCCGCGACC AAGAATGGCG ATGTTGCCAC GGTTCCATTT GGGGTCTGGT ATAGCGGCAC AATCATCGAC CAAGCGCCTG ATCTTTCAGG CAAATGGGAT GTGATGTTGT TGCCAGCCTT CGAAAAAGGC GGCAATCGCG CTGCCAACCT TGGTGGCTCG ACCTTGGCAA TTCCGGCTGC AACCAAAAAT CTCGATGCAG CTTGGTTGTT CGTTGAGCAT GCCTTGGCCA CCAGCGAAGG CCAAAACATT ATGATGGAAA AATTCGGCAT TTGGCCAAGC TATCAGCCAG CCTACAGCGC TGACCTCTAT AGCAAGCCAG TGGCTTTCTT CAACAACCAA CCAATTTGGA AGTTGTTTGC TGATGAAATT AAGAACATTC CACCAGCAAC CTACACCAAA GACTATGCTA AAGGCCAAGC AGTTTTGGCT TCAGCTCAAG CCAAAGTGCT GAGCCAAGGC ATGGACCCCA AACAAGCTCT GCAAGAAGCT GCTGCCGAAT TGGCCAACCA AACTGGCCGC GAAATCGCCC AATAG
|
Protein sequence | MMVRRKIHGS LISLLVLTLI LAACGSDSTA TTTSNTGSTN PAEVKGEITV WAWNVAAKSL EATVPSFNQK YPNVKVTVQD IGRTDVYDKL TSGLQAGGAG LPDVVAIESD RMDVYTSTFP DGLADLTSRA SKYEKDFDPS KWAQSKISDK IRSIPWDSGP TGLWYRVDIF EQAGVDPKSI ETWADLIAAG EKILAATDGK TKLLPVDIVA DDAGFRMMTS QLGVCCYFNN DGKINLTNDK SVQALTLLKE INDKGLVANI NGWDGTVAAT KNGDVATVPF GVWYSGTIID QAPDLSGKWD VMLLPAFEKG GNRAANLGGS TLAIPAATKN LDAAWLFVEH ALATSEGQNI MMEKFGIWPS YQPAYSADLY SKPVAFFNNQ PIWKLFADEI KNIPPATYTK DYAKGQAVLA SAQAKVLSQG MDPKQALQEA AAELANQTGR EIAQ
|
| |