Gene Information |
Locus tag | Haur_3659 |
Symbol | |
ID | 5735520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4599898 |
End bp | 4601037 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641280808 |
Product | multiple sugar-binding periplasmic receptor ChvE precursor |
Protein accession | YP_001546423 |
Protein GI | 159900176 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | |
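
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the source record. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 4599898
END_BP = 4601037


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Length in amino acids for an unspliced CDS.

    Assumes the gene length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # compare with the "Gene Length" field
    print(protein_length(length_bp))  # compare with the "Protein Length" field
```

Under these assumptions the computed values match the Gene Length (1140 bp) and Protein Length (379 aa) fields listed in the record.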
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATGA AGTCTCCTCG CGCGAAGTTC ATGATGGTTT TGATGTTGCT CATGACAATG GTTTTGGCCA GCTGTGGCGA AGCCGCAACT CCAACCACTG CGCCAACCGA TGGCGGCACT GGTGGCACAA CCGCCAGCAG CGATAACAGC AAATTCACCA TTGGGATCTC AATGCCCACC AAATCATCAG CCCGCTGGAT TGCCGACGGC GATAATATGG TGAAATATTT CCAAGAAAAA GGCTTCAAAA CCGACCTTCA ATACGCTGAA GACGATATCC CAACCCAACT TTCACAAATT GAAAACATGG TCACCAAAGG GGTCAATGTT TTGGTGATTG CGGCAATCGA TGGCGAAACC CTCTCAGATG TTCTGCAAAG TGCTAAAGAC AAGAAAATTC TGGTTATCGC CTACGACCGC TTGATCAAGA AAACCCCCAA CGTTGACTAC TACGCCACCT TCGATAACTT CCAAGTTGGG GTCTTGCAAG CCCAATCAAT CGAAACCAAA CTTGGTTTGA AAGAAGGCAA AGGCCCATTC AATATCGAGT TGTTCGGTGG CTCATCCGAC GATAACAACG CCTTCTTCTT CTACAATGGC GCTATGTCGG TCTTGCAACC TTACATCGAT AGCGGCAAGT TGGTGGTTGG TAGCGGCCAA ACTGGTATGG ATAAAGTTGC CACCCTGCGC TGGGACGGTG CTACCGCCCA ATCACGCATG GACAACATTT TGAGCGCCTT CTACGGCGAC AAACGGGTTG ATGCAGTGCT TTCACCATAC GATGGTATCA GCATCGGGAT CATCTCATCG CTCAAGGGTG TTGGCTACGG TAGCGCCGAC AAACCAATGC CAGTCGTTTC AGGCCAAGAT GCTGAAGTGC CCTCAGTCAA GTCAATTATC GCTGGCGAAC AAAGCTCAAC CATCTTCAAA GATACCCGCG AGCTTGCTAA ATCAGTGGTA GGTATGGTCG AAGCATCATT GTCAGGCAAA GAAGTTGCCG TCAACGATAC CAAAACCTAT GACAATGGGG TCAAAGTTGT TCCTTCACAA TTGCTGGTTC CGGTGGTCGT CGATGTAACC AACTGGGAAA AAGTATTGAT CGACAGTGGT TACTACAAAA AAGAAGACAT AACCAAATAA
|
Protein sequence | MTMKSPRAKF MMVLMLLMTM VLASCGEAAT PTTAPTDGGT GGTTASSDNS KFTIGISMPT KSSARWIADG DNMVKYFQEK GFKTDLQYAE DDIPTQLSQI ENMVTKGVNV LVIAAIDGET LSDVLQSAKD KKILVIAYDR LIKKTPNVDY YATFDNFQVG VLQAQSIETK LGLKEGKGPF NIELFGGSSD DNNAFFFYNG AMSVLQPYID SGKLVVGSGQ TGMDKVATLR WDGATAQSRM DNILSAFYGD KRVDAVLSPY DGISIGIISS LKGVGYGSAD KPMPVVSGQD AEVPSVKSII AGEQSSTIFK DTRELAKSVV GMVEASLSGK EVAVNDTKTY DNGVKVVPSQ LLVPVVVDVT NWEKVLIDSG YYKKEDITK