Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4338 |
Symbol | |
ID | 5736198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5545782 |
End bp | 5547017 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281499 |
Product | inner-membrane translocator |
Protein accession | YP_001547098 |
Protein GI | 159900851 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4214] ABC-type xylose transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA CGCAACCGAC GAATCTTGCT GCTGCGCCCA ACAATTGGCG CGAAGCATTA GCCGAACGCT TTCGCCAAGG CAACCTAGGC TCGCTGCCAG TGATTATTGG CTTGATTATT ATTGCATTGG TGTTTCAAAG TATCAATAAA AACTTTCTTA CGCCGCTCAA CCTGACCAAC TTGATGGTGC AAATCGCCGC GATGGGCACA ATTTCAACGG GAGTTGTGCT AATTTTGCTG CTAGGTGAGG TTGATCTTTC GGCAGGCCAG GTTAGTGGCT TGGCGGCGGC GGTGATGGCG GTGTTGGTTT CGCGCCATAA TTTGCCTGCT GCCGTGGCGA TTATTGGGGC AATTGTGGTT GGAGCCTTGG TTGGCTTGTT GCAAGGTTGG TGGATTTCAA CCTTCCGCGT GCCCTCGTTT GTGGTTACGC TTGCAGGTTT GCTAGCTTGG CAAGGCTCGC GATTACGAGT GCTTGGTGAC ACAGGCAGCA TCAACATCAC CAATAAATTT ATTAATGATA TTGCCAACTA TAAACTGCCA ATTTGGCTTG GCTGGGTTTT GGGGATTGTC AGTGTTGTCG TTTATACCTT GATTGTGTTC AACGAATATC GCAGCCGCCG CGCTGCTGAA TTGCCAACTG GCTCGTTGAA CGGCGTATTT TGGCGGGTTG GGGTGGTTGG CGCGAGTGTC TTGGCGGGTG TTGCGATGAT GAGCGTCAAC CGCAATGCCA ACGCTGCGGG TAACCCAATC CAAGGCGTGC CTAGCGCAGT CATTATCTTC CTAACGTTTT TGATTATCTT TGATTTTATT ACCCAGCGCA CCCGTTTTGG CCGCTATGTT TATGCGGTTG GCGGCAATAC CGAAGCTGCT CGCCGCGCAG GAATCAATGT TAATCGGATT CGGATTACGA TTTTTATGCT GGCTTCGGCA CTCGCCGCCT GTGGCGGGAT TTTGGCCGCC TCACGTTTGA ACGCGGCCAA TCAATCATCA GGCGATGGCG ACGTGCTTTT GAACGCAATC GCCGCAGCGG TGATCGGTGG CACCAGCCTG TTTGGTGGGC GTGGCCGAAT TTGGTCGGCT TTGCTGGGCG CACTAGTGAT TGGGGCAATT GCCAATGGCA TGGATTTACT GGCGCTCAAG TCATCGATCA AATTTATCGT AACTGGCTCA GTGCTTTTGT TGGCAGTCAC GATTGATGCC GCCTCACGCG CCCGCCGCGA AAACAGTGGA CGTTAG
|
Protein sequence | MMKTQPTNLA AAPNNWREAL AERFRQGNLG SLPVIIGLII IALVFQSINK NFLTPLNLTN LMVQIAAMGT ISTGVVLILL LGEVDLSAGQ VSGLAAAVMA VLVSRHNLPA AVAIIGAIVV GALVGLLQGW WISTFRVPSF VVTLAGLLAW QGSRLRVLGD TGSINITNKF INDIANYKLP IWLGWVLGIV SVVVYTLIVF NEYRSRRAAE LPTGSLNGVF WRVGVVGASV LAGVAMMSVN RNANAAGNPI QGVPSAVIIF LTFLIIFDFI TQRTRFGRYV YAVGGNTEAA RRAGINVNRI RITIFMLASA LAACGGILAA SRLNAANQSS GDGDVLLNAI AAAVIGGTSL FGGRGRIWSA LLGALVIGAI ANGMDLLALK SSIKFIVTGS VLLLAVTIDA ASRARRENSG R
|
| |