Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2204 |
Symbol | |
ID | 5734091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2800811 |
End bp | 2801818 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279345 |
Product | monosaccharide-transporting ATPase |
Protein accession | YP_001544972 |
Protein GI | 159898725 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00482964 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGC CAACCAACAA GCCGCAAGTT CTGGTAAACT CCGTCGGATA TTCCCGCCGT GATTTTCTGC GTTTCGGGGC GTTTACCACT GTCGGGCTAC TCGCTGGCTG TGGAACCACC AGCGCCACGC CGGTAAGCAC CGCCACCCCA ACTAAGCCCA AAATTGGCTT GGTCATGAAA TCGCTGAGCA ACGAGTTTTT CCAACAAATG CAGGCTGGCG CTGAAGAATA TGCCCGCCAA AATGCAACCC GCTTTGATTT CACCGCCACC GGCATTAACG ATGAGCGCGA TTTCGCCACC CAAATTGCTT CATTCGAGCG CTTAGTTAAT GAGCAATATG ATGTGATTGT GTTGGCTCCC GCCGATTCGA TTGCCTTGGT TGCCCCGGTT GCCAAAGCGG TCAAAGCTGG CATCGTCGTG ATCAATATCG ACGTTGCGCT TGATGAAGCA ACCAAAAAAG CTGCTGGCAT CGATCTGGCC TTTTTTGGCC CCGATAATCG TGCCGGAGCC AAAATGTCGG GCGAGGTGCT GGCCAAAGCC TTAGGCGCAG GCGGCAAAGT CGCAGTGCTC GAAGGCAATC CAGAGGCCGA TAATGCCGTC CAACGCCGCT TAGGCTTCGA CGATGCCATC GCCGATGGCA GCTTAAATTT AGTTGTTGCT GAAAGTGGCC ACTGGGAAAC CAGCGAAGGC CAAAGCATTA CCGCCGCATG GCTCAAAAAA TATCCCGACT TACAAGGCAT CATGTGCGCC AACGATTCAA TGGCTTTTGG GGCAGTCCAA GCACTCGAAG CCGCCAATCT GCTCGATAAA ATCAAAGTCG TGGGCTTTGA TAACATTCCT GCTGTGCAAC CCTTGATCAA AGATGGTAAA ATGCTGGCGA CCGTTGAACA ATACGGTGCG CAAATGGCAG CAATCGGTAT GGATTATGGC TATCGCACGC TCAAAGGCGA GAAATTTAGC GGCTGGATTC GCACCGAGCT AAAATTAATC ACTAAAGATA ATCTCTAG
|
Protein sequence | MEKPTNKPQV LVNSVGYSRR DFLRFGAFTT VGLLAGCGTT SATPVSTATP TKPKIGLVMK SLSNEFFQQM QAGAEEYARQ NATRFDFTAT GINDERDFAT QIASFERLVN EQYDVIVLAP ADSIALVAPV AKAVKAGIVV INIDVALDEA TKKAAGIDLA FFGPDNRAGA KMSGEVLAKA LGAGGKVAVL EGNPEADNAV QRRLGFDDAI ADGSLNLVVA ESGHWETSEG QSITAAWLKK YPDLQGIMCA NDSMAFGAVQ ALEAANLLDK IKVVGFDNIP AVQPLIKDGK MLATVEQYGA QMAAIGMDYG YRTLKGEKFS GWIRTELKLI TKDNL
|
| |