Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1033 |
Symbol | |
ID | 5732937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1178881 |
End bp | 1180164 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278168 |
Product | ABC transporter related |
Protein accession | YP_001543809 |
Protein GI | 159897562 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGA TTGCGATTCA GGTTGAGCAT TTAAGTAAGC AATTTCGCAT TGGGGCGAAT GCATTTCAAA AAACCTTTCG CGAAACAGTG TCGGATATGA TGTTAGCGCC AGCCCGCCGC TTGCGTTCAG CCATGCGTGG TCAAGGCAGC CAAGCTGCGA CCAAGGAATT TTGGGCGCTG CGCGATGTTT CATTCGATGT TAAGCAGGGC GAAGTCGTTG GGATTATCGG CCACAACGGC GCGGGCAAAA GTACCTTATT GAAAGTGCTT TCGCGGATTA CCGAGCCAAC CACTGGCTCA GCCTTGATTC GTGGGCGGGT TGGCACGCTG CTTGAAGTTG GCACGGGTTT TCATCCCGAC CTCACAGGTC GCGAAAATAT GTATCTCAAC GGAGCGATTT TGGGCATGAG CCGCGCCGAA ATTAACCGTA AGTTCGATGA AATTGTGGCC TTTGCTGAGG TCGAGCAATT TATCGACACC ATGGTTAAAC ATTATTCCAG CGGCATGTAT TTGCGCTTAG CCTTTGCGGT GGCCGCCCAC CTTGAGCCAG AAATTCTGAT TGTCGATGAG GTTCTGGCGG TTGGCGATGC TCAATTTCAA AAGAAATGCT TGGGCAAAAT GGGCGAAGTT GCCCAGAATG GCCGGACTGT GCTCTTTGTA AGCCACAATA TGGCGGCAAT TCGCAGCTTA TGTCAACGAG TGGTTTGGCT CAACCAAGGC ACTGTGCTCA AAGATGGCCC TTCGGCAGCA ATTGTCAACG AATATTTGAT GCAAACCGTC ACTTCAAGCA GCTCTCGCGT CGATCTGCGC GAGGCAACTC GCCATTATGA TTATGGTAAG CGCTTTAAGA TTAATGACCT GACCTTTAAC CACGGCGAGC CAATTTTGCA TGGCGAGGTG CTCAACGTCA GCTTCAACTA CGAAGCCTAT GCCGATGTCT ATGGCGTGTC GTTTGGCTAT GGGTTTTCCT CACTCGAAGG CACGCGCCTA ATGACGGTCG ATAGCGATTT GAATGCACCA CGCTACCTTA TTAAGCGCGG CCAGCACGGC CAACTCACCA GTACCCTCGA TACGCTCAAC TTGCAACCTG GCATCTATTT GCTTGATGTT GGGGTGCGTT CAGGTGATGG TAGTGCGTTG GATTACCTGC CTGGATGTGC TCAGGTTGAA ATTTTGCCTG GCCCGACTAC CCCAGCTTCA ATGAGCCGCT TGGATTATGC CGGTAATGTA CGGCTTGGCG GGCAGTGGCA ATGGCCTGAG ACGGCCCAAG AGGATCGCGA TTAA
|
Protein sequence | MSEIAIQVEH LSKQFRIGAN AFQKTFRETV SDMMLAPARR LRSAMRGQGS QAATKEFWAL RDVSFDVKQG EVVGIIGHNG AGKSTLLKVL SRITEPTTGS ALIRGRVGTL LEVGTGFHPD LTGRENMYLN GAILGMSRAE INRKFDEIVA FAEVEQFIDT MVKHYSSGMY LRLAFAVAAH LEPEILIVDE VLAVGDAQFQ KKCLGKMGEV AQNGRTVLFV SHNMAAIRSL CQRVVWLNQG TVLKDGPSAA IVNEYLMQTV TSSSSRVDLR EATRHYDYGK RFKINDLTFN HGEPILHGEV LNVSFNYEAY ADVYGVSFGY GFSSLEGTRL MTVDSDLNAP RYLIKRGQHG QLTSTLDTLN LQPGIYLLDV GVRSGDGSAL DYLPGCAQVE ILPGPTTPAS MSRLDYAGNV RLGGQWQWPE TAQEDRD
|
| |