Gene Information |
Locus tag | Haur_2119 |
Symbol | |
ID | 5734007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2661056 |
End bp | 2662570 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279260 |
Product | ABC transporter related |
Protein accession | YP_001544887 |
Protein GI | 159898640 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.307687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATT CGCAGAGTCT TATCACCGTC AGAGGTATTT CCAAAGCGTT TCCTGGTGTC GTCGCCCTTG AACGTGTTGA TTTTACGCTG AGGAAAGGCG AAATCCACGC GCTCATGGGC GAAAATGGTG CCGGAAAATC TACCCTGATC AAAATTCTCA CTGGGGTGGA GCAGCCCGAC ACGGGCAGTA TTGAATTAGA GCACACGCTT ATCCGCATTC GGTCGCCGCA GCATGCCCAA CAACTTGGCA TCAGCACGGT GTATCAAGAG ATCAATCTGT GCACCAATAT TTCGGTCGCC GAAAATATTA TGCTCGGCCA TGAGCCACGG CGCTTTGGTG GCATCCACTG GAAAAAGCTG AACGAAATGG CCCGCCAAGC GCTCCTGCGC TTAGGCATCG AGCTGGATGT GACCCAACCA CTGGGAATGT ACTCGATCGC GATCCAGCAA ATGGTCGCGA TTACCCGGGC GCTTGAGATC GCGTCGGCCA AAGTATTGAT TCTGGATGAA CCGACATCCA GCTTGGATGC CAACGAAACT GCACGGCTTT TTGCTGTGAT GCGGGCGCTC AAGCAAGCGG GCATCGGGAT TGTCTTTGTC ACGCATTTCC TCGACCAAGT GTACGAAATC GTTGATCGAG TGACGATTCT GCGCAACGGC AGTTTCGTTG GCAGCTACGA CATTGCCGAT CTCCCACGGG TTGAACTGGT AGCAAAAATG CTTGGTCGCG TAGTTGGCGA ATTGCAAGCA TTGGCCCAAG ATAAAGCGCG AGGTGAGCGT CAGCCCGATG AACGACGGCT GATCGAAGCG GCTGGTCTGG GCCTGAGCGG CATGCTGGAG CCGCTTGATC TGGCTATTAA CGTCGGCGAA GTGCTGGGGA TTGCAGGGTT GCTCGGCTCC GGACGCAGCG AACTGGCAAG TTTGCTGTTT GGTCTGACTT CGCCTGATAG TGGCACGTTG CTGATCGATG GGCAGTCAGT TACGCGGTTT TCCCCACAAG AATCGATCAA GCGTGGGGTC GCTTTGTGTC CCGAGGATCG CAAGGCTGAG GGTATTGTAG GCGATCTTAG CATTCGCGAA AATATTATTT TGGGCTTGCA AGGGCGCTAT GGCTGGTTCA AATTCATTAA CAGGCAGAAG CAGGATGAAA TCGCCGAGAA ATACATCAAG CTGCTTGGCA TTCGCACGCC ATCGCCCGAC CAGCTGGTCA AAAACTTAAG TGGCGGCAAT CAACAAAAGG TCATCTTGGC CCGCTGGCTC GTTACGCAGC CACGCCTGCT GATTTTGGAT GAGCCGACAC GGGGCATTGA TGTCGGTGCC AAGGCTGAGA TTCAAAAGCT GGTACTGGAA CTCGTCAAGG ATGGCATGTC AATCGTCTTT ATTTCCTCTG AGTTGGAAGA GGTGGTACGT ATCAGCGACC GGATTGTGGT CTTGCGCGAC CGCGCGAAAG TGGCCGACTA CGACCACGCG GTGAGTGATC GGACGCTTAT TCAAACGATG GCAGGTGAGG CATGA
Protein sequence | MADSQSLITV RGISKAFPGV VALERVDFTL RKGEIHALMG ENGAGKSTLI KILTGVEQPD TGSIELEHTL IRIRSPQHAQ QLGISTVYQE INLCTNISVA ENIMLGHEPR RFGGIHWKKL NEMARQALLR LGIELDVTQP LGMYSIAIQQ MVAITRALEI ASAKVLILDE PTSSLDANET ARLFAVMRAL KQAGIGIVFV THFLDQVYEI VDRVTILRNG SFVGSYDIAD LPRVELVAKM LGRVVGELQA LAQDKARGER QPDERRLIEA AGLGLSGMLE PLDLAINVGE VLGIAGLLGS GRSELASLLF GLTSPDSGTL LIDGQSVTRF SPQESIKRGV ALCPEDRKAE GIVGDLSIRE NIILGLQGRY GWFKFINRQK QDEIAEKYIK LLGIRTPSPD QLVKNLSGGN QQKVILARWL VTQPRLLILD EPTRGIDVGA KAEIQKLVLE LVKDGMSIVF ISSELEEVVR ISDRIVVLRD RAKVADYDHA VSDRTLIQTM AGEA
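The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers introduced here for illustration only.

```python
# Values copied from the record above.
START_BP = 2661056
END_BP = 2662570


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1515
print(expected_protein_length(length_bp))  # 504
```

Under these assumptions the computed values agree with the Gene Length (1515 bp) and Protein Length (504 aa) fields. Passing the Gene sequence field to `gc_percent` should give a figure comparable to the reported GC content, although the rounding used by the source database is not stated in the record.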