Gene Information |
Locus tag | Haur_2822 |
Symbol | |
ID | 5734703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3589640 |
End bp | 3590734 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279965 |
Product | putative flavin-binding monooxygenase involved in arsenic resistance |
Protein accession | YP_001545588 |
Protein GI | 159899341 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2072] Predicted flavoprotein involved in K+ transport |
TIGRFAM ID | |
| |
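The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that Gene Length includes the stop codon. Both assumptions fit the values in this record but are not stated on the page. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Information table for Haur_2822.
length = gene_length(3589640, 3590734)
print(length, protein_length(length))  # prints "1095 364"
```

The result matches the listed Gene Length (1095 bp) and Protein Length (364 aa).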
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0300735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAATC AAACGATTGT GATTGGGGCT GGTCAGGCTG GCTTGGCGGC TGGCTATTGG CTGCAACAAG CTAAAATTCC ATTTCAAATT ATCGAGGCTC AAGCCAGCGT TGGTGGCTCG TGGCCCGCCT ATTACGATAG TTTGAGCTTG TTTTCGCCTG CGCGATTTTC AAGTTTGCCA GGCATGGCTT TCCCTGCGCC CGCCGATAGC TACCCGCAGC GCGATACTGT TGTGGCCTAT TTGCAGCGCT ATGCTGAACA TTTCAATTTA CCGATTCAGA CGAATACGGC AATTAGTACG ATCGAACCGC AGAATGGTGG CTTTCGCCTG ACAAGTAGTG CGGGACAAGT ATTTCATGCG GGTCAAATTA TTGCCGCAAC TGGAGCGTTT GCGCGGCCAT TTATGCCCGA ATTGCCCAAC CAAGCTGCAT TTCAAGGCAA GATTTTACAT AGTGCTCGCT ATCGCGATTC AGCAGATTTT GTGGGCAAAC GGGTTGTGGT GGTTGGCGCA GGCAATTCGG CAATTCAAAT TGCGATCGAG CTGGCCCAAG TTGCCGATGT GACCCTAGCT ACGCGCCAGC CAATTCGCTT TCAAGCCCAA CGGATTGCAG GCCGTGATAT CCATTGGTGG TGGTGGCTGA CAGGTTTTGA TCGGCTGGCA CTCAACACGC GAGTTGGCCG TTGGATTCAG CAACGAACCC AAGGAATTGT CCTCGATACT GGTCTGTACA GCAGAAAAAT CAATCAAAAC CAACCCCAAC GCCGAGCCAT GTTCGAACGT TTTAGCGCAA CGGGTGTAGT TTGGGCTGAT GGACAGCCCG AAGCAGTTGA CATAGTATTG TTTGCGACTG GCTATCGACC TCATTTGAGC TATTTGCAAG GCCTCAATGC GCTTGATCAA GCTGGCTTGC CCTTGCATCG GGCAGGTGTG AGCACCACAG TTGAGGGCTT GTATTACGTT GGCTTGGAGC AACAAACTAA TTTTGCCTCG GCTACGTTGC GTGGAGTTGG GCCGGATGCA TCGCGAGTGG TGCGCCATAT TCAAGTTCGC ATGCAGCAAC CAGCAGGTTG TTGTCGCTGG GCGATGGGGC AATGA
|
Protein sequence | MANQTIVIGA GQAGLAAGYW LQQAKIPFQI IEAQASVGGS WPAYYDSLSL FSPARFSSLP GMAFPAPADS YPQRDTVVAY LQRYAEHFNL PIQTNTAIST IEPQNGGFRL TSSAGQVFHA GQIIAATGAF ARPFMPELPN QAAFQGKILH SARYRDSADF VGKRVVVVGA GNSAIQIAIE LAQVADVTLA TRQPIRFQAQ RIAGRDIHWW WWLTGFDRLA LNTRVGRWIQ QRTQGIVLDT GLYSRKINQN QPQRRAMFER FSATGVVWAD GQPEAVDIVL FATGYRPHLS YLQGLNALDQ AGLPLHRAGV STTVEGLYYV GLEQQTNFAS ATLRGVGPDA SRVVRHIQVR MQQPAGCCRW AMGQ
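The sequences above are displayed in space-separated blocks of 10 characters. A minimal, dependency-free sketch for stripping that spacing, computing GC content, and translating the gene sequence might look like the following. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand). It uses the codon assignments of the standard genetic code, which translation table 11 shares. Table 11 differs in its set of permitted start codons, which this sketch does not model. The helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence from this record.
fragment = clean_sequence("ATGGCTAATC AAACGATTGT GATTGGGGCT")
print(translate(fragment))  # prints "MANQTIVIGA"
```

The translated fragment matches the first block of the listed protein sequence. Applying `clean_sequence`, `gc_fraction`, and `translate` to the full Gene sequence field would allow the GC content and Protein sequence fields to be cross-checked in the same way.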