Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2086 |
Symbol | |
ID | 5733974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2597208 |
End bp | 2598986 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279227 |
Product | ABC transporter related |
Protein accession | YP_001544854 |
Protein GI | 159898607 |
COG category | [V] Defense mechanisms |
COG ID | [COG1132] ABC-type multidrug transport system, ATPase and permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCTT GGCAATTAAT GCTGCGTTTG ATCCGCTATC GGCCAGCAAT CTATGGCTTT GGGGTCTTGA TTTGGAGCAT ATTTTTTATC TCGCCCTTGC TGCCTGGCCT GATCAATCGG GCAATTTTCG ACCAACTCAG CGGCGCGGCT CCAGCGGTAT TTGGCATCTG GACGCTCTTA GCCCTCTTGA TCACCACTGA GATTAGCCGA CGTTTATTGA CCCAACTTGG AACCGCGATG GATATCTCGG TGCAATATAA CATTGCAGGC TTGTTGCGCA AAAATATGCT GAAGCGGATT TTGGCTCAGC CAGGTGCCCA AGCCTTGTAT ACCTCATCAA GTGCGGCGAT CAACAATTTT CGCGATGATG TTGAGGAAAT TCTGACCTTT AGCATTTGGC TGCTTGATGT ATTTGGTAAT GTGCTGTTCA GCCTGATTGC GCTAGGAATC CTGTTACAAA TCAATCAACA GGTAACCTTG GTTATTTTGG TTCCATTGCT GATCGTGGTT TCGGCCTTTA CCGTTGCCAA TCAGCGGATT CAGCGCTATC GTCAGGCCAG CCGCGCCGCA ACGAGCAAGG TTTCCGATTT TCTGGGCGAA ATGTTTGGGG CAACCCAAGC GATCAAAGTG GCGAATGCTG AAACTGCGGT AATCGGCCAT TTTCAACAAC TCAACGACCA ACGTCGCCAG TTTATGGTCA AAGATCGGGC ATTTACGGCC TTGCTTAGCT CCGTTTACGC CAACACAATT AGCCTTGGCA CGGGCATGAT TTTGTTGCTA GCAGGCGAGG CAATGCGTAC TGGCAATTTT AGTGTTGGTG ATTTCTCGCT GTTTGTCTAT TATTTTTCAA TGGTTACCCG TTTGCCCTCA CTGATTGGCT TGTTGTTGAC CCACTACAAA CAGGCTGGAG TTTCGTTCCA GCGCATGCAA CAGCTATTAA AGGATGCAGC ACCAAGCGCT TTGGTTGAAC ATGGCTCGAT TACGCCCGAT CAACAGCTGG ATTTTAGCCC AGCCAAGGCC TTGCCAGCAT TGCAACGGCT TGATATAAGC AACTTAAGCT ATTGCTACCC CAACAGCCAA GCTGGTATCA AAGCGATTGA TTTCAGGCTT GAGCGTGGCC AATTTGTGGT AGTTACGGGC AAAGTTGGCT CAGGCAAAAC CACCGTATTG CGGGCCTTGC TTGGTTTAGT GCCAGCCCAT GGCACGATTC TCTGGAATAA TCAGCCGATC GAACAACCTG CAGAGCAACT CATTCCCCCC AATTGTGCTT ATACCGCGCA AGTGCCGCGC TTATTGAGCA GCTCGCTGCA CGATAATTTG AGCCTAGGGC TGAATTTGAG CGCTAATGAA TTACAACAGG CGATTGAAAC GAGTGTATTA GCGCCAGATC TACAGCATAT GCCCGCAGGC CTAGCGACCG AACTTGGTTC ACGGGGCGTG CGTTTATCGG GCGGTCAACT ACAACGAGCA GCTTTAGCGC GAATGTTAGT GCGTTCGAGC GAGTTGATGA TCGTTGATGA TTGTTCGAGT GCCTTAGATG TGACCACTGA GCAACAACTG TGGCAAGGCC TACGTCAGCA CCCAACCACA TGGCTGGTTG TTTCGCATCG CTGCGCCGTC CTTCAGCTTG CCGATTGGGT AATTGTTATG GACGCTGGTC AGATTGCAGC CCAAGGCCCA CTAAATGATC TACTCGAAAC ATCGCCAGCT ATGCGTGAAC TTTGGCAAGC TGAGCCGAGC AATCAAAAAC TATTGCTTGG TGTTGATGCG CACGTTTAG
|
Protein sequence | MKAWQLMLRL IRYRPAIYGF GVLIWSIFFI SPLLPGLINR AIFDQLSGAA PAVFGIWTLL ALLITTEISR RLLTQLGTAM DISVQYNIAG LLRKNMLKRI LAQPGAQALY TSSSAAINNF RDDVEEILTF SIWLLDVFGN VLFSLIALGI LLQINQQVTL VILVPLLIVV SAFTVANQRI QRYRQASRAA TSKVSDFLGE MFGATQAIKV ANAETAVIGH FQQLNDQRRQ FMVKDRAFTA LLSSVYANTI SLGTGMILLL AGEAMRTGNF SVGDFSLFVY YFSMVTRLPS LIGLLLTHYK QAGVSFQRMQ QLLKDAAPSA LVEHGSITPD QQLDFSPAKA LPALQRLDIS NLSYCYPNSQ AGIKAIDFRL ERGQFVVVTG KVGSGKTTVL RALLGLVPAH GTILWNNQPI EQPAEQLIPP NCAYTAQVPR LLSSSLHDNL SLGLNLSANE LQQAIETSVL APDLQHMPAG LATELGSRGV RLSGGQLQRA ALARMLVRSS ELMIVDDCSS ALDVTTEQQL WQGLRQHPTT WLVVSHRCAV LQLADWVIVM DAGQIAAQGP LNDLLETSPA MRELWQAEPS NQKLLLGVDA HV
|
| |