Gene Information |
Locus tag | Haur_1956 |
Symbol | |
ID | 5733845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2377255 |
End bp | 2380071 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279100 |
Product | ATPase |
Protein accession | YP_001544727 |
Protein GI | 159898480 |
COG category | [R] General function prediction only |
COG ID | [COG0714] MoxR-like ATPases |
TIGRFAM ID | |
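
The coordinate and length fields above agree with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though the listed values fit it), and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2377255, 2380071)
print(length)                  # 2817, matching "Gene Length"
print(protein_length(length))  # 938, matching "Protein Length"
```

The strand field does not affect this arithmetic: a gene on the minus strand spans the same number of bases.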
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.512123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCCCGAG TCGATGATCG TCCGCTGTTC TACGCCGCCG CTCAGAGCTT TGTTGATCGA GCATTGCGGG CTGACGATTC GCTATTTACG CCTGGCGTTG CTGTTTGGAG TGCGGCCAAC CTTGATGATC TCTACCAACG CTTCGTCGGC CAACCCGACG AATCAGCTGA TAGCTTCATG ATCAAATTCC AACGCCAACT GCATGAAGCG CAACCAACCA TCATCCAGCT TGCTGCCGAA CTCCAATTTG TCTACTATCT GATTTCGCGC AAAATCACCG GACGCGCCAA GCGCGACCAG ATTAATACCA TCCTCAAATG GTCACCCGAG CCAGTCAGCA TTCCCCATGA GCTTGACGGA GCGCTTGATC AAGGCATCGC CAACACCGGC ACAGCCTATC AGACCTACAA ATTTTACCAA CTAAGCTTTA TCATCGAATT TATGCAACAC TGGAAAGGGC TATCGCAGGC AGCTCGTACA ACAGCATTGG CTGATCCATG GGAATTTAAG CAGATTTTAT TCTCGCTGCC AATCAAAACC GCCTATGCCG CCCGCGAAAT GCTGCTGCAT CTGGTACACC CCGATAGCTT TGAATCGATC GTTTCGCGTG ATCATAAAGC TAATTATGCT CGCCAATATG CCCATCATAA ACAAACCACC AGCAACGATA TTGATCGCCA GTTGTGGGAG ATTCGCCGCG CCTTGACCCC ACAGTATGGG GCAAACTTCA GCTTCTATGC CATTCAACAT GATGAAACTC AGCGGCCTGA TTTCCCTGTG CCCCTGCCAT TGGGGCCAAA ACTACGCCCC TATATTCAGC TGGTAGCCAT GCTCTCCACC AACAGTTACA GTGCCGAGCA AATCGTTGAT GTGCTCGGGC AGGCCAACCC ACCCTTAGTA CAACTCACGG CCCGCCCGAA TGCCGATGAT CTGCTTGATG TATTGCAACT GTTACGTTTA GTTGAGCAAC TGCCCGACGA CCGCTATCGG CGCTGGCCGC ACCTCAACGA TCTGCATGAG GAAACCATGC TGCGCTATAG TGCGCTAACC CTCGTGCTGC CCGATAGCGA AGCTAACGAC GATTATTGGC TACCAATTAT GGCGATGCCC TTCGATGGAG TAGCTCACCC CGCCGAAGCA TGGCCTGGCC CGGCCTTGTT ACGTGATTGG TATCGCGAGG CAGGCTTGAT TGAGCAGGGC GAGCATAGTT GGCTGCGCAG TCGCCCCGCT GCCTTACAAC CCATAGCCAA CCCAACCACC CCCACTGCCC ATGCGATCAA TAGCTTTCTT GAGCATATCG AGCGCGTGCA ACGCAGCCAA CGCAGCACCA TGGATAGCGC CTTGCAAGAT CAACCGCTGC CGCAACTGAC CACCGCCGTG CTCAACGAGC GCATCGCCGA ACTGCAACGC GAACTGTTGG TCAACCGCGT CACCCTACTG CGAATCTACC GCGCCCTGAT CGCAGGCCAG CATGTGATTT TGAGTGGCCC GCCAGGCACT GGCAAAACCC ACTTAGCCCA ACGGCTGCCC GAGATCTTCT GGCGCGATAC TGATGCCAGT ATCAGCTTGC GCATGCCAAC CTCGCCTGCC TTACCACCGA CCGAGCCACC AATTGAAGAG CGCCATACCC GCCATGGCTA TGCCGTTCAA ATTGAAACTG CCACCGAAAC CTGGAGCAGC CGAGATATTA TTGGCGGCAT CGTGCCTCAA CTCCAGCGCA GTGCTGGCGG CAAAACCTTG GTGTATGGAG TGCGCCATGG CTGCCTCACC AGCGCAGTGC TTTCGAACTA TGCGGGCTAC 
GACGGCGAAA ACGTGCCCAA CCCCGAAACC CTGCAACGCA CTGAGGTGAT GGTTAAACAG CAACGCTACC GTGGTCGCTG GCTGGTCATC GACGAATTTA CCCGAGCGCA TATTGATGCC GCTTTTGGCA GCTTGCTCAC GACCCTTGGC GGCCAACGCA ACGCCCCACT CAGCATCCCA ACCGACGATG GCGTTGTAGT GCAAGTGCCT TTGCCACGCG ATTTCCGCAT CATCGGCACG CTCAACTCGT TTGATCGTCA CTTTCTCAAC CAAATGAGCG AAGCCATGAA ACGGCGTTTT GTCTTTATTG ATCTGTTGCC ACCAAGCAGC GACGATGCCG ATGAAGAGCA AGGCATCGCG GCCTATCGTG CCCTTTTACG CCTGAGCGAC CAAAAACTCG ATACCATCGC CAGCAACGAT GCAGCCGGAC GAGCCACGTG GCGTGGTGTG CTCAATGTTA ATCGCGAGAT CAATCGCGAT GGTGACGGCT CACGGGTCAA CTATCGGCTA GAGGTCGAAA ATCAGGATGC CAAAGATGTA TTGGATAGTT TCTGGCGCAT CTTCAGCGCA ATTCGGGTCT ATCGCCAGCT TGGCACTGCC CAGGCCGAGG CAGTGTATGC CGCAACCTTT GCTGGCCATG CCATTGGCTT GAGTTGGCAC GATGCGCTCG ATTACGCCCT TGCCGATACC TTGGCCGACC AACTGCAAGT GCTCAATCGC GATGAACAGC GGGCCTTATT GGCCTATCTG GCCTATGCTC ACAATCCCAA AGAATTTAGT GAGCAGCTCA AAACGATCAT CAAGAGCCTG CCGTTGCCGC GCCAAGCCAG CCATTTAACC CAACTCCAAA CAGCTAGCCC CAAGCATGCT AGTGGCTCGA TCAACATAGC CAACATTGAT GAGCTAACGA TGACGCACAT TAGCCAAATT TTTGATCTCG GCACAGAATT GCTGCTCGAT CAAACCAGTC AATTTGCCCA ACGCTTGCGC ACGTTCAGCA GCGAACGAGG CCTGTGA
|
Protein sequence | MARVDDRPLF YAAAQSFVDR ALRADDSLFT PGVAVWSAAN LDDLYQRFVG QPDESADSFM IKFQRQLHEA QPTIIQLAAE LQFVYYLISR KITGRAKRDQ INTILKWSPE PVSIPHELDG ALDQGIANTG TAYQTYKFYQ LSFIIEFMQH WKGLSQAART TALADPWEFK QILFSLPIKT AYAAREMLLH LVHPDSFESI VSRDHKANYA RQYAHHKQTT SNDIDRQLWE IRRALTPQYG ANFSFYAIQH DETQRPDFPV PLPLGPKLRP YIQLVAMLST NSYSAEQIVD VLGQANPPLV QLTARPNADD LLDVLQLLRL VEQLPDDRYR RWPHLNDLHE ETMLRYSALT LVLPDSEAND DYWLPIMAMP FDGVAHPAEA WPGPALLRDW YREAGLIEQG EHSWLRSRPA ALQPIANPTT PTAHAINSFL EHIERVQRSQ RSTMDSALQD QPLPQLTTAV LNERIAELQR ELLVNRVTLL RIYRALIAGQ HVILSGPPGT GKTHLAQRLP EIFWRDTDAS ISLRMPTSPA LPPTEPPIEE RHTRHGYAVQ IETATETWSS RDIIGGIVPQ LQRSAGGKTL VYGVRHGCLT SAVLSNYAGY DGENVPNPET LQRTEVMVKQ QRYRGRWLVI DEFTRAHIDA AFGSLLTTLG GQRNAPLSIP TDDGVVVQVP LPRDFRIIGT LNSFDRHFLN QMSEAMKRRF VFIDLLPPSS DDADEEQGIA AYRALLRLSD QKLDTIASND AAGRATWRGV LNVNREINRD GDGSRVNYRL EVENQDAKDV LDSFWRIFSA IRVYRQLGTA QAEAVYAATF AGHAIGLSWH DALDYALADT LADQLQVLNR DEQRALLAYL AYAHNPKEFS EQLKTIIKSL PLPRQASHLT QLQTASPKHA SGSINIANID ELTMTHISQI FDLGTELLLD QTSQFAQRLR TFSSERGL
|
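
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first and last few codons of the record and defines a simple GC-content helper. It assumes the gene sequence is shown 5' to 3' on the coding strand (so no reverse complement is needed even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for amino acids. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Standard codon assignments (also used by translation table 11),
# built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon; '*' marks a stop."""
    dna = clean(dna)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGCCCGAG TCGATGATCG TCCGCTGTTC"))  # MARVDDRPLF
print(translate("GAACGAGG CCTGTGA"))                  # ERGL*
```

Both fragments match the corresponding ends of the protein sequence listed above; the trailing `*` is the TGA stop codon, which is not part of the 938 aa product. Calling `gc_content` on the full gene sequence would give the value to compare against the "GC content" field.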