Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2327 |
Symbol | |
ID | 4245209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3601653 |
End bp | 3603533 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638107422 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_722022 |
Protein GI | 113475961 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.761686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000043688 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTATTCCT TTGATAATCT GAACTTAGTA ATGTTTAGTG GCAAAGGAGG CGTGGGAAAA ACTACAAATT CTTGTGCTTT TGCTGGCCAT TGGGCAAAAA AATCTCCTAA TGAAAAGGTC TTGTTAATTT CTACTGATCC TGCACACTCT TTAGGAGATG TATTGCAAAT AAGTGTTACA GATACACCAA GACCTTTAGA AGATTTACCT AATCTTTTAG TCAGAGCTTT AGATGTTAAT CTTCTGCTTG AAGAATTTAA AAAACGTTAC GGCGATGTCC TAGAAGTAAT TGTAGAGCGA GGTAGTTTTG TTGAGGGGGG AGACCTGACT CCAGTATGGG ATTTAGATTG GCCAGGATTA GATGAATTAA TGGCTTTACT GGAAATACAA AGGCTTTGTA ACGAAAAAGT AGTTGATCGC GTAGTAGTGG ACATGGCTCC GAGCGGTCAT ACTCTAAACT TATTCAAACT AATGGATTTT TTGGATACTT TTCTCAATTC TCTAGAGCTT TTTCAAGAGA AACATAAATA TATAAAGAAA AGCTTCTCAG GTTCTTATAT ACCAAATGAA GTTGATGAAG TATTACAAAA TCTTAAGGAT GAATTAGCAG CAGGTCGTCA TTTATTACAA AATTCATCTA ATACGGCTTG TTTAGTTGTA GCACTACCTG AACCGATGAG TTTTCGAGAA ACTCAGAGGT TTTTAAGTTC TTTGGAAGAG ATCAAAATTC CTTACGCTGG TATTGTTGTC AATCAAATTG TTGTAGATAA AGATGGTAAT AGCGATCGCT ACCACGAACA GCAAAAGTTA GTTAATGACT TTATTAAACT CGCGGGGGAT AAACCAGTAT TTTTGGTACC CCAAGAAAAA GCTGAACCTT TAGGTGTAAC TGCTTTAGAA AAGCTAACAA ACCAAATTAA TCAAGCAACA ATTCAAGAAT TCTCAACTTC TATAAGCTTC AATATTCAAT GGCCAGAAAA GGTTCCTCCT GGTTTTACCG ACTTTATCGG AAAGGGTAAA CGTTTATTAA TTGTTGGAGG AAAAGGAGGT GTAGGAAAGA CTACGATAGC TGCTGCTATT AGTTGGGAAA TGGCAAAACG ATATCCCGAA AGACAAGTTA GGGCTGTTTC TATTGATCCT GCTCATTCTT TGGGAGATGC CTTTGGTATG GATCTATGTC ATGAACCATC TATAATTAGT TCTAACCTTA AGGGACAAGA AATAGAAGCA AATAAAGTTC TTGAAAAATT TCGAGAGGAT TATTTGTGGG AATTAGCAGA AATGATGAGC GGAGAAAAAT CCGAAAATCA AGCCTCTTTT GAAATGGCTT TTGCCCCTAA AGGATGGCGT CAAATTGTTG AGCAAGCTTT ACCTGGTATT GATGAAATAT TATCATTTAT AACAGTTATT GAGTTATTGG AGGAAAAACA AGAAGATTTA ATTGTTTTAG ATACAGCTCC CACAGGGCAT TTATTACGTT TTCTAGAAAT GCCCACTGCA ATACAGGATT GGTTAGGCTG GATTTTTAAA TTATGGATAA AATATCAAGA TATTATTGGT AAAACTGATT TTATGGGACG TTTGCGAACT TTACGCCAAC GGGTGGTTAA AGCACAAAAA ATTCTCAAAG ATCCAAAAAA AACTGAGTTT ATTGGAGTAA TACGTCCTCA AAAAGGAGTT ATTGCGGAAG CAGAACGTCT TTATAAATCT CTTGCAGAAA TGCACATACC ACAAAATTAT TTAGTTTTAA ACTGTTTTAC CTCTAATAGC GTTATCCCTA CTGACCAATT TCCGGGGGTA CAATTTGTTT GTATGCCAAT GTTACCTCGT TCAATTGAAC CTATTGAACA AATTAAAGGA GCAGCTACTT ATATATTCTA A
|
Protein sequence | MYSFDNLNLV MFSGKGGVGK TTNSCAFAGH WAKKSPNEKV LLISTDPAHS LGDVLQISVT DTPRPLEDLP NLLVRALDVN LLLEEFKKRY GDVLEVIVER GSFVEGGDLT PVWDLDWPGL DELMALLEIQ RLCNEKVVDR VVVDMAPSGH TLNLFKLMDF LDTFLNSLEL FQEKHKYIKK SFSGSYIPNE VDEVLQNLKD ELAAGRHLLQ NSSNTACLVV ALPEPMSFRE TQRFLSSLEE IKIPYAGIVV NQIVVDKDGN SDRYHEQQKL VNDFIKLAGD KPVFLVPQEK AEPLGVTALE KLTNQINQAT IQEFSTSISF NIQWPEKVPP GFTDFIGKGK RLLIVGGKGG VGKTTIAAAI SWEMAKRYPE RQVRAVSIDP AHSLGDAFGM DLCHEPSIIS SNLKGQEIEA NKVLEKFRED YLWELAEMMS GEKSENQASF EMAFAPKGWR QIVEQALPGI DEILSFITVI ELLEEKQEDL IVLDTAPTGH LLRFLEMPTA IQDWLGWIFK LWIKYQDIIG KTDFMGRLRT LRQRVVKAQK ILKDPKKTEF IGVIRPQKGV IAEAERLYKS LAEMHIPQNY LVLNCFTSNS VIPTDQFPGV QFVCMPMLPR SIEPIEQIKG AATYIF
|
| |