Gene Information |
Locus tag | Franean1_1823 |
Symbol | |
ID | 5670225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2187817 |
End bp | 2189022 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240744 |
Product | arsenite-transporting ATPase |
Protein accession | YP_001506167 |
Protein GI | 158313659 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
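The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2187817
end_bp = 2189022

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1206 401
```

Under these assumptions the computed values agree with the listed gene length of 1206 bp and protein length of 401 aa.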
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.702579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.326449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCCTGA CGGGCAAGGG TGGTGGTGGG ACCACGACGG TGGCAGCGGC GACGGCCACG CTCGCGGCCC AGCGCGGCCA CCGGACGCTC CTGGTCTGCG CCGACCCGGC CGCCGGCCTG GCCGGCATCC TCGACCATCC GCTCGGCCCC GGTGAGGTCG AGCTCGAACC CGGGCTGTTC GGCCTCCAAC TGGACCTGCG CCACGCGTTC GCCGAGCGCT GGCCGCAACT ACGCGCGCTG ATCTCGGCCG CGGCCCCCGC CGCCGGTGTC GACCTCCTGG AGGTCGAGGA GCTCGCCGGC CTGCCCGGCG CCCTCGAGGC GCTCACCCTG CTGGAGCTGC GCGACCGGCT CGAGTCGGAC CGCTACGACG TCGTGGTCGT CGACGCAGGG CCGGCCAGCG CCGCGCTGCG GCTGCTGGCG AGCCCGGAGA CGCTCGCGTG GGGCTGCCGC CGGCTTGGCT CCCCGGACGG CGTGCTCGCC CGCTGGATGC GCCCGTCGCT GGCGCTGCCG GGCCGTCTCA CCGGGCGGGT GGCCGCGATC GCCGGGCCCG CCTACGAGGC GGTGTCGTGG CTGGCACGCC TGGCGCGTGA CATGCGCGCC CTGCTTGCCG ATCCCGTCGT GACCAGCGTG CGCCTGGTGC TGACCCCGGA GACGGCCGCG CTCGCCCAGG CCCGGCGGAC CCTGAGCGCC CTGGCGCTGC ACGGCATCGG CGTGGACGCG GTGGTCGCGA ACCGGGTGAT CGCTTCGGCC GGGGGCGACG CCTGGCGGGC CGGCTGGGCG GCGGCGCACC GCCAGCAGCT CATCGAGATC GGCGCGTTCG TCGCGCCGCT GCCCGTCCTC ACCGCCGCCT ACCGGGCCGG TGAGCCGCTC GGGCTGGAGG AGCTCGCCGC CTTCGGCGCC GCGGCCTACG GAGACCTCGA CCCGGCCGCG GTGCTCAGCG TTCACCGGTC CGGCGAGGGG GCGGGCCCGC GGGTCGAGCG CACCGAGGGC GGCTACGCCA TGTCGTTCGG CCTCCCCTTC GTCGACCGGT CGGAGGTCGA CCTCGCCCGG GTCGGTGACG ACCTCATCGT CAGCGTCGGC CCGCACCGGC GGCTCGTCCC GCTGCCCGCC GCGCTGCGCC GGTGTGATGT GTCAGGCGCC CGGCTGGCCG AGGAGCGCCT CGTCGTGTCG TTCGTGCCGG ACCCGGCCCA GTGGGTGCGC GCGTGA
|
Protein sequence | MLLTGKGGGG TTTVAAATAT LAAQRGHRTL LVCADPAAGL AGILDHPLGP GEVELEPGLF GLQLDLRHAF AERWPQLRAL ISAAAPAAGV DLLEVEELAG LPGALEALTL LELRDRLESD RYDVVVVDAG PASAALRLLA SPETLAWGCR RLGSPDGVLA RWMRPSLALP GRLTGRVAAI AGPAYEAVSW LARLARDMRA LLADPVVTSV RLVLTPETAA LAQARRTLSA LALHGIGVDA VVANRVIASA GGDAWRAGWA AAHRQQLIEI GAFVAPLPVL TAAYRAGEPL GLEELAAFGA AAYGDLDPAA VLSVHRSGEG AGPRVERTEG GYAMSFGLPF VDRSEVDLAR VGDDLIVSVG PHRRLVPLPA ALRRCDVSGA RLAEERLVVS FVPDPAQWVR A
|
| |
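The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleanup before any calculation such as the GC content listed in the record. The following is a minimal sketch of that cleanup and calculation; `gc_fraction` is a hypothetical helper name, and the demonstration uses only the first two blocks of the sequence rather than the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) between blocks is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two ten-base blocks of the gene sequence, copied from the record.
first_blocks = "GTGCTCCTGA CGGGCAAGGG"
print(round(gc_fraction(first_blocks), 2))  # prints: 0.7
```

Passing the complete gene sequence string to the same function gives a value that can be compared with the GC content field in the record.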