Gene Franean1_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1823 
Symbol 
ID5670225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2187817 
End bp2189022 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content77% 
IMG OID641240744 
Productarsenite-transporting ATPase 
Protein accessionYP_001506167 
Protein GI158313659 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.702579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.326449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCTGA CGGGCAAGGG TGGTGGTGGG ACCACGACGG TGGCAGCGGC GACGGCCACG 
CTCGCGGCCC AGCGCGGCCA CCGGACGCTC CTGGTCTGCG CCGACCCGGC CGCCGGCCTG
GCCGGCATCC TCGACCATCC GCTCGGCCCC GGTGAGGTCG AGCTCGAACC CGGGCTGTTC
GGCCTCCAAC TGGACCTGCG CCACGCGTTC GCCGAGCGCT GGCCGCAACT ACGCGCGCTG
ATCTCGGCCG CGGCCCCCGC CGCCGGTGTC GACCTCCTGG AGGTCGAGGA GCTCGCCGGC
CTGCCCGGCG CCCTCGAGGC GCTCACCCTG CTGGAGCTGC GCGACCGGCT CGAGTCGGAC
CGCTACGACG TCGTGGTCGT CGACGCAGGG CCGGCCAGCG CCGCGCTGCG GCTGCTGGCG
AGCCCGGAGA CGCTCGCGTG GGGCTGCCGC CGGCTTGGCT CCCCGGACGG CGTGCTCGCC
CGCTGGATGC GCCCGTCGCT GGCGCTGCCG GGCCGTCTCA CCGGGCGGGT GGCCGCGATC
GCCGGGCCCG CCTACGAGGC GGTGTCGTGG CTGGCACGCC TGGCGCGTGA CATGCGCGCC
CTGCTTGCCG ATCCCGTCGT GACCAGCGTG CGCCTGGTGC TGACCCCGGA GACGGCCGCG
CTCGCCCAGG CCCGGCGGAC CCTGAGCGCC CTGGCGCTGC ACGGCATCGG CGTGGACGCG
GTGGTCGCGA ACCGGGTGAT CGCTTCGGCC GGGGGCGACG CCTGGCGGGC CGGCTGGGCG
GCGGCGCACC GCCAGCAGCT CATCGAGATC GGCGCGTTCG TCGCGCCGCT GCCCGTCCTC
ACCGCCGCCT ACCGGGCCGG TGAGCCGCTC GGGCTGGAGG AGCTCGCCGC CTTCGGCGCC
GCGGCCTACG GAGACCTCGA CCCGGCCGCG GTGCTCAGCG TTCACCGGTC CGGCGAGGGG
GCGGGCCCGC GGGTCGAGCG CACCGAGGGC GGCTACGCCA TGTCGTTCGG CCTCCCCTTC
GTCGACCGGT CGGAGGTCGA CCTCGCCCGG GTCGGTGACG ACCTCATCGT CAGCGTCGGC
CCGCACCGGC GGCTCGTCCC GCTGCCCGCC GCGCTGCGCC GGTGTGATGT GTCAGGCGCC
CGGCTGGCCG AGGAGCGCCT CGTCGTGTCG TTCGTGCCGG ACCCGGCCCA GTGGGTGCGC
GCGTGA
 
Protein sequence
MLLTGKGGGG TTTVAAATAT LAAQRGHRTL LVCADPAAGL AGILDHPLGP GEVELEPGLF 
GLQLDLRHAF AERWPQLRAL ISAAAPAAGV DLLEVEELAG LPGALEALTL LELRDRLESD
RYDVVVVDAG PASAALRLLA SPETLAWGCR RLGSPDGVLA RWMRPSLALP GRLTGRVAAI
AGPAYEAVSW LARLARDMRA LLADPVVTSV RLVLTPETAA LAQARRTLSA LALHGIGVDA
VVANRVIASA GGDAWRAGWA AAHRQQLIEI GAFVAPLPVL TAAYRAGEPL GLEELAAFGA
AAYGDLDPAA VLSVHRSGEG AGPRVERTEG GYAMSFGLPF VDRSEVDLAR VGDDLIVSVG
PHRRLVPLPA ALRRCDVSGA RLAEERLVVS FVPDPAQWVR A