Gene Cyan8802_2514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2514 
Symbol 
ID8391839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2539853 
End bp2541046 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content43% 
IMG OID644980478 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_003138215 
Protein GI257060327 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.138109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.232029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGTAA TCCTGATGAC TGGTAAAGGA GGAGTCGGTA AAACCTCTGT CGCTGCTGCT 
ACTGGACTTA GGTGTGCTGA ATTAGGCTAC AAAACCCTGG TTTTGAGTAC CGATCCTGCC
CATTCCCTGG CCGATAGTTT TGATTTAGAA TTAGGTCACG ATCCCCGTCA AATTCGTCCT
AATCTTTGGG GGGCCGAATT GGATGCACTC ATGGAATTAG AAGGAAATTG GGGGGCTGTC
AAACGCTACA TCACCCAAGT CCTACAAGCG AGGGGACTCG ATGGGGTTCA AGCCGAAGAA
TTAGCTATTT TGCCAGGAAT GGATGAAATT TTCGGGTTAG TGCGGATGAA ACGTCACTAC
GATGAAGGGA CCTACGATGT CCTCATTATC GACTCAGCCC CCACCGGAAC CGCCCTCAGG
CTATTAAGTA TCCCCGAAGT CGGGGGATGG TATATGAGAC GATTTTACAA ACCCTTACAG
GGAATGTCGG TGGCGTTGCG TCCTTTGGTT GAACCCCTCT TTAAACCCAT TGCCGGGTTT
TCCTTACCTG ACAAGGAAGT CATGGACGCA CCCTATGAAT TTTATGAGCA AATTGAAGCC
TTAGAAAAGG TTTTAACCGA TAATAGTCAA ACGACTGTTC GATTAGTCAC CAACCCCGAA
AAAATGGTGA TCAAGGAATC TTTACGAGCC CATGCCTATT TAAGTTTGTA CAATGTGTCT
ACGGATTTAG TTGTCGCTAA CCGCATTCTT CCTGAAACCG TCACGGATAG TTTTTTCCAC
CGATGGAAAG AAAATCAACA GGTTTACAAA CAAGAAATCT ACGATAATTT CCATCCGTTA
CCCGTTAAAG AAGTCCCCCT TTATTCAGAA GAAATGTGTG GGATAGAAGC CTTAGAACGT
CTGAAAGAAA CCTTGTATAA AGATGAAGAT CCTGCCCAAG TTTACTACAA AGAAGATACC
CTAAGAGTTG TCAAACAAGA CGATCACTAT AGTCTAGAGT TGTATCTGCC AGGTATTCCT
AAAGAACAAA TTCAACTCAA TAAAACAGGG GATGAGTTAA ATATTCGTAT TGGCAATCAT
CGACGGAATT TAGTCTTACC TCAAGCCTTA GCTGCCTTGA AACCTTCAGG CGCGAAAATG
GAGGAAGATT ATTTAAAAAT TCGGTTTACT CAAGGAATAC CTAGCAAAAT TTAA
 
Protein sequence
MRVILMTGKG GVGKTSVAAA TGLRCAELGY KTLVLSTDPA HSLADSFDLE LGHDPRQIRP 
NLWGAELDAL MELEGNWGAV KRYITQVLQA RGLDGVQAEE LAILPGMDEI FGLVRMKRHY
DEGTYDVLII DSAPTGTALR LLSIPEVGGW YMRRFYKPLQ GMSVALRPLV EPLFKPIAGF
SLPDKEVMDA PYEFYEQIEA LEKVLTDNSQ TTVRLVTNPE KMVIKESLRA HAYLSLYNVS
TDLVVANRIL PETVTDSFFH RWKENQQVYK QEIYDNFHPL PVKEVPLYSE EMCGIEALER
LKETLYKDED PAQVYYKEDT LRVVKQDDHY SLELYLPGIP KEQIQLNKTG DELNIRIGNH
RRNLVLPQAL AALKPSGAKM EEDYLKIRFT QGIPSKI