Gene Cpha266_2297 details

Gene Information

Locus tag: Cpha266_2297
Symbol:
ID: 4569401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2629745
End bp: 2630899
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 46%
IMG OID: 639766859
Product: arsenite-activated ATPase ArsA
Protein accession: YP_912713
Protein GI: 119358069
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
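
The coordinates and lengths in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. Both function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2629745, 2630899)
print(length)                  # 1155
print(protein_length(length))  # 384
```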


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.680488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATTA TTCTTTACCT GGGTAAGGGC GGAGTTGGAA AAACTACCGT CTCGGCTTCA 
ACAGCAACTG CAATTGCCCG CCGCGGAGAA CGGGTGCTTA TTATGAGTAC GGATGTAGCC
CACAGTCTTG CTGACGCTTT GGGTGTGGAA TTAAGCCCGA CTCCTCTTGA AGTAGAACAA
AATCTGTTTG CGATGGAGGT CAATGTTCTG ACAGAAATCA GGGAGAACTG GTCTGAGCTT
TATTCCTATT TTTCCTCCAT TCTCATGCAT GACGGCGCAA ATGAGGTCGT TGCTGAAGAA
CTTGCCATTA TGCCGGGCAT GGAAGAGATG ATCAGTCTCC GATATATATG GAAAGCTGCC
AAGTCCGGAA ATTATGATGT TGTGGTTGTT GACGCAGCTC CGACAGGTGA AACCATGCGC
CTGCTTGGCA TGCCGGAATC CTATGGATGG TATTCCGAGA AAATCGGTGG ATGGCACTCT
AAAGCCATTG GCTTTGCTGC GCCGCTGCTG TCGAAATTCA TGCCTAAAAA GAATATTTTC
AAGTTGATGC CTGAGGTGAA CGAGCATATG AAAGAGCTGC ACACCATGCT GCAGGACAAA
AACATCACCA CGTTCAGAGT TGTCCTCAAC CCTGAGAACA TGGTGATCAA GGAAGCTCTT
CGAGTTCAGA CCTATCTGAA TCTTTTCGGT TACAAGCTCG ATGCCGCCAT AGTCAACAAG
GTTCTTCCTG AAAGCTCATC AGACCAGTAT CTGCAATGCC TTATTGACCT GCAGGCCAAG
TATCTGAAGG TTATTGAAAA CTGTTTTTTC CCTGTTCCGA TTTTCAGGGC AAAACAGTCC
ACGGCTGAGG TTATCACCCC GGACAGGCTT TATGAACTGA GTCAGGAGAT TTTTGCTGAT
CAGAATCCTT CAGCGGTGCT TTACAGCAAT GAAAAGACCC AGACGCTTGA GAAAATAAAC
GGCAAATACG TTCTGAGCCT CTATCTGCCT AATGTAGAGG TGACAAAGCT GAATGTCAAT
ATCAAGGGAG ATGAATTACT GATTGACATC AACAATTTCC GTAAAAGCAT TATTTTGCCC
AATGTTCTCG TTGGAAGAAA AACGGAGGGG GCCGACTTTG TTTCCGGAAA CCTCAATATA
ACCTTTGCAA ACTGA
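
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used as input here. The result for the full 1155 bp sequence has not been checked against the listed 46%.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied as shown in the record.
first_line = "ATGAGAATTA TTCTTTACCT GGGTAAGGGC GGAGTTGGAA AAACTACCGT CTCGGCTTCA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```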
 
Protein sequence
MRIILYLGKG GVGKTTVSAS TATAIARRGE RVLIMSTDVA HSLADALGVE LSPTPLEVEQ 
NLFAMEVNVL TEIRENWSEL YSYFSSILMH DGANEVVAEE LAIMPGMEEM ISLRYIWKAA
KSGNYDVVVV DAAPTGETMR LLGMPESYGW YSEKIGGWHS KAIGFAAPLL SKFMPKKNIF
KLMPEVNEHM KELHTMLQDK NITTFRVVLN PENMVIKEAL RVQTYLNLFG YKLDAAIVNK
VLPESSSDQY LQCLIDLQAK YLKVIENCFF PVPIFRAKQS TAEVITPDRL YELSQEIFAD
QNPSAVLYSN EKTQTLEKIN GKYVLSLYLP NVEVTKLNVN IKGDELLIDI NNFRKSIILP
NVLVGRKTEG ADFVSGNLNI TFAN
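
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). `translate` is a hypothetical helper, not a library function, and it raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAGAATTA TTCTTTACCT GGGTAAGGGC"))  # MRIILYLGKG
print(translate("ACCTTTGCAA ACTGA"))                  # TFAN
```

The two outputs match the first block and the final residues of the protein sequence listed above.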