Gene Cpha266_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1248 
Symbol 
ID4570266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1417025 
End bp1418248 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content45% 
IMG OID639765839 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_911705 
Protein GI119357061 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.245815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATTT TAACGTTTAC CGGAAAAGGC GGCGTAGGAA AGACAAGTGT TTCTGCAGCT 
ACAGCTGTTC GATTATCGGA GCTTGGTTAT CGTACGCTTG TCCTTTCAAC AGATCCTGCG
CATAGTCTGT CGGATTCGTT CAATCTTCCT CTTGGCGCCG AGCCAACAAA GATCAAGGAA
AATCTTCATG CCATCGAGAT TAATCCCTAT GTTGATCTGA AGCAGAATTG GCATGCTGTT
CAGAAATTCT ATACAGGAAT ATTCAAGCCC CAGGGCGTAT CGGGTGTTGT CGCCGATGAG
ATGACCATTC TTCCGGGAAT GGAAGAGCTG TTTTCCCTTT TGAGGATAAA ACGTTATAAA
ACTTCAGGAC TCTACGATGT TCTCGTACTC GATACCGCTC CGACAGGTGA AACCCTTCGC
TTGCTCTCTC TGCCGGACAC GCTTGCATGG GGCATGAAAG CCGTTAAAAA TGTTACCAAA
TATATCGTTC GGCCACTCAG CAAGCCCCTC TCCCGGATGT CTGACAAGAT CGCGCAATAT
ATTCCACCTG AAGAAGCGCT GGATTCTGTC GATCAGGTTT TTGATGAACT TGAAGATATT
CGCGAGATTC TGACCGATAA TCAGAAATCG ACTGTCCGTC TGGTGATGAA TGCTGAAAAG
ATGTCGATAA AGGAGACGAT GCGAGCACTT ACCTATCTCA ATCTGTATGG TTTCAAAGTC
GATATGGTGC TGGTAAACCG GTTGCTTGAC ACTAAGGAAA ACAGCGGATA TCTTGAAAAC
TGGAAAACCA TTCAGCAGAA ATATCTTGGA GAGATCGAAC AGAGTTTTTC GCCTCTTCCG
GTTAAAAAAC TCAGGATGTA TGAAGAAGAG ATTGTTGGTC TCAAGGCACT TGAGCTTTTT
GCCCGGGATA TGTATGGCGA AACTGATCCT GCCGATATGA TGTACGACGA ACCGCCGATC
AAGTTCGTTC GCACGGGTGA TATTTATGAG GTACAGTTAA AGCTTATGTT TGCCAATCCC
GTTGATATCG ATGTATGGGT TACTGGTGAT GAACTCTATG TACATATCGA AAACCAGCGC
AAGATTATCA CGCTTCCGAT CAGTTTAACC GGACTTGAAC CGGGAGATGC CTATTTCAAG
AACAAGTGGC TGCACATTCC TTTTGATCTT GACAATCATA AACAACACAA GACAACAAAG
CAGTACAATA AAGCTCTTAA TTGA
 
Protein sequence
MRILTFTGKG GVGKTSVSAA TAVRLSELGY RTLVLSTDPA HSLSDSFNLP LGAEPTKIKE 
NLHAIEINPY VDLKQNWHAV QKFYTGIFKP QGVSGVVADE MTILPGMEEL FSLLRIKRYK
TSGLYDVLVL DTAPTGETLR LLSLPDTLAW GMKAVKNVTK YIVRPLSKPL SRMSDKIAQY
IPPEEALDSV DQVFDELEDI REILTDNQKS TVRLVMNAEK MSIKETMRAL TYLNLYGFKV
DMVLVNRLLD TKENSGYLEN WKTIQQKYLG EIEQSFSPLP VKKLRMYEEE IVGLKALELF
ARDMYGETDP ADMMYDEPPI KFVRTGDIYE VQLKLMFANP VDIDVWVTGD ELYVHIENQR
KIITLPISLT GLEPGDAYFK NKWLHIPFDL DNHKQHKTTK QYNKALN