Gene Clim_2021 details

Gene Information

Locus tag: Clim_2021
Symbol:
ID: 6355525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2239583
End bp: 2240638
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 56%
IMG OID: 642669619
Product: arsenical-resistance protein
Protein accession: YP_001944032
Protein GI: 189347503
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
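
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2239583
END_BP = 2240638
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # 1056
print(coding_length_aa(GENE_LENGTH_BP))   # 351
```

Under these assumptions both computed values agree with the Gene Length and Protein Length listed in the record.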


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCATGT CAACGAAACA ACTCTCCTTT CTCGACCGGT ACCTTACGCT CTGGATATTC 
CTCGCCATGG CAGTCGGCGT ATTCTCGGGC TATCTTGTTC CCGGTACGGC GGCGTTCTGG
AATACCTTTC AGTCGGGTAC CACCAATATC CCCATCGCCA TAGGGCTTAT CGTCATGATG
TATCCGCCTC TGGCCAAGGT GAAGTACGAG GAGCTTGGCG ATGTGTTCCG CAACACCAGA
GTGCTCGGGC TTTCCCTTGT GCAGAACTGG GTGATCGGGC CGGTGCTCAT GTTCGTGCTT
GCAGTGCTCT TTCTTTCCGA TATGCCGCAT TACATGGCCG GACTCATCCT GATCGGCCTG
GCACGCTGCA TTGCCATGGT GATCGTCTGG AACGAGCTGG CAAAAGGCGA TACGGAGTAC
GCGGCAGGTC TCGTCGCCTT CAACTCGGTC TTTCAGGTGC TGTTCTTCTC CGTTTACGCG
TGGGTATTCC TCACGGTGCT GCCCGGCTGG CTCGGCCTCT CGTCGTTCAG GGTCGATATC
ACCATCGCCG AAATCGCTTC GTCGGTTTTC ATCTACCTCG GCATTCCCTT CATTGCGGGT
TTTCTTACCC GGTTCGTCAT GCTCCGCATC AAAGGCAGGG AGTGGTACGA AACCGTTTTC
GTGCCGCGCA TCAGTCCCCT GACGCTCGTT GCGCTGCTCT TTACCATCGT GGTGATGTTC
TCGCTGAAAG GCGAGTATAT CGTGAAAATT CCCATGGATG TGGTGCGCAT CGCCATACCG
CTGCTTTGCT ACTTCATCAT CATGTTCTTC GTTTCGTTCT GGATGGGACG CAAGGTCGGT
GCGGATTACT CCAAAACAGC CACGCTCTCA TTCACGGCGG CGAGCAACAA CTTCGAGCTT
GCCATAGCCG TGGCCGTGGC GGTGTTCGGT ATCGATTCCG GCGAAGCGTT CGCCGCCGTG
ATCGGGCCGC TCGTGGAGGT TCCCGCCCTG ATCGGTCTGG TAAATGTATC GCTCTGGTTC
AGGGAGAAAT GGTTCGGGGG ATCATCTGCA GCTTAA
 
Protein sequence
MGMSTKQLSF LDRYLTLWIF LAMAVGVFSG YLVPGTAAFW NTFQSGTTNI PIAIGLIVMM 
YPPLAKVKYE ELGDVFRNTR VLGLSLVQNW VIGPVLMFVL AVLFLSDMPH YMAGLILIGL
ARCIAMVIVW NELAKGDTEY AAGLVAFNSV FQVLFFSVYA WVFLTVLPGW LGLSSFRVDI
TIAEIASSVF IYLGIPFIAG FLTRFVMLRI KGREWYETVF VPRISPLTLV ALLFTIVVMF
SLKGEYIVKI PMDVVRIAIP LLCYFIIMFF VSFWMGRKVG ADYSKTATLS FTAASNNFEL
AIAVAVAVFG IDSGEAFAAV IGPLVEVPAL IGLVNVSLWF REKWFGGSSA A
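
The relationship between the gene sequence and the protein sequence listed here can be reproduced with a short script. This is a minimal sketch, not the method used to generate the record: it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), and the helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Only the letters A, C, G and T are handled; any other symbol raises KeyError.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 36 bases of the gene sequence in this record.
print(translate("ATGGGCATGT CAACGAAACA ACTCTCCTTT"))            # MGMSTKQLSF
print(translate("AGGGAGAAAT GGTTCGGGGG ATCATCTGCA GCTTAA"))     # REKWFGGSSAA
```

Both fragments match the start and end of the protein sequence listed in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.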