Gene Cphamn1_1839 details

Gene Information

Locus tag: Cphamn1_1839
Symbol: sat
ID: 6375530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1996195
End bp: 1997409
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 49%
IMG OID: 642684335
Product: sulfate adenylyltransferase
Protein accession: YP_001960237
Protein GI: 189500767
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2046] ATP sulfurylase (sulfate adenylyltransferase)
TIGRFAM ID: [TIGR00339] ATP sulphurylase
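
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Start bp and End bp taken from the record above.
length = gene_length_bp(1996195, 1997409)
print(length, protein_length_aa(length))
```

For this record the sketch gives 1215 bp and 404 aa, which agrees with the Gene Length and Protein Length fields.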


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0886206
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0259449
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTTGG TAAATCCGCA TGGAAAAGAT AAGGTTTTAA AGCCGCTGTT ATTGTCCGGT 
GAGGAATTGC AGAATGAGAT GGAAAAAGCG AAGTCACTGA AAGAGGTTCG CTTGTCGTCG
AGGGAGACCG GCGATCTCAT CATGCTCGGA ATAGGTGGAT TTACTCCTTT GGAAGGGTTT
ATGGGGTACG ACGACTGGAA GGGAAGTGTA GAGAACTGTA TGATGGCTGA TGGAACGTTC
TGGCCGATTC CTATCACGCT TTCGACCTCG AAGGAACTTG GTGACACCCT CGGTATAGGA
GAGGAAGTTG CGCTTGTTGA CGATGAATCC GGCGAGCTTA TGGGGAGCAT GGTTGTCGAA
GAGAAGTACG AGATCGACAA GGCTCATGAG TGCAGGGAGG TTTTCAAGAC TGACAATATC
GAGCATCCCG GTGTCCTGCA GGTTATGCAA CAGGGTGAGG TGAATCTCGG TGGTCCTGTA
AAAGTTTTCA GTGAAGGTTC TTTTCCTTCC GAGTTTGCAG GTGTATATAT GACTCCTGCA
GAGACAAGGG CGCTTTTCGA GAAAAACGGA TGGAGTACCG TTGCCGCCTT TCAGACAAGA
AATCCCATGC ACCGCTCACA TGAGTATCTT GTCAAAATCG CGATTGAAAT CTGTGACGGC
GTGCTGATTC ATCAGCTTCT CGGTAAACTG AAGCCCGGTG ATATCCCTGC GGATGTCAGA
AAAGATTCCA TCAACGCCTT GATGGAGAAC TACTTTGTAA AGGGAACCTG TATTCAGGGC
GGCTATCCTC TCGATATGCG CTATGCCGGT CCGAGAGAGG CGCTTCTTCA TGCTCTGTTC
AGGCAGAACT TCGGCTGCAG TCACCTGATT GTCGGTAGAG ATCACGCCGG TGTCGGTGAC
TACTATGGCC CCTTTGACGC GCATCACATT TTTGATGAAA TTCCCCGGGA TGCTCTCGAA
ACAAAACCTC TCAAGATAGA CTGGACTTTT TACTGTTACA AATGTGACGG TATGGCCTCC
ATGAAGACCT GTCCTCATGG TAAGGATGAC AGATTGAGCC TGAGCGGCAC GAAGCTCAGA
AAGATGCTTT CTGAAGGCGA GGAAGTTCCC GATCACTTCA GCCGTCCTGA AGTTCTTGAG
ATTCTGAAGA AATATTATGC CGGCCTTGAA GAGAAAGTAG AGGTCAAGAT GCACACCCAT
GCAGAGGGTA AATAA
 
Protein sequence
MPLVNPHGKD KVLKPLLLSG EELQNEMEKA KSLKEVRLSS RETGDLIMLG IGGFTPLEGF 
MGYDDWKGSV ENCMMADGTF WPIPITLSTS KELGDTLGIG EEVALVDDES GELMGSMVVE
EKYEIDKAHE CREVFKTDNI EHPGVLQVMQ QGEVNLGGPV KVFSEGSFPS EFAGVYMTPA
ETRALFEKNG WSTVAAFQTR NPMHRSHEYL VKIAIEICDG VLIHQLLGKL KPGDIPADVR
KDSINALMEN YFVKGTCIQG GYPLDMRYAG PREALLHALF RQNFGCSHLI VGRDHAGVGD
YYGPFDAHHI FDEIPRDALE TKPLKIDWTF YCYKCDGMAS MKTCPHGKDD RLSLSGTKLR
KMLSEGEEVP DHFSRPEVLE ILKKYYAGLE EKVEVKMHTH AEGK
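
Both sequences above are printed in blocks of ten characters, so the blocks have to be joined before the sequence is used. As a rough sketch, the hypothetical helpers below strip the block spacing and translate a coding sequence. The codon assignments are those of the standard genetic code, which translation table 11 shares; the table's alternative start codons are not handled here, which is a simplification. The names `join_blocks` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order, for codons enumerated T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(text.split()).upper()


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCCCTTGG TAAATCCGCA TGGAAAAGAT"
print(translate(join_blocks(first_blocks)))  # MPLVNPHGKD
```

The printed residues match the first block of the protein sequence in this record, and translating the final block of the gene sequence in the same way ends at the stop codon after AEGK.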