Gene Cag_1587 details


Gene Information

Locus tag: Cag_1587
Symbol: sat
ID: 3746662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2071946
End bp: 2073160
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 45%
IMG OID: 637774127
Product: sulfate adenylyltransferase
Protein accession: YP_379885
Protein GI: 78189547
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2046] ATP sulfurylase (sulfate adenylyltransferase)
TIGRFAM ID: [TIGR00339] ATP sulphurylase
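
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2071946, 2073160)
print(length)                     # 1215
print(protein_length_aa(length))  # 404
```

Both results agree with the Gene Length and Protein Length fields in the record.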


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.369869
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTGG TCAATCCCCA CGGAAAAGAA AAAGTTCTTA AGCCGCTATT GCTCACCGGT 
GAAGAGTTGA CTGCCGAAAA AGCTCGAGCG CAATCGTTTG CACAAGTGCG TTTATCGTCT
CGTGAAACGG GCGACCTTAT TATGCTTGGT ATTGGCGGTT TTACTCCACT AACAGGCTTT
ATGGGGCATG ATGATTGGAA GGGAAGTGTA CAAGATTGCC GCATGGCTGA TGGTACTTTT
TGGCCTATTC CCATTACCCT TTCCACTTCA AAAGAAAAAG CTGACGAACT CTCCATAGGG
CAAGAAGTTG CTCTTGTTGA CGATGAATCG GGTGAATTGA TGGGGAGTAT GGTTATTGAA
GAGAAGTACT CTATTGATAA AGCTTTTGAG TGTCAAGAGG TTTTTAAAAC CACCGATCCT
GAGCATCCAG GTGTGTTAAT GGTTATGAAC CAAGGGGATG TAAACCTTGC TGGACGTGTC
AAAGTTTTTA GTGAAGGCAC CTTTCCTACT GAATTTGCAG GTATTTACAT GACACCTGCT
GAAACCCGCA AAATGTTTGA GGCAAATGGT TGGAGCACAG TAGCTGCCTT CCAAACCCGC
AACCCGATGC ACCGCTCCCA CGAATATCTT GTTAAAATTG CGATTGAAGT ATGTGATGGC
GTTTTAATCC ATCAGCTTCT TGGTAAGCTT AAGCCGGGTG ATATTCCTGC CGATGTTCGT
AAAGAGTGCA TTAATGCGTT GATGGAAAAA TATTTTGTGA AAGGCACTTG CATACAAGGA
GGTTATCCGC TTGATATGCG CTATGCAGGT CCTCGTGAGG CGTTGCTTCA TGCGCTGTTC
CGCCAGAATT TTGGTTGCAG TCACTTAATA GTTGGTAGAG ACCACGCAGG CGTAGGCGAC
TACTATGGAC CTTTTGATGC CCACCACATT TTCGATCAAA TTCCTGCCGA TGCACTTGAA
ACCAAACCGC TCAAAATAGA TTGGACATTC TACTGCTATA AGTGTGATGG CATGGCTTCT
ATGAAAACTT GCCCACACAC GGCTGAAGAT CGTCTTAACC TCAGTGGCAC GAAACTACGT
AAAATGCTTT CTGAAGGCGA GCAAGTGCCT GAGCATTTTA GCCGTCCTGA AGTGCTTGAA
ATTCTCCAAC GTTATTATGC TTCGCTGACG CAAAAGGTTG ATATTAAACT GCATAGCCAT
GCAGTTGGTA AATAA
 
Protein sequence
MSLVNPHGKE KVLKPLLLTG EELTAEKARA QSFAQVRLSS RETGDLIMLG IGGFTPLTGF 
MGHDDWKGSV QDCRMADGTF WPIPITLSTS KEKADELSIG QEVALVDDES GELMGSMVIE
EKYSIDKAFE CQEVFKTTDP EHPGVLMVMN QGDVNLAGRV KVFSEGTFPT EFAGIYMTPA
ETRKMFEANG WSTVAAFQTR NPMHRSHEYL VKIAIEVCDG VLIHQLLGKL KPGDIPADVR
KECINALMEK YFVKGTCIQG GYPLDMRYAG PREALLHALF RQNFGCSHLI VGRDHAGVGD
YYGPFDAHHI FDQIPADALE TKPLKIDWTF YCYKCDGMAS MKTCPHTAED RLNLSGTKLR
KMLSEGEQVP EHFSRPEVLE ILQRYYASLT QKVDIKLHSH AVGK
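
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard table (the two differ in permitted start codons), it does not handle alternative start codons, and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Whitespace in the 10-base blocks is removed before processing.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11
# (the tables differ in permitted start codons, not in this mapping).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGTCTTTGG TCAATCCCCA CGGAAAAGAA"))  # MSLVNPHGKE
print(translate("GCAGTTGGTA AATAA"))                  # AVGK
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.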