Gene Syncc9902_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_2044 
Symbol 
ID3743004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1953168 
End bp1954340 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content57% 
IMG OID637772241 
ProductATP-sulfurylase 
Protein accessionYP_378045 
Protein GI78185611 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCCA GTGCTTCTGC CTCAGCCAAG AGATCCGGAG TGATCGCTCC CTATGGCGGC 
ACGCTGGTGG ATCTCATGGT GCCGAGCGCG GATCAACCTG CGTTGAAGGC ATCAGCTACC
AAAACGTTGG AATGCTCAGA CCGCAACGCC TGTGACGTGG AATTGCTGGT GGTCGGAGGG
TTTTCCCCTT TACGCGGCTT TATGCACCAG GAGGACTACG ACGCTGTTGT GTCCGGTCAT
CGCACGTCAG CAGGCCATTT ATTCGGTTTG CCAATCGTGA TGGACACCGA TCGCGACGAC
GTGGTGGTGG GAGACAAACT CTTGCTGACT TACAAGGGGC AAGAGCTTGC TCTTCTCGAG
GTTGAGGACA AGTGGGAACC CAACAAGGTG GTTGAGGCCC AGGGGTGTTA CGGCACGACA
TCGCTTGAAC ACCCCGCTGT GCGCATGATC GCGATGGAAC GCAAATGCTT CTATCTAGGC
GGCACGCTGA AGGGTTTGGA GCTGCCAAGC CGCGTTTTCC CCTGCAAAAC CCCGGCCGAA
GTTCGTTCTG ATTTGCCCCA TGGCGAAGAC GTGGTGGCCT TCCAATGCCG TAACCCCATT
CACCGCGCCC ACTACGAACT GTTTACCCGG GCTCTACATG CCCAAAATGT GAGCGAGAAC
GCCGTGGTGT TAGTGCACCC CACCTGTGGA CCAACCCAGC AGGACGACAT CCCAGGGTCG
GTTCGTTTTG AGACCTACGA GCGCTTGGCG GCCGAGGTGA ACAATGATCG AATTCGGTGG
GCTTATCTCC CCTATGCCAT GCACATGGCA GGGCCACGGG AAGCCCTCCA GCACATGATT
ATTCGCAGGA ATTATGGGTG CACCCATTTC ATCATTGGCC GCGATATGGC GGGTTGTAAG
TCCTCTCTGA CTGGCGACGA TTTTTACGGC CCCTATGACG CTCAGAACTT TGCGAAGGAG
TGTGCACCAG AGCTCACCAT GGAGACGGTG CCTTCTCTGA ATCTTGTTTA CACGCAGGAG
GAGGGCTACG TCACCGCTGA ACATGCGGAA GCGCGTGGAC TCCATGTGAA AAAGCTCAGC
GGCACACAGT TCCGCAAGAT GCTGCGTGGT GGTGAGGAGA TTCCTGAGTG GTTTGCCTTC
AAGAGCGTCG TTGAGGTGCT CCGTTCCTCA TGA
 
Protein sequence
MTASASASAK RSGVIAPYGG TLVDLMVPSA DQPALKASAT KTLECSDRNA CDVELLVVGG 
FSPLRGFMHQ EDYDAVVSGH RTSAGHLFGL PIVMDTDRDD VVVGDKLLLT YKGQELALLE
VEDKWEPNKV VEAQGCYGTT SLEHPAVRMI AMERKCFYLG GTLKGLELPS RVFPCKTPAE
VRSDLPHGED VVAFQCRNPI HRAHYELFTR ALHAQNVSEN AVVLVHPTCG PTQQDDIPGS
VRFETYERLA AEVNNDRIRW AYLPYAMHMA GPREALQHMI IRRNYGCTHF IIGRDMAGCK
SSLTGDDFYG PYDAQNFAKE CAPELTMETV PSLNLVYTQE EGYVTAEHAE ARGLHVKKLS
GTQFRKMLRG GEEIPEWFAF KSVVEVLRSS