Gene Sare_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0738 
Symbol 
ID5707770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp821215 
End bp822522 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content68% 
IMG OID641270257 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_001535648 
Protein GI159036395 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.20107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00760021 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCACCG AGACGACCGC CGAGGTCTGG TCCGAAGCGG CACCCGCCCG CCCGATGGAC 
CTGCTGCGGT TCGCGACCGC CGGCAGCGTG GACGACGGCA AGTCGACCCT GATCGGCCGC
CTGCTGTACG ACACCAGGTC ACTGTTCACC GACCAACTCG CCGCGGTCGA GGCGGTCAGC
GCGGCCCGCG GTGACGAGTA CACCAACCTC GCGTTGCTCA CCGACGGCCT GCGGGCCGAG
CGGGAGCAGG GCATCACGAT CGACGTGGCG TACCGCTACT TCGCCACCCC CCGGCGCAAG
TTCATCATCG CCGACACCCC CGGGCACATC CAGTACACCC GGAACATGGT CACGGGCGCG
TCCACCGCCG ACCTCGCACT GGTCTTGGTC GACGCCCGTA AGGGCCTGGT GGAGCAGTCC
CGCCGGCACG CCTTCCTCTG CTCGCTGCTC CGGGTGCCCC ACCTGGTGCT GTGCGTCAAC
AAGATGGACC TGGTGGACTG GTCGAAAGAG GTCTTCGAGA AGATCGCCGA CGAGTTCACC
GCGTTCGCGG CGAAGCTCGA CGTGCCTGAC CTGACGGTGG TGCCGATCTC CGCGCTCAAC
GGCGACAACA TCGTCTCCCG GTCGGAGAAC ATGCCCTGGT ACGAGGGGCC GTCCCTCCTG
CACCACCTGG AACGGGTGCA CATCGCCAGC GACCGCAACC TGGTCGACGT GCGCTTCCCG
GTGCAGTATG TGATTCGTCC GCAGTCCACC ACCGTCACCG ACTACCGGGG GTACGCGGGC
CAGGTCGCCT CCGGTGTACT CAAGCCGGGT GACGAGGTGC TGGTGCTGCC GAGCGGTTTC
ACCAGCCGGA TCGCCGCAGT GGAGACCGCC GACGGGCCGG TCCCGGAGGC GTTCCCACCG
ATGTCGGTGA CCGTACGGTT GACCGACGAG ATCGACATTT CCCGGGGCGA CATGATCTGC
CGTCCGCACA ACGCCCCGGC CGTTGCCCAG GACATCGAGG CGATGGTCTG CTGGATGGAC
GAGACCCGGC CGTTGCAGGT CGGCGGCAGA TACGCGATCA AGCACACCAC CCGATCGGCG
CGGGCGATCG TCCGTGGGCT GCACTACCGG CTGGACATCA ACTCGCTGCA CCGGGACGAG
TCGGCGGGAG AGCTGCGGCT CAACGAGATC GGCCGGGTCC GGTTCCGGAC GATGGTTCCG
CTGCTCGCTG ACGAGTACCG CCGTAACCGC ACCACCGGCG GTTTCGTCAT CATCGACGAG
ACGACGAACC GGACGGTCGG CGCCGGCATG ATCGTCGAGG CGAGCTGA
 
Protein sequence
MTTETTAEVW SEAAPARPMD LLRFATAGSV DDGKSTLIGR LLYDTRSLFT DQLAAVEAVS 
AARGDEYTNL ALLTDGLRAE REQGITIDVA YRYFATPRRK FIIADTPGHI QYTRNMVTGA
STADLALVLV DARKGLVEQS RRHAFLCSLL RVPHLVLCVN KMDLVDWSKE VFEKIADEFT
AFAAKLDVPD LTVVPISALN GDNIVSRSEN MPWYEGPSLL HHLERVHIAS DRNLVDVRFP
VQYVIRPQST TVTDYRGYAG QVASGVLKPG DEVLVLPSGF TSRIAAVETA DGPVPEAFPP
MSVTVRLTDE IDISRGDMIC RPHNAPAVAQ DIEAMVCWMD ETRPLQVGGR YAIKHTTRSA
RAIVRGLHYR LDINSLHRDE SAGELRLNEI GRVRFRTMVP LLADEYRRNR TTGGFVIIDE
TTNRTVGAGM IVEAS