Gene Hhal_2354 details

Gene Information

Locus tag: Hhal_2354
Symbol:
ID: 4709077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2579540
End bp: 2581453
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 70%
IMG OID: 639856829
Product: sulfate adenylyltransferase, large subunit
Protein accession: YP_001003919
Protein GI: 121999132
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0529] Adenylylsulfate kinase and related kinases
        [COG2895] GTPases - Sulfate adenylate transferase subunit 1
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00455] adenylylsulfate kinase (apsK)
            [TIGR02034] sulfate adenylyltransferase, large subunit
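
The start, end, and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2579540, 2581453)
print(length_bp)                  # 1914
print(protein_length(length_bp))  # 637
```

Under these assumptions both results agree with the listed Gene Length (1914 bp) and Protein Length (637 aa).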


Plasmid Coverage information

Number of covering plasmid clones: 15
Plasmid unclonability p-value: 0.643084
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Number of covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTCCCACG CCTCCGATCT CATCGAAACC GACATCGAGC GCTACCTCAA GCAGCACGAG 
CACAAGGACC TGCTGCGCTT CATCACCTGC GGCTCGGTGG ACGACGGCAA GAGCACCTTC
ATCGGCCGGC TGCTGCACGA CTCGGCGCTG GTCTACGAGG ATCAACTGGC CGCCGTGCGG
CAGGACTCCA CCCGCTACGG CACCACCGGC GACGACGTGG ACCTGGCGCT GCTCGTCGAC
GGCCTGCAGT CCGAGCGCGA GCAGGGCATT ACCATCGATG TCGCCTACCG CTACTTCTCC
ACCGACCGGC GCAAGTTCAT CATTGCCGAC ACCCCGGGCC ACGAGCAGTA CACCCGCAAC
ATGGCCACCG GCGCCTCGAC GGCGCAGCTG GCGGTGATCC TGGTCGATGC CCGCCACGGG
GTGCAGGTGC AGACCCGGCG GCACAGCTAC ATCTGCGCCC TGCTGGGGAT CCGCCACGTG
CTGCTGGCCG TCAACAAGAT GGACCTGGTC GATTGGGATC AGGGCACCTT CGAGGCGATC
CGCGACGAGT ACACCGCCTT CGCCCGCCGC CTCGGCATCC CCGATGTGCG CTGCGTGCCG
CTGTCCGCGC TCAAGGGGGA TAACGTCGTC CACCGCGGCG AGCACCTGCC CTGGTACGAC
GGCCCAACGC TCATGGAGCT GCTGGAGACC GTGGAGGCCA AGGCCGACCG CAATCTGCGC
GATCTGCGCC TGCCGGTGCA GACTGTGGTC CGCCCCTCCC ACGACTTCCG CGGCTTCGCC
GGCACCCTGG CCGCCGGCAC GGTGCGCCCC GGGGACGAGG TCGTGGCCCT GCCTTCCGGG
CTGCGCAGCC GCGTGGCGCG CATCGTCACC TACGACGGCG ACCTCGACGT GGCCTTTCCG
CCGCAGTCGG TGACCGTCAC CCTGACCGAC GAGATCGACG TCTCACGCGG CGATGTCCTG
GCCAGCCCGA CCCACCCGGC CACCGTGGAC GACACCCTGG ATGCGCGCAT CGTGTGGATG
GCCGAGCAAC CGCTGCTGCC CGGACGCCAG TACGACATCA AGCTGGGCAC GGCCACCGTC
CCGGCCGTGG TCGAACGGAT CCACCACCGC ATCGACGTCA ACACCCTCGA GCACCACCAG
GTGGAGGAAC TCGGGCTCAA CGAGATCGGC CTGTGTCGCG TCCAGCTCTC CGCCCGGGTG
CCCTTTGACC CCTACGACGA GATCGCCAAC ACCGGATCGT TCATCGTCAT CGACCGGATG
AGCCTGCACA CCGTCGGCGC CGGCATGGTC ACCCGCGCGG CAACCGAGGC CGCCGGCGCC
GAGACCGATG TCCCGCGCCG CCGGCTGGCC CTGAGCAAGG CCCAGCGCGC CGGGCAGAAG
GGGCAGCGGC CGTGCATCGT CTGGCTCACC GGGCTGTCCG GCTCCGGCAA GTCGAGCCTG
GCCAACGCCC TGGAGCAGGC GCTGTTCCGG CGCGGCTACC ACAGCTACCT GATCGACGCG
GGCAACGTCC GCCACGGGCT GAGCCACGAC CTGGACTTCA GCCGCGACGC GCGGGCCGAG
AACATCCGCC GCATGGCCGA GACAGCCACC ATGTTCGTCG ACGCCGGGCT GATCACCGTC
TGCGCCAGCC TCTCGCCGTA CCGCGACGAC CGCGCCATGG TCCGCGAGCG GGTCGAACCC
GGCGAGTTCA TCGAGGTGCA CGTGGACGCC ACCATCGACG CGTGCCGCGC CGCGGACCTG
GACGGGCTCT ACGCCCGCGC CGACGCTGGC GAGATCCAGG GCCTGCCCGG TGTGGACATC
CCCTACGAGG CGCCGGAACA GCCCGAGGTC CGCGTGGACA CGGTGGCCGA GGACCTGGAG
ACCTCGGTGC GCAAGATCCT CACCGCCCTG GAGGAGCGCG GGGTGCTGCG CTAG
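
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below is illustrative only: it strips the layout whitespace and computes a GC percentage, which is one way to compare a sequence against the GC content field in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unformatted DNA string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGTCCCACG CCTCCGATCT CATCGAAACC GACATCGAGC GCTACCTCAA GCAGCACGAG"
seq = clean_sequence(first_line)
print(len(seq))                  # 60
print(round(gc_percent(seq), 1))
```

Passing the full block of sequence lines to `clean_sequence` should give a string whose length matches the Gene Length field, assuming the sequence was captured completely.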
 
Protein sequence
MSHASDLIET DIERYLKQHE HKDLLRFITC GSVDDGKSTF IGRLLHDSAL VYEDQLAAVR 
QDSTRYGTTG DDVDLALLVD GLQSEREQGI TIDVAYRYFS TDRRKFIIAD TPGHEQYTRN
MATGASTAQL AVILVDARHG VQVQTRRHSY ICALLGIRHV LLAVNKMDLV DWDQGTFEAI
RDEYTAFARR LGIPDVRCVP LSALKGDNVV HRGEHLPWYD GPTLMELLET VEAKADRNLR
DLRLPVQTVV RPSHDFRGFA GTLAAGTVRP GDEVVALPSG LRSRVARIVT YDGDLDVAFP
PQSVTVTLTD EIDVSRGDVL ASPTHPATVD DTLDARIVWM AEQPLLPGRQ YDIKLGTATV
PAVVERIHHR IDVNTLEHHQ VEELGLNEIG LCRVQLSARV PFDPYDEIAN TGSFIVIDRM
SLHTVGAGMV TRAATEAAGA ETDVPRRRLA LSKAQRAGQK GQRPCIVWLT GLSGSGKSSL
ANALEQALFR RGYHSYLIDA GNVRHGLSHD LDFSRDARAE NIRRMAETAT MFVDAGLITV
CASLSPYRDD RAMVRERVEP GEFIEVHVDA TIDACRAADL DGLYARADAG EIQGLPGVDI
PYEAPEQPEV RVDTVAEDLE TSVRKILTAL EERGVLR
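
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may initiate translation, so the sketch below uses the standard assignments. It is a simplified illustration, not the tool that produced this record: it does not handle alternative start codons or ambiguous bases, and the name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at a stop codon.

    Whitespace is ignored, so block-formatted sequences can be passed in.
    A codon containing anything other than A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGTCCCACG CCTCCGATCT CATCGAAACC"))  # MSHASDLIET
```

The ten residues returned match the first block of the protein sequence above (MSHASDLIET).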