Gene Arth_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3122 
Symbol 
ID4444355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3503185 
End bp3504618 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content69% 
IMG OID639690948 
Productsulfate adenylyltransferase subunit 1 
Protein accessionYP_832600 
Protein GI116671667 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCG AAATCGATAC AGCCCGCGCG GCCGCCCTTC TTGACGAGGC CCCCCTCGCG 
CACGCGTCGC TGTTCCGCTT CGCCACCGCA GGATCGGTCG ACGACGGCAA GTCCACTTTG
GTGGGCCGCC TCCTGCACGA CTCCAAGGCA ATCCTCGCCG ACCAGCTCGA CGCCGTCGCC
CGCACCTCCG CGGACCGCGG ATTTGGCGGC GCCGGGGCCA CCGGCACGAA AGCGATCGAC
CTCGCCCTCC TGACCGACGG CCTGCGTGCC GAGCGCGAGC AGGGCATCAC CATCGACGTC
GCCTACCGCT ACTTCGCCAC CGACCGCCGC AGCTTCATCC TGGCTGACTG CCCCGGGCAC
GTGCAGTACA CCAAGAACAC GGTGACCGGC GCGTCCACCG CGGATGCCGT CGTCGTACTC
ATTGACGCCC GCAAGGGTGT CCTGGAGCAG ACCCGCCGGC ACCTCTCCGT GCTGCAGCTG
CTGCGCGTGG CCCACGTGAT CGTGGCCGTG AACAAGATCG ACCTGGTGGA CTTCAGCGAG
GACGTGTTCC GCGGGATCGA GGCCGACGTG CAGAAGGTTG GCCGCGAACT GGGCCTCGGA
GCCGATGGCA TCACCGACCT GCTGGTGGTT CCGGTTTCCG CGCTCGACGG CGACAACGTG
GTGGAGCGCT CGGAGCGCAC CCCCTGGTAC ACGGGCCCGG CACTGCTCGA AGTCCTCGAA
ACCCTTCCTG CCGCGGACGA ACTGGAAAGC CACCTGGAGA GCTTCCGTTT CCCGGTGCAG
CTCGTCATCC GGCCGCAGGG CGCGCTGGCT CCCGACGCGG TTGCCGGCGG ACTCGACGTC
GAGAAATACC GTGACTACCG TGCCTACGCC GGGCAGATCA CCGAAGGCTC GGTGCAGGTG
GGGGACAAGG TCAGCGTGCT GACCCCCGGC CAGGACCCGC GCACCACCAC GGTGACGGGC
ATCGACTTCG CGGGCGCCGA GCTCACCGAA GCCGTGGCAC CGCAGTCGGT GGCAATCCGC
CTCGCTGACG AATTCGATGT GGCTCGCGGT GACACGATCG CCGCCGCAGG CACCGTCCGT
GAAGCCTCCG CCGACCTCTA CGCCGCGCTT TGCTGGCTGT CCCCAAAGCC GCTCCGCGAG
GGCGCCAAGG TGCTGGTCAA GCACGGCACG CGCACCGTGC AGGCGCTGGT CCGCAGCGTC
AGCGGGAAAC TGGACCTCGC CACCTTCAAG CTTGAGGGCG CGTCCAGCCT GGAGCTCAAC
GACATCGGCC ACGCGCAGCT CCGGCTCGCC GCCCCGCTGC CGCTGGAAAA CTACCTCCAC
CACCGCCGTA CCGGCGCGTT CCTGGTGATC GATCCGCTCG ACGGCAACAC CCTGGCCGCC
GGCCTGGTCA ATGACCACCC GGGCGACCAC GAGGACGAGC GCTACAGCAT CTGA
 
Protein sequence
MSTEIDTARA AALLDEAPLA HASLFRFATA GSVDDGKSTL VGRLLHDSKA ILADQLDAVA 
RTSADRGFGG AGATGTKAID LALLTDGLRA EREQGITIDV AYRYFATDRR SFILADCPGH
VQYTKNTVTG ASTADAVVVL IDARKGVLEQ TRRHLSVLQL LRVAHVIVAV NKIDLVDFSE
DVFRGIEADV QKVGRELGLG ADGITDLLVV PVSALDGDNV VERSERTPWY TGPALLEVLE
TLPAADELES HLESFRFPVQ LVIRPQGALA PDAVAGGLDV EKYRDYRAYA GQITEGSVQV
GDKVSVLTPG QDPRTTTVTG IDFAGAELTE AVAPQSVAIR LADEFDVARG DTIAAAGTVR
EASADLYAAL CWLSPKPLRE GAKVLVKHGT RTVQALVRSV SGKLDLATFK LEGASSLELN
DIGHAQLRLA APLPLENYLH HRRTGAFLVI DPLDGNTLAA GLVNDHPGDH EDERYSI