Gene Daro_2945 details


Gene Information

Locus tag: Daro_2945
Symbol:
ID: 3566135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3179233
End bp: 3180513
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 62%
IMG OID: 637681414
Product: sulfate adenylyltransferase subunit 1
Protein accession: YP_286145
Protein GI: 71908558
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2895] GTPases - Sulfate adenylate transferase subunit 1
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR02034] sulfate adenylyltransferase, large subunit
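
The start and end coordinates, gene length, and protein length in this record are consistent with one another. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the "Start bp" and "End bp" fields of the record.
start_bp = 3179233
end_bp = 3180513

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```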


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value: 0.53268
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCCC AAGTTGAACG AGTTTACGAA GATGCCGGCC TGCTGCGCTT TCTGACCTGC 
GGCAGCGTCG ATGACGGCAA GAGCACCCTG ATCGGCCGCC TGCTGTTCGA CACCAAGACC
ATCCTCGCCG ATACACTGAG CGCCATCGCC AAGACTTCCG AAAAGCGCGG CATGGGCGCG
GTCGACCTCT CGCTGCTGAC TGACGGCCTG CAGGCCGAGC GCGAGCAAGG CATCACCATC
GACGTGGCCT ACCGCTACTT CTCGACCGGC ACGCGCAAGT ACATCATCGC CGACGCGCCG
GGCCACGAGC AGTACACCCG CAACATGGTC ACCGCCGCCT CGACCGCCAA CCTGGCCATC
ATCCTGATCG ATGCGCGCAA AGGTGTGCTG ACCCAGACCC GCCGCCACTC CAAGCTCGCC
TCGCTGGTCG GCATCCCGCA CCTGATCGTC GCCATCAACA AGATGGACCT CGCCGATTAT
TCGCAGGAAA CCTACGAGCG CATCAAGGGC GAATACCTTG AATTCGCCGC CAAGGTCGGC
ATCGAAGACA TCCGCTTCAT CCCGCTCTCG GCGCTCAATG GCGACATGAT CGTCGACCGT
GGCGACAAGC TGAACTGGTA CGAAGGTCCG ACCCTGCTCG AAATGCTCGA AACCGCCCCG
GCCGCCCATA CCGAGCACAC CGAGAAGTTC CGTTTCCCGG TCCAGTACGT CTGCCGCCCG
CAGGATTCTG CCAACCCAGA GCTGCACGAC TACCGCGGCT TCATGGGTCG CGTCGAAGCC
GGTTCGATCA AGGTCGGCGA TGCCGTCACC GTTTTGCCTT CCGGCCGCGA GTCGACTGTC
AAAGCCGTGC AACTGGGCGG CGTCGATATC GGCGAAGCCT TCTGCGAACA GTCGATCACC
CTGCTGCTGG CCGATGAAAT TGACACCTCG CGTGGCGACA TGATCGTCAA GTCGAGCGAA
GTGCCGGCTG CCGTCAAGCA GATCGAAGCC ACCGTCTGCT GGATGGCCGA ACAACCGATG
GACCGCGCCC GCACCTACCT GATCCGCCAC ACGACGCGCG ACTCGAAGGC CAAGCTGGCC
GCCATCGACC ATCGCCTCGA TGTGAATACG CTGGAAAAAG TCCCGGCCGA AAAGTTGGCC
ATGAACGACA TCGCCCAAGT CACCTTCAAG CTGGCCCAGC CGCTGTTCGC CGATCCCTAC
CTGGAAAATC GCGGCACCGG CGCTTTCATC ATCATCGACG AAAGCAACAA CAACACGGTC
GGCGCCGGCA TGATCCTCTA A
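
The gene sequence is displayed in blocks of 10 bases separated by spaces. A GC content figure like the one in the record can be recomputed from such text with the sketch below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as sample input; paste the full sequence to check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not sequence:
        return 0.0
    gc_count = sequence.count("G") + sequence.count("C")
    return gc_count / len(sequence)


# First displayed line of the gene sequence, used here only as sample input.
sample = clean_sequence(
    "ATGACCGCCC AAGTTGAACG AGTTTACGAA GATGCCGGCC TGCTGCGCTT TCTGACCTGC"
)
print(f"{len(sample)} bases, GC fraction {gc_fraction(sample):.2f}")
```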
 
Protein sequence
MTAQVERVYE DAGLLRFLTC GSVDDGKSTL IGRLLFDTKT ILADTLSAIA KTSEKRGMGA 
VDLSLLTDGL QAEREQGITI DVAYRYFSTG TRKYIIADAP GHEQYTRNMV TAASTANLAI
ILIDARKGVL TQTRRHSKLA SLVGIPHLIV AINKMDLADY SQETYERIKG EYLEFAAKVG
IEDIRFIPLS ALNGDMIVDR GDKLNWYEGP TLLEMLETAP AAHTEHTEKF RFPVQYVCRP
QDSANPELHD YRGFMGRVEA GSIKVGDAVT VLPSGRESTV KAVQLGGVDI GEAFCEQSIT
LLLADEIDTS RGDMIVKSSE VPAAVKQIEA TVCWMAEQPM DRARTYLIRH TTRDSKAKLA
AIDHRLDVNT LEKVPAEKLA MNDIAQVTFK LAQPLFADPY LENRGTGAFI IIDESNNNTV
GAGMIL
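
The protein sequence can be derived from the gene sequence by reading it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons; that is not handled here). The `translate` helper and `CODON_TABLE` are illustrative names, not part of the record.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    sequence = "".join(dna.split()).upper()  # drop display spaces and newlines
    residues = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three displayed blocks of the gene sequence.
print(translate("ATGACCGCCC AAGTTGAACG AGTTTACGAA"))  # MTAQVERVYE
```

The output matches the first ten residues of the protein sequence listed in the record.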