Gene BURPS1106A_1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1017 
SymbolcysN 
ID4901023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp993920 
End bp995236 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content67% 
IMG OID640134247 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_001065297 
Protein GI126455060 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCA TCGAGAACAA CGAAGACCTC GGCGTACTGC GGTTCATCAC GGCGGGCAGC 
GTCGACGACG GCAAGAGCAC GCTGATCGGG CGACTGCTGT ACGACAGCAA GGCGGTGCTG
TCCGACCAGC TCTCCGCGCT GTCGCGCGCG AAGAACAAGC GCACGGTGGG CGACGAGCTC
GATCTCGCGC TGCTTACCGA CGGCCTCGAG GCCGAGCGCG AGCAGGGCAT CACGATCGAC
GTCGCGTACC GCTACTTCGC GACCGCGAAG CGCAAGTTCA TCATCGCCGA CACGCCCGGC
CACGAGCAGT ACACGCGCAA CATGGTGACG GGCGCGTCGA CCGCGCATGC GGCGATCATC
CTGATCGACG CGACGCGCGT GACGTTCGAC GCGGGCGCGG CGCAACTGCT GCCGCAGACG
AAGCGCCACA GCGCGATCGT CAAGCTGCTC GATCTGCAGC ACGTGATCGT CGCGATCAAC
AAGATGGATC TCGTCGACTA CAGCGAGACG CGCTTCAACG AGATCCGCGA CGCGTACGTG
AAGCTCGCGC AGCAGCTCGG CCTGGCCGAC GTGCGCTTCG TGCCGGTGTC GGCGTTGAAG
GGCGACAACA TCGTCGCGGC GAGCGAGCGG ATGCCGTGGT ATGCGGGCGA GCCGTTGCTG
AACGTGCTCG AAACGCTGCC CGTCGAGACG CAGGCGCATG ACGCGCTGCG CTTTCCGGTG
CAATGGGTCG CGCGCCAGGA CGGCAGCTCG GCCGACGATT TCCGCGGCTA CATGGGCCGC
ATCGAGGCGG GCGAGGCGAA GGTGGGCGAC GAGATCGTCG TGCTGCCTTC GAACCGTACC
GCGACGATCG CCGAGATCAT CGCGCCGGTG CCGGGCGGCA CGGCGGCCGT CGAGCGCGCG
TTCGCCGGGC AGGCGGTGAC GATCCGCCTG GCCGAGGACG TCGACGTGTC GCGCGGCGAC
ACGTTCGTGC CGCGCGCGCA GGGCGTCGAG CCGGCGAAGA AGCTCGAGGC CGATCTCTGC
TGGTTCGACG AGACGCCGCT TTCGTCGCAG CGCAAGTATC TGCTCAAGCA AACGACGAAC
ACCGTGTTCA CGAAGATCGG CGCGGTCAAG CAGGTGCTCG ACGTGCACAC GCTGTCGCAC
GCGACCGATC GCCACGAGCT GAAAATGAAC GACATCGGCC GCGTCGCGCT GACGCTGCAA
AAGCCGATCG TCTGCGACAC GTACGACGCG CATCCGGGCA CGGGCGCGTT CGTGCTGATC
GACGAGGCGA CCCATCACAC GGTCGCAGCG GGTATGATTC GTGCGTTTTC CGCGTGA
 
Protein sequence
MSIIENNEDL GVLRFITAGS VDDGKSTLIG RLLYDSKAVL SDQLSALSRA KNKRTVGDEL 
DLALLTDGLE AEREQGITID VAYRYFATAK RKFIIADTPG HEQYTRNMVT GASTAHAAII
LIDATRVTFD AGAAQLLPQT KRHSAIVKLL DLQHVIVAIN KMDLVDYSET RFNEIRDAYV
KLAQQLGLAD VRFVPVSALK GDNIVAASER MPWYAGEPLL NVLETLPVET QAHDALRFPV
QWVARQDGSS ADDFRGYMGR IEAGEAKVGD EIVVLPSNRT ATIAEIIAPV PGGTAAVERA
FAGQAVTIRL AEDVDVSRGD TFVPRAQGVE PAKKLEADLC WFDETPLSSQ RKYLLKQTTN
TVFTKIGAVK QVLDVHTLSH ATDRHELKMN DIGRVALTLQ KPIVCDTYDA HPGTGAFVLI
DEATHHTVAA GMIRAFSA