Gene Namu_4376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4376 
Symbol 
ID8450002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4859025 
End bp4860455 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content72% 
IMG OID645043423 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_003203652 
Protein GI258654496 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACGG TTGTGAACGA CAGCGCTGCG GTCACCGAGG TGGTCGGTGC GGGCGCCGCC 
GGGGTCGATA CCGCCAGCGA AGCGCACGAA TTGGCGCAGG AGCTCTCACA CGCCCGCGGG
CTGCTGCGGT TCGCCACCGC CGGCTCGGTC GACGACGGCA AGTCGACCCT GGTCGGCCGG
CTGCTGCACG ACTCCAAGTC GGTGCTCTCC GATCAGCTCG ACGCGGTGGA GCGCACCTCG
CAGGCTCGTG GCCTGGAGAC CGTCGATCTG TCCCTGCTGG TGGACGGCCT GCGGGCCGAG
CGCGAGCAGG GCATCACCAT CGACGTGGCC TACCGCTACT TCGCCACCCC GACCCGGACC
TTCATCCTGG CCGACACCCC CGGGCACGTG CAGTACACCC GCAACACGGT GACCGGGGCA
TCCACCGCCG ACCTGGTGGT GATCCTGACC GACGCCCGCA CCGGGGTCGT CGCGCAGACC
CGCCGGCACG CCGCGGTGGC CGCGCTGCTG GGGGTGCCGC ACCTGCTGCT GGCCGTCAAC
AAGATCGACT TGGTCGACTA CGACCAGGGC GTGTTCGACG CCATCGCCGG TGAGTTCTCC
GCCTACGCCA AGGGTCTGGG CATCCACGAG ATCACCTCCA TCCCGATCTC GGCGCTCAAC
GGCGACAACG TCGTCACCCG CTCGGAGCGG ATCGACTGGT ACGACGGGCC CAGCGTGCTG
GAGCATCTGG AGACGGTCGA CCTGGCCGCC GACACGGACG AGGCCCTGCG GTTCCCGGTG
CAGTACGTCA TCCGGCCGCG CTCGGACGAG CTGCCCGACT ACCGCGGCTA CGCCGGCCGG
GTCGGGGCCG GCACGGTCCG CCCGGGCGAC GAGGTCGTCG TGCTGCCCTC CGGGCGACGC
TCGACCGTCA CCGGGATCGA CACCGCGGAC GGCCCCCGGC CGTCGGCCAC CAAGGGTGAT
TCGGTCACGC TGCTGCTCTC CGACGATGTC GACCTGTCCC GCGGCGACCT GATCGCCGGG
GTGCAGGATG CCCCGACCCC GGTCGTCGAG TTCACCGCAA CGGTCACCCA GCTGGCCGAC
AAGCCGCTCA AGCCGGGTGC CCGCACCCTG TTGCGGTACG GCGCGACCTC GACCCGGGCG
CTGGTCACCT CGCTGGACCA CCTGCTGGAC ATCGACTCGC TGACCTACGG GCCGGCGCCG
GAGTCGTTGG CCCTCAACGA CATCGCCCAG GTGACCATCC GCACCGCCGA TGCCGTGCCG
GTCGAGGCCT ATCGGCCCGG CGGCGCCGTC GGTTCGCTGC TGATCATCGA CCCGGCCGAC
GGCACCACGC TGGCCGCCGG CATGGTCGGC GACCGGTTGG CCGCCCTGCA TCCCGAGACC
ACCACCCCCG CTGCCCCCGC CGCCGAGACC GAGGAGGACT GGCTGTCATG A
 
Protein sequence
MSTVVNDSAA VTEVVGAGAA GVDTASEAHE LAQELSHARG LLRFATAGSV DDGKSTLVGR 
LLHDSKSVLS DQLDAVERTS QARGLETVDL SLLVDGLRAE REQGITIDVA YRYFATPTRT
FILADTPGHV QYTRNTVTGA STADLVVILT DARTGVVAQT RRHAAVAALL GVPHLLLAVN
KIDLVDYDQG VFDAIAGEFS AYAKGLGIHE ITSIPISALN GDNVVTRSER IDWYDGPSVL
EHLETVDLAA DTDEALRFPV QYVIRPRSDE LPDYRGYAGR VGAGTVRPGD EVVVLPSGRR
STVTGIDTAD GPRPSATKGD SVTLLLSDDV DLSRGDLIAG VQDAPTPVVE FTATVTQLAD
KPLKPGARTL LRYGATSTRA LVTSLDHLLD IDSLTYGPAP ESLALNDIAQ VTIRTADAVP
VEAYRPGGAV GSLLIIDPAD GTTLAAGMVG DRLAALHPET TTPAAPAAET EEDWLS