Gene Sare_4250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4250 
Symbol 
ID5704382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4822316 
End bp4824229 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content68% 
IMG OID641273669 
Productglucosamine--fructose-6-phosphate aminotransferase 
Protein accessionYP_001539022 
Protein GI159039769 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains 
TIGRFAM ID[TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0388234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0232179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGGAA TCGTGGGATA CGCCGGCGAG CGTCCGGCGC TGGGCATCGT GCTGGATGGG 
CTGCGACGGC TGGAGTACCG CGGCTACGAC TCAGCAGGAG TCGCCATCAC CTGCGGGGAC
GAACTGCTGG CGGAGAAGAG GGCCGGAAAG TTGGCCAACC TGGAGAAGGT GCTCTCCGAA
CGCTCCGCGC AGGACCCGGA GGCGTGCGGC GCGTCCCCCA TCGGGATCGG GGACGGTACC
ACCGGTATCG GCCACACCCG CTGGGCCACC CATGGCGGCC CCACGGACCG TAACGCACAC
CCCCACCTGT CCCCCGACGG GCGGATTGCC GTGATCCACA ACGGCATCAT CGAGAACTTC
GCGAAGCTGC GCGCCGAACT GGAAGCCGAC GGCGTCCAGT TCGTCAGCGA CACCGACACC
GAATGCGCCG TCCACCTGCT CGCCATCGCC CTCGCAGACC TGCGCGCGGC CGGCCATCCG
GACGGGCCGC AGCTGCTGTC CGCCGGGATG CGGGTGGTGT GCCAGCGACT TGAGGGGGCG
TTCACCCTGC TCGCGGTGGA TGCCGGTATC CCGGGGGCCG TGGTCGGTGC CCGGCGCAAC
TCGCCACTGG TCGTCGGCCG CGGCGCCGGT GAGAACTACC TGGCCAGCGA TGTCACCGCG
TTCATCGAGC ACACTCGGGA CGCGGTGGAG CTGGGTCAGG ACCAGATCGT GTTGATCACC
AGCGACAGCA TCGAGATCAC CGATTTCGCC GGGCAGCCCG CGAGTGGCAA GGACTTCCAC
ATCGACTGGG ACTCCTCGGC CGCGGAGAAG GGCGGCTACG ACTGGTTCAT GCTCAAGGAG
ATCGAGGAGC AGCCCCAAGC CGTGGCGGAC ACGTTGCTCG GTCGGCTCAC CGAGAGCGGC
GAGATCATGC TCGACGAGGT CCGGCTGAGC GACCAGGACC TGCGCGACGT CGACAAGATC
TTCATTGTTG CCTGCGGCAC CGCATACCAC TCCGGCATGG TCGCCAAGTA CGCCATCGAA
CACTGGACCC GGATCCCCTG CGAGGTGGAG CTGGCCAGCG AATTCCGCTA CCGCGACCCG
GTGCTCGACC GGTCCACCCT CATCGTGGTG ATCTCGCAGT CCGGCGAGAC GATGGACACC
CTGATGGCGC TGCGGCACGC CAAGGAGCAG AAGGCCCGGG TACTGGCCAT CTGCAACACC
AACGGCTCCA CCATCCCCCG TGAGTCCGAC GCGGTCCTCT ACACCCACGG CGGCCCGGAG
ATCGCCGTCG CCTCCACCAA GGCGTTCCTC ACCCAGCTCG TCGCCTGCTA TCTGATCGGC
CTGCACCTCG CGCAGGTGCG CGGGATCAAG TTCGCCGACG AGGTAGCCGC CGTGGTCAAC
CAGCTGCACC AGATGCCCGG CAAACTGCGT GAGCTGCTGG GCCGGATCGA GCCGGTACGC
GAGCTGGCCC GCGAGTTGAA GGGCCAGCCG ACCGTGCTGT TCATCGGCCG CCACGTCGGA
TACCCGGTGG CGCTGGAAGG TGCGCTCAAG CTCAAGGAAC TGGCCTACAT GCACGCCGAG
GGGTTCGCGG CCGGCGAACT CAAGCACGGC CCGATCGCGT TGATCGACAA GGGCACCCCG
GTGATCTGTG TCGTACCGTC GCCGGTGGGT CGGGGCATGC TGCACGACAA GGTCGTCTCC
AACATCCAGG AGGTGCGGGC CCGTGGCGCC CGCACGATCG TGATCGCGGA GGAGGGCGAC
GAGGCGGTCG TCCGCTTCGC CGACCACCTG ATCTATGTAC CGCGTACGCC GACTCTGCTC
ACGCCGCTGG TGACCACCGT GCCGCTGCAG GTCTTCGCCG CGGAGATCGC CGCAGCGCGT
GGCCACGATG TCGATCAGCC CCGCAACCTG GCGAAGTCCG TGACAGTTGA GTGA
 
Protein sequence
MCGIVGYAGE RPALGIVLDG LRRLEYRGYD SAGVAITCGD ELLAEKRAGK LANLEKVLSE 
RSAQDPEACG ASPIGIGDGT TGIGHTRWAT HGGPTDRNAH PHLSPDGRIA VIHNGIIENF
AKLRAELEAD GVQFVSDTDT ECAVHLLAIA LADLRAAGHP DGPQLLSAGM RVVCQRLEGA
FTLLAVDAGI PGAVVGARRN SPLVVGRGAG ENYLASDVTA FIEHTRDAVE LGQDQIVLIT
SDSIEITDFA GQPASGKDFH IDWDSSAAEK GGYDWFMLKE IEEQPQAVAD TLLGRLTESG
EIMLDEVRLS DQDLRDVDKI FIVACGTAYH SGMVAKYAIE HWTRIPCEVE LASEFRYRDP
VLDRSTLIVV ISQSGETMDT LMALRHAKEQ KARVLAICNT NGSTIPRESD AVLYTHGGPE
IAVASTKAFL TQLVACYLIG LHLAQVRGIK FADEVAAVVN QLHQMPGKLR ELLGRIEPVR
ELARELKGQP TVLFIGRHVG YPVALEGALK LKELAYMHAE GFAAGELKHG PIALIDKGTP
VICVVPSPVG RGMLHDKVVS NIQEVRARGA RTIVIAEEGD EAVVRFADHL IYVPRTPTLL
TPLVTTVPLQ VFAAEIAAAR GHDVDQPRNL AKSVTVE