Gene Sare_4223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4223 
SymbolguaA 
ID5704394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4792893 
End bp4794446 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content69% 
IMG OID641273642 
ProductGMP synthase 
Protein accessionYP_001538995 
Protein GI159039742 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0518] GMP synthase - Glutamine amidotransferase domain
[COG0519] GMP synthase, PP-ATPase domain/subunit 
TIGRFAM ID[TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
[TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0655069 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0620226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC CGCGCCCCGT CCTGGTGGTG GACTTCGGAG CCCAGTACGC CCAGCTCATC 
GCGCGCCGGG TGCGGGAGGC CCGGGTCTAC TCGGAGATCG TCCCGCATTC GATGCCGGTG
GCCGAGATGC TGGCGAAGGA CCCGGCAGCG ATCATTCTCT CCGGCGGCCC GTCCAGCGTT
TACGTGCCGA ATGCGCCGCA GGTCGACGCC GGGGTGTTCG AGGCCGGTGT GCCAGTCTTC
GGTATCTGTT ACGGCTTCCA GGCGATGGCC CGGGCCCTTG GGGGCACGGT CGCAAGGACC
GGCAACCGGG AGTACGGGGG TACCCCGCTG CGTCCGCGGC CGGATCCCGG GGCGTTGCTC
CGTGACCTTC CCGGTGACCT GCCGGTCTGG ATGAGCCACG GCGACTGTGT GACGGAGGCG
CCGCCCGGTT TCGTGGTGAC CGCCGAGTCG GCGGGGGCTC CGGTGGCAGC CTTCGAGGAC
CCAACCGGGC GGCGGGCCGG GGTGCAGTTC CATCCGGAGG TCGGGCACAC GGCGCACGGC
CAGGAGATGC TGACCCGTTT CCTCTACGAC ATCGCCGGCA TCGAGCCCAC CTGGACGCCG
GAGAACATCA TCGACGAGCA GGTGGCGCGG ATCCGTGAGC AGGTCGGCAC CAAGGAGGTC
ATCTGTGGCC TGAGTGGTGG TGTCGACTCC GCGGTCGCCG CGGCGCTGGT GCACCGGGCC
GTCGGTGACC AACTGACCTG CGTTTTTGTT GACCATGGTC TGCTCCGGGC GGGTGAGGCC
GAGCAGGTGG AGAAGGACTA CGTTGCCGCC ACCGGAATCA GGCTGAAGGT GGTCGACTCA
GCTGACCGTT TCCTGACGGC CCTGGCCGGC GTGTCCGATC CCGAGCGGAA GCGCAAGATC
ATCGGCCGGG AGTTCATTCG GACCTTCGAG GCCGCAGCCC GGGACATCGC CGCGCACGGT
GATGTGGAGT TCCTGGTCCA GGGCACCCTC TACCCGGATG TGGTGGAGTC CGGTGGCGGT
GCGGGCACCG CCAGCATCAA GAGCCACCAC AATGTCGGCG GACTTCCGGA GGACCTCGGG
TTCTCCCTGG TCGAACCGCT TCGCACGCTC TTCAAGGACG AGGTCCGCGC GCTCGGCCTT
CAACTCGGCC TGCCGGAGGC GATGGTCTGG CGGCACCCGT TCCCCGGGCC GGGGCTCGCC
ATCCGGATCA TCGGGGCGGT CGACCGGGAG CGGCTCGACG TGCTCCGCCG GGCCGACTTC
ATCGCTCGGC AGGAACTCAG CGCCGCCGGT CTGGACCGTG GTGTGTGGCA GTTCCCGGTG
GTTCTCCTGG CGGACGTGCG CAGCGTGGGT GTGCAGGGTG ACGGGCGCAG CTACGGGCAC
CCGGTGGTCC TGCGTCCGGT TTCCAGTGAG GACGCGATGA CGGCCGACTG GTCGCGGCTG
CCGTACGACC TGATCGCTCG GATCTCCACT CGGATCACGA ATGAGGTCGC CGAGGTGAAC
CGGGTGGTTC TGGACGTGAC CAGCAAGCCG CCGGGCACCA TCGAGTGGGA GTGA
 
Protein sequence
MSTPRPVLVV DFGAQYAQLI ARRVREARVY SEIVPHSMPV AEMLAKDPAA IILSGGPSSV 
YVPNAPQVDA GVFEAGVPVF GICYGFQAMA RALGGTVART GNREYGGTPL RPRPDPGALL
RDLPGDLPVW MSHGDCVTEA PPGFVVTAES AGAPVAAFED PTGRRAGVQF HPEVGHTAHG
QEMLTRFLYD IAGIEPTWTP ENIIDEQVAR IREQVGTKEV ICGLSGGVDS AVAAALVHRA
VGDQLTCVFV DHGLLRAGEA EQVEKDYVAA TGIRLKVVDS ADRFLTALAG VSDPERKRKI
IGREFIRTFE AAARDIAAHG DVEFLVQGTL YPDVVESGGG AGTASIKSHH NVGGLPEDLG
FSLVEPLRTL FKDEVRALGL QLGLPEAMVW RHPFPGPGLA IRIIGAVDRE RLDVLRRADF
IARQELSAAG LDRGVWQFPV VLLADVRSVG VQGDGRSYGH PVVLRPVSSE DAMTADWSRL
PYDLIARIST RITNEVAEVN RVVLDVTSKP PGTIEWE