Gene Rcas_1739 details


Gene Information

Locus tag: Rcas_1739
Symbol: guaA
ID: 5539217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2236456
End bp: 2237994
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 62%
IMG OID: 640893878
Product: GMP synthase
Protein accession: YP_001431849
Protein GI: 156741720
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0518] GMP synthase - Glutamine amidotransferase domain
        [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
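
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


gene_bp = cds_length(2236456, 2237994)
print(gene_bp, protein_length(gene_bp))  # prints "1539 512"
```

Under these assumptions the result agrees with the listed Gene Length (1539 bp) and Protein Length (512 aa).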


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.929225
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGAAT CAATCCCAGT TCTCGATTTT GGCTCGCAGA CAGCGCAACT GATCGTCCGC 
CGCCTGCGTG AACTTGGCGT GTACAGCGAA CTGTTGCCGC ACGACACCCC AGAAGCCGAC
GTGTGGGCGT TGCAACCACG CGGCATTGTT CTTTCCGGCG GACCGGCAAG CGTCTATGAG
CCAGGCGCGC CGCAGTTGCC GCCATGGCTG CTCGAAAGCG ACCTGCCGGT GCTTGGTATT
TGCTACGGGA TGCAGTTGCA GGCACACACC CTCGGTGGGC GCGTCGAAGG TATGCAGAGC
CGTGAGTTTG GTCCGGCAGA GATCACCGTC GTCGATCCCG ATCTGCTGTT CGCCGATATG
CCGACACAAC AACAGGTGTG GATGAGCCAC GGCGATCACA TTGCTGCGCT GCCCCCTGGA
TTTCGCGTGC TGGCACACAG CCCCGGCGCG CCATTTGCTG CCGCAGGCGA CGACCGACGT
CGCTGGTATG GCATTCAGTT CCATCCCGAA GTCGTGCATA CGCGCTTCGG GCGCGACATA
TTGCGCAACT TCGCCTTCCG TATTTGCAAA TGCCGCGGCG ACTGGCAACC GGAAAACTTT
GTCGCTGAGG CAATCGAGCG CGTGCGCGCG CAGGTCGGCG ATGGGCGGGT GATCTGTGCG
CTTTCCGGCG GCGTCGACTC GGCGGTTGCC GCGCTGATCG TCCATCACGC CATCGGCGAC
CGGTTGACGT GCGTTTTTGT GGACAATGGT TTGCTGCGCC AGGGTGAAGC CGAACAGGTT
GTCGCCACCT TCCGTGAGCA TTTTCATATT CCCCTGATCG CCGTCGATGC AGCAGATGAA
TTTCTCGAAG CGCTTGCTGG CGTTGCCGAC CCGGAACAGA AGCGCACAAT CATCGGCGAA
AAGTTCGTGC GCATCTTCGA ACGTGAAGCG CGCCGCATCG AAGGCGCGCG CTTCCTCGCG
CAGGGCACGC TTTACCCCGA CGTGATCGAA AGCAGAGCGC CGGATCGCCA GAAAGGCGTA
ACCATCAAAA CCCACCACAA TGTCGGCGGA TTGCCCGCCG ATATGCAGTT GACCCTCGTC
GAACCATTGC GCTACCTGTT CAAGGACGAA GTGCGCGCCG CCGGTCATGC CCTGGGGCTG
CCGGACGAAT GGGTCTGGCG GCATCCCTTC CCTGGACCAG GGCTTGCCGT GCGGGTGCTT
GGTCCGGTGA CGCGCGAGCG CCTCGCAACG CTGCGCGCTG CCGACGCCAT TTTCATGCAG
GAATTGCGCA TTGCCGGATT ATACCGCGCA ACGCAACAGG CGTTTGCAGT GCTGTTGCCG
GTACGCAGCG TCGGCGTGAT GGGCGATGGA CGCACCTACG CTGATGTGGT GGCGCTCCGC
GCCGTGACGA CCGAGGATTA TATGACGGCG GATTGGGCGC GCCTCCCCGC TGAACTGCTG
GCGCGCGTGA GCAGCCGCAT TGTGAACGAG GTTCCCGGCG TCAATCGTGT GGTGTACGAC
ATCTCCTCCA AACCCCCGGC AACGATTGAG TGGGAATAG
 
Protein sequence
MHESIPVLDF GSQTAQLIVR RLRELGVYSE LLPHDTPEAD VWALQPRGIV LSGGPASVYE 
PGAPQLPPWL LESDLPVLGI CYGMQLQAHT LGGRVEGMQS REFGPAEITV VDPDLLFADM
PTQQQVWMSH GDHIAALPPG FRVLAHSPGA PFAAAGDDRR RWYGIQFHPE VVHTRFGRDI
LRNFAFRICK CRGDWQPENF VAEAIERVRA QVGDGRVICA LSGGVDSAVA ALIVHHAIGD
RLTCVFVDNG LLRQGEAEQV VATFREHFHI PLIAVDAADE FLEALAGVAD PEQKRTIIGE
KFVRIFEREA RRIEGARFLA QGTLYPDVIE SRAPDRQKGV TIKTHHNVGG LPADMQLTLV
EPLRYLFKDE VRAAGHALGL PDEWVWRHPF PGPGLAVRVL GPVTRERLAT LRAADAIFMQ
ELRIAGLYRA TQQAFAVLLP VRSVGVMGDG RTYADVVALR AVTTEDYMTA DWARLPAELL
ARVSSRIVNE VPGVNRVVYD ISSKPPATIE WE
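
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is given in the coding direction, as shown above, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; this gene starts with ATG, so that does not matter here. The names `clean` and `translate` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCACGAAT CAATCCCAGT TCTCGATTTT"))  # prints "MHESIPVLDF"
```

Passing the full gene sequence to `translate` in the same way yields the complete protein for comparison with the listing above.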