Gene Information |
Locus tag | Rcas_1739 |
Symbol | guaA |
ID | 5539217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2236456 |
End bp | 2237994 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640893878 |
Product | GMP synthase |
Protein accession | YP_001431849 |
Protein GI | 156741720 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain; [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit; [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
|
|
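The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields.
START_BP = 2236456
END_BP = 2237994


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1539 512
```

Both results match the Gene Length (1539 bp) and Protein Length (512 aa) fields.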
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.929225 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGAAT CAATCCCAGT TCTCGATTTT GGCTCGCAGA CAGCGCAACT GATCGTCCGC CGCCTGCGTG AACTTGGCGT GTACAGCGAA CTGTTGCCGC ACGACACCCC AGAAGCCGAC GTGTGGGCGT TGCAACCACG CGGCATTGTT CTTTCCGGCG GACCGGCAAG CGTCTATGAG CCAGGCGCGC CGCAGTTGCC GCCATGGCTG CTCGAAAGCG ACCTGCCGGT GCTTGGTATT TGCTACGGGA TGCAGTTGCA GGCACACACC CTCGGTGGGC GCGTCGAAGG TATGCAGAGC CGTGAGTTTG GTCCGGCAGA GATCACCGTC GTCGATCCCG ATCTGCTGTT CGCCGATATG CCGACACAAC AACAGGTGTG GATGAGCCAC GGCGATCACA TTGCTGCGCT GCCCCCTGGA TTTCGCGTGC TGGCACACAG CCCCGGCGCG CCATTTGCTG CCGCAGGCGA CGACCGACGT CGCTGGTATG GCATTCAGTT CCATCCCGAA GTCGTGCATA CGCGCTTCGG GCGCGACATA TTGCGCAACT TCGCCTTCCG TATTTGCAAA TGCCGCGGCG ACTGGCAACC GGAAAACTTT GTCGCTGAGG CAATCGAGCG CGTGCGCGCG CAGGTCGGCG ATGGGCGGGT GATCTGTGCG CTTTCCGGCG GCGTCGACTC GGCGGTTGCC GCGCTGATCG TCCATCACGC CATCGGCGAC CGGTTGACGT GCGTTTTTGT GGACAATGGT TTGCTGCGCC AGGGTGAAGC CGAACAGGTT GTCGCCACCT TCCGTGAGCA TTTTCATATT CCCCTGATCG CCGTCGATGC AGCAGATGAA TTTCTCGAAG CGCTTGCTGG CGTTGCCGAC CCGGAACAGA AGCGCACAAT CATCGGCGAA AAGTTCGTGC GCATCTTCGA ACGTGAAGCG CGCCGCATCG AAGGCGCGCG CTTCCTCGCG CAGGGCACGC TTTACCCCGA CGTGATCGAA AGCAGAGCGC CGGATCGCCA GAAAGGCGTA ACCATCAAAA CCCACCACAA TGTCGGCGGA TTGCCCGCCG ATATGCAGTT GACCCTCGTC GAACCATTGC GCTACCTGTT CAAGGACGAA GTGCGCGCCG CCGGTCATGC CCTGGGGCTG CCGGACGAAT GGGTCTGGCG GCATCCCTTC CCTGGACCAG GGCTTGCCGT GCGGGTGCTT GGTCCGGTGA CGCGCGAGCG CCTCGCAACG CTGCGCGCTG CCGACGCCAT TTTCATGCAG GAATTGCGCA TTGCCGGATT ATACCGCGCA ACGCAACAGG CGTTTGCAGT GCTGTTGCCG GTACGCAGCG TCGGCGTGAT GGGCGATGGA CGCACCTACG CTGATGTGGT GGCGCTCCGC GCCGTGACGA CCGAGGATTA TATGACGGCG GATTGGGCGC GCCTCCCCGC TGAACTGCTG GCGCGCGTGA GCAGCCGCAT TGTGAACGAG GTTCCCGGCG TCAATCGTGT GGTGTACGAC ATCTCCTCCA AACCCCCGGC AACGATTGAG TGGGAATAG
|
Protein sequence | MHESIPVLDF GSQTAQLIVR RLRELGVYSE LLPHDTPEAD VWALQPRGIV LSGGPASVYE PGAPQLPPWL LESDLPVLGI CYGMQLQAHT LGGRVEGMQS REFGPAEITV VDPDLLFADM PTQQQVWMSH GDHIAALPPG FRVLAHSPGA PFAAAGDDRR RWYGIQFHPE VVHTRFGRDI LRNFAFRICK CRGDWQPENF VAEAIERVRA QVGDGRVICA LSGGVDSAVA ALIVHHAIGD RLTCVFVDNG LLRQGEAEQV VATFREHFHI PLIAVDAADE FLEALAGVAD PEQKRTIIGE KFVRIFEREA RRIEGARFLA QGTLYPDVIE SRAPDRQKGV TIKTHHNVGG LPADMQLTLV EPLRYLFKDE VRAAGHALGL PDEWVWRHPF PGPGLAVRVL GPVTRERLAT LRAADAIFMQ ELRIAGLYRA TQQAFAVLLP VRSVGVMGDG RTYADVVALR AVTTEDYMTA DWARLPAELL ARVSSRIVNE VPGVNRVVYD ISSKPPATIE WE
|
| |
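The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, checked for GC content, and translated. The helper names are hypothetical. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns amino acids to codons the same way as the standard code, differing in the permitted start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon.

    Only unambiguous A/C/G/T bases are supported; other symbols raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGCACGAAT CAATCCCAGT"))  # MHESIP
# Last nine bases of the gene sequence, ending in the TAG stop codon.
print(translate("TGGGAATAG"))  # WE
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence value once both are cleaned, and `gc_fraction` on the same input can be compared against the GC content field.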