Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0300 |
Symbol | guaA |
ID | 4570790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 331874 |
End bp | 333415 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639764899 |
Product | GMP synthase |
Protein accession | YP_910786 |
Protein GI | 119356142 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.176102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCGG TAACCGTTCT TGACTTTGGA TCTCAATATA CCCAGCTTAT CGCCCGACGT ATCCGCGAAC TGGGTATCTA TTCGGAAATA CTGCCGTACA ATGCCTCTCC TGAAACCATT CGGGAACATG ACCCGAAAGC AATTATCCTC TCGGGAGGCC CGACAAGCGT CTATGGCGAT TCTGCCCTTT TACCTGACGG TGGAGTATTC ATGCTGGGAG TACCGGTTCT CGGGATCTGC TATGGTCTGC AGGCCATAGC AAAACATTTC GGAGGGGAGG TTGCCGGCTC TTCTAAACAG GAGTTTGGCA GGGCTAAAAT GCTGGTCAGT CATAACGAAG AGTCTGAAAG CCCGCTTTTT CGAAACATTC CTGATTCGGA TGTCTGGATG AGCCACGGAG ACAAGGTGGT CAGACTCCCT GAAGGGTTCA GGGTCACCGC AAGCAGCGAA AACTCGGAAA TGTGCGCTCT TGAAAGTTAT GGGTCAAAAG CCGCTCTCAA GGTTTATGGT CTCCAGTTCC ATCCTGAAGT ACAGCATACC CTTTATGGCA AACAACTGCT CTCCAACTTT CTCATCGATA TTGCTGGCAT TAAACCCGAT TGGTCGCCGA AAAGCTTTAT CGGTCACCAG ATTGAGGAAA TCAGGGCTCG CGCCGGAAAA GACAAGGTGA TTTGCGGCAT CAGCGGCGGC GTTGACTCAA CAGTCGCAGC AGTGCTTGTA AGCCAGGCTA TAGGAAAACA GCTGCACTGC GTTTTTGTTG ATAACGGGCT GCTTCGTAAA AATGAAGCGG TCAAAGTGAT GAATTTTCTC AAACCACTCG GCCTCTCCGT CACCCTTGCC GATGCCTCGG ATCTGTTTCT CAAAAGACTT GATAAGGTGG CCTCTCCGGA AAAAAAGAGA AAGATTATCG GCAGAACCTT CATTCATGTA TTCGAACAGC ATCTGAACGA GGAGAAATAT CTTGTTCAGG GCACGCTCTA TCCCGATGTC ATTGAGAGCG TCAGCGTCAA GGGTCCTTCA GAAACCATCA AGTCGCACCA TAACGTTGGC GGCCTGCCGA AGCGCATGAA ACTGAAACTC ATAGAACCGC TCCGGGAGCT TTTCAAGGAC GAGGTACGGG CTGTGGGTCG TGAACTTGGT ATTGCTGAAG ATATTCTCAT GCGCCACCCG TTCCCCGGTC CGGGTCTTGC CGTCAGGGTG CTGGGCTCGG TAAGTCGCCC AAGGCTTGAC ATCCTTCGCG AGGCTGATGA AATTTTCATT GAGGAGCTTA AAACCAGCGG CCTTTACCAG CATGTCTGGC AGGCGTTTTC AGTGCTTCTA CCGGTACAGT CTGTCGGCGT CATGGGAGAC AAGCGGACGT ATGAAAATGT TCTGGCACTT CGGGCTGTTG AATCAACGGA TGGCATGACC GCCGACTGGG CGCAACTGCC TCACGATTTC CTCGCACGTG TCTCGAACCG CATCATCAAC GAAGTTCGCG GCATCAACCG GGTTGCCTAC GATATTTCCT CCAAACCGCC AGCGACCATC GAGTGGGAGT AA
|
Protein sequence | MQSVTVLDFG SQYTQLIARR IRELGIYSEI LPYNASPETI REHDPKAIIL SGGPTSVYGD SALLPDGGVF MLGVPVLGIC YGLQAIAKHF GGEVAGSSKQ EFGRAKMLVS HNEESESPLF RNIPDSDVWM SHGDKVVRLP EGFRVTASSE NSEMCALESY GSKAALKVYG LQFHPEVQHT LYGKQLLSNF LIDIAGIKPD WSPKSFIGHQ IEEIRARAGK DKVICGISGG VDSTVAAVLV SQAIGKQLHC VFVDNGLLRK NEAVKVMNFL KPLGLSVTLA DASDLFLKRL DKVASPEKKR KIIGRTFIHV FEQHLNEEKY LVQGTLYPDV IESVSVKGPS ETIKSHHNVG GLPKRMKLKL IEPLRELFKD EVRAVGRELG IAEDILMRHP FPGPGLAVRV LGSVSRPRLD ILREADEIFI EELKTSGLYQ HVWQAFSVLL PVQSVGVMGD KRTYENVLAL RAVESTDGMT ADWAQLPHDF LARVSNRIIN EVRGINRVAY DISSKPPATI EWE
|
| |