Gene Cagg_1667 details

Gene Information

Locus tag: Cagg_1667
Symbol: guaA
ID: 7268969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2033932
End bp: 2035473
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 58%
IMG OID: 643566509
Product: GMP synthase
Protein accession: YP_002463004
Protein GI: 219848571
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0518] GMP synthase - Glutamine amidotransferase domain
        [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
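
The start and end coordinates, gene length, and protein length listed in this record are mutually consistent. A minimal sketch of that check follows. It assumes the coordinates are 1-based and inclusive and that the CDS length includes a single terminal stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2033932, 2035473)
print(length_bp, protein_length(length_bp))  # prints "1542 513"
```

Under those assumptions, 2035473 - 2033932 + 1 gives 1542 bp, and 1542 / 3 - 1 gives 513 residues, matching the values in the record.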


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACTC ACAGTATTCC GGTGCTCGAT TTTGGTTCCC AGACGGCGCA GTTGATCGTG 
CGGCGGCTGC GCGAATTGGG CTATTATAGC GAGTTGCTTG CACACGATGC GCCGGAAGCG
CAGATCCGTG CGCTGAACCC GGTTGGAATT GTCCTTAGTG GTGGTCCGGC CAGTGTGTAT
GAGCCGGAGG CGCCAACTTT ACCGCCATGG CTCATCGAAA GCAAGCTGCC CGTACTTGGA
ATTTGCTACG GTATGCAACT AATCAGCCAC ACCCTCGGTG GTGTGGTGCG TCGTCCATCT
GGGCGTGAGT ACGGCCCGGC GATGATCACC GTCACCCAAC CCCACCCACT TTTTGCCGAT
ACCCCCACCG AACAGCCGGT CTGGATGAGT CATGGCGACC GAATCGAGCA GTTGCCCACC
GGCTTTACGG CAATCGCCGC CAGCCAGGCC ACGCCGTTTG CCGCCATTGC CGACGACCAC
CGACGCTGGT ACGGTGTCCA GTTTCACCCG GAAGTGGTGC ATACCGTGTA TGGGCGAGCA
CTTTTGACCA ACTTTGCCAA ACTATGCGGG GCAAAACCCG AATGGCAGCC GAGCAGTTTT
GTCACCGAAG CGATTGAACG GGTTCGAGCA CAGGTCGGCC CACACGGGCG CGTCATCTGC
GCCCTCTCCG GCGGGGTTGA TTCGGCAGTG GCGGCTCTGA TCATCCATCA CGCTATCGGT
GACCGGTTGA CCTGCGTGTT CGTTGATAAC GGCCTCTTGC GCGCCGGCGA AGCTGAACAG
GTCATCAACA CCTTTCGTGA ACATTTTCAC GTACCGCTGA TCGCAGTCGA TGCGCGTGAA
GAGTTTCTCG CTGCCTTAGA GGGTGTGGTT GATCCTGAGC AGAAGCGCAA GATTATCGGC
GAGAAGTTTA TTCGGATTTT CGAGCGCGAA GCGCGTTCGT TAGCAGACGT AGAGTTCCTC
GCCCAAGGGA CGCTCTACCC CGATGTAATC GAATCGACTG CACCGGACCG ACCGAAGGCA
GCAAAGATCA AAACGCATCA CAACGTTGGC GGGCTACCCG CCGACATGCA ACTGAAGCTG
GTTGAGCCGC TCCGCTACCT ATTCAAAGAT GAAGTACGCG CAGCAGGGCT GCAACTCGGC
TTGCCCGAAG AGTGGGTATG GCGACATCCT TTCCCCGGAC CCGGTCTCGC CGTGCGGATC
ATCGGTACGG TAACGTGGGA ACGGCTAGAG ACATTGCGCA AAGCCGACAG CATCTTCCTT
GAAGAGCTGC GGGCAAGCGG CTACTACCGT GCAACCCAAC AGGCATTCGC CGTTCTCCTG
CCGGTGCAAA GCGTCGGTGT GATGGGGGAC GGGCGTAGTT ATGGTTTCAC TATCGCACTG
CGCGCGATTA CCACCGAAGA CTACATGACA GCCGACTGGG CGCGCTTACC CTACGAATTA
CTGGCACACG TCAGTAGCCG AATTGTGAAT GAGGTCGAAG GCGTCAATCG CGTCGTATAC
GATATTTCGT CGAAGCCACC GGCCACTATC GAGTGGGAGT AG
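
The gene sequence is printed in blocks of ten bases, so it has to be stripped of whitespace before any computation. The sketch below shows one way to do that and to compute a GC fraction that can be compared against the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input is only a placeholder for the full sequence.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Placeholder input: paste the full block-formatted gene sequence here.
example = "GGCC AATT\nGCAT"
print(f"{gc_fraction(example):.0%}")
```

Rounding the result to a whole percentage makes it directly comparable to the value reported in the record.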
 
Protein sequence
MTTHSIPVLD FGSQTAQLIV RRLRELGYYS ELLAHDAPEA QIRALNPVGI VLSGGPASVY 
EPEAPTLPPW LIESKLPVLG ICYGMQLISH TLGGVVRRPS GREYGPAMIT VTQPHPLFAD
TPTEQPVWMS HGDRIEQLPT GFTAIAASQA TPFAAIADDH RRWYGVQFHP EVVHTVYGRA
LLTNFAKLCG AKPEWQPSSF VTEAIERVRA QVGPHGRVIC ALSGGVDSAV AALIIHHAIG
DRLTCVFVDN GLLRAGEAEQ VINTFREHFH VPLIAVDARE EFLAALEGVV DPEQKRKIIG
EKFIRIFERE ARSLADVEFL AQGTLYPDVI ESTAPDRPKA AKIKTHHNVG GLPADMQLKL
VEPLRYLFKD EVRAAGLQLG LPEEWVWRHP FPGPGLAVRI IGTVTWERLE TLRKADSIFL
EELRASGYYR ATQQAFAVLL PVQSVGVMGD GRSYGFTIAL RAITTEDYMT ADWARLPYEL
LAHVSSRIVN EVEGVNRVVY DISSKPPATI EWE
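
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It assumes the usual compact codon layout with bases ordered T, C, A, G. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGACGACTC ACAGTATTCC GGTGCTCGAT"))  # prints "MTTHSIPVLD"
```

The output matches the first block of the protein sequence. Likewise, the final twelve bases of the gene, GAGTGGGAGTAG, translate to EWE followed by a stop codon, matching the end of the protein.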