Gene Ccel_2202 details

Gene Information

Locus tag: Ccel_2202
Symbol: guaA
ID: 7310890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2574335
End bp: 2575870
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 42%
IMG OID: 643609134
Product: GMP synthase
Protein accession: YP_002506524
Protein GI: 220929615
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0518] GMP synthase - Glutamine amidotransferase domain
        [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
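
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 2574335
end_bp = 2575870

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in a stop codon,
# which does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 1536
print(remainder)       # 0
print(protein_length)  # 511
```

Under these assumptions the computed values agree with the listed Gene Length (1536 bp) and Protein Length (511 aa).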


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAACAATG AAATGATATT AGTTCTTGAT TTCGGCGGAC AGTACAATCA GCTGATAGCA 
CGCCGAGTGA GAGAAGCAAA TGTTTATTGC GAGGTAATTC CTTATAACGC ATCTTTAGAA
CGTATAAAAT CATACAACGC AAAAGGAATA ATTTTTACTG GTGGGCCCAA TTCAGTTTTG
GATGAGGGAG CACCAAAGTG TGACCCGGGT GTTTTTGAGC TGGGAATACC TGTTCTGGGC
ATATGCTATG GTATGCAGCT CATGAGTGTT ATGCTGGGCG GAAGTGTAAC TGCTGCTAAT
CAGCGTGAAT ATGGAAAGGT TGAAATTTGT GTAGATAAAT CACAGCCTTT GTTCAGGGAT
GTGGACGAAA ATACAATATG CTGGATGAGT CATACCTACT ATGTTGATAC ACCTCCTAAG
GGTTTTGAAG TAATAGCAAA GTCAGCAAAT TGTCCAACAG GTGCAATGCA GCATGTTGAA
AAGAACCTCT ATGCGGTCCA GTTCCACCCG GAGGTAATGC ATACGCCTAA AGGAAAAGAA
ATGCTTAAAA ACTTCCTATA CAATATTTGC GGCTGTAAAG GCGACTGGAA GATGTCATCA
TTTGTTGAAA ACTCGATTAA TGCAATACGT GAGAAGGTTG GAGACAAAAA GGTATTGTGT
GCACTGTCCG GCGGGGTTGA TTCATCTGTG GCGGCAGTAC TGATTCATAA GGCTATTGGA
AAGCAGCTGA CTTGTATATT TGTTGACCAT GGACTTTTGA GAAAATATGA GGGAGACCAG
GTTGAGCAGA TTTTCAGAAA GCAGTATGAT ATCAACCTGA TTCGTGTAAA TTGTGAAGAC
AGATTTTTGC AGAGACTTAA AGGCGTTTCT GATCCCGAAA CCAAAAGAAA AATTATTGGT
GAAGAATTTA TAAGAGTATT TGAAGATGAG GCAAAGAAAA TTGGAAAGGT TGATTTCCTT
GTTCAGGGAA CAATTTATCC TGATGTCATT GAAAGCGGAA TCGGTGATGC AGCCGTTATA
AAGAGCCATC ATAATGTTGG AGGTCTGCCA GAACATGTTG ATTTTAAGGA GATTATCGAA
CCTCTCAGAA GCCTTTTCAA GGACGAAGTA AGAAGGGCGG GAGAGGAATT GGGTATTCCT
GAAGACTTGG TTTGGAGACA GCCATTCCCG GGCCCCGGAC TTGCTATAAG GGTTATCGGT
GATTTGACAA AAGAAAAGCT GGACACTCTA AGAGATACCG ACTACATTTT CCGTGAAGAA
ATCAAGGCTG CCGGATTGGA CAAAGAGATT AACCAGTATT TCACAGTTTT GACAAATATG
CGAAGTGTAG GCGTGATGGG TGACGAAAGA ACTTATGACT ACGCCTTGGC ACTGCGTGCG
GTAACGACCA CCGACTTTAT GACAGCCGAC TGGGCAAGAA TCCCATACGA TATTCTGGAG
AAGGTCTCCA CTCGTATTGT CAACGAAGTC AAGCAAATCA ACAGAATTGT GTATGATATC
ACCTCGAAGC CACCAGCTAC GATTGAGTGG GAATAA
 
Protein sequence
MNNEMILVLD FGGQYNQLIA RRVREANVYC EVIPYNASLE RIKSYNAKGI IFTGGPNSVL 
DEGAPKCDPG VFELGIPVLG ICYGMQLMSV MLGGSVTAAN QREYGKVEIC VDKSQPLFRD
VDENTICWMS HTYYVDTPPK GFEVIAKSAN CPTGAMQHVE KNLYAVQFHP EVMHTPKGKE
MLKNFLYNIC GCKGDWKMSS FVENSINAIR EKVGDKKVLC ALSGGVDSSV AAVLIHKAIG
KQLTCIFVDH GLLRKYEGDQ VEQIFRKQYD INLIRVNCED RFLQRLKGVS DPETKRKIIG
EEFIRVFEDE AKKIGKVDFL VQGTIYPDVI ESGIGDAAVI KSHHNVGGLP EHVDFKEIIE
PLRSLFKDEV RRAGEELGIP EDLVWRQPFP GPGLAIRVIG DLTKEKLDTL RDTDYIFREE
IKAAGLDKEI NQYFTVLTNM RSVGVMGDER TYDYALALRA VTTTDFMTAD WARIPYDILE
KVSTRIVNEV KQINRIVYDI TSKPPATIEW E
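
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the permitted initiation codons and an initiation codon is read as methionine. A minimal sketch of that translation follows, applied to the first 60 bases of the gene sequence. The function name `translate_cds` and the constants are hypothetical helpers written for this illustration, not part of any database tool.

```python
# Codon table for translation table 11. Its amino acid assignments are the
# same as the standard code; the table differs in which start codons it allows.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as Met.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Translation stops at the first stop codon; trailing partial codons are dropped.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiation codon is read as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_60 = "TTGAACAATG AAATGATATT AGTTCTTGAT TTCGGCGGAC AGTACAATCA GCTGATAGCA"
print(translate_cds(first_60))  # MNNEMILVLDFGGQYNQLIA
```

The output matches the first 20 residues of the listed protein sequence (MNNEMILVLD FGGQYNQLIA).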