Gene CPR_2261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2261 
SymbolguaB 
ID4204750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2480886 
End bp2482340 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content34% 
IMG OID642566813 
Productinosine 5'-monophosphate dehydrogenase 
Protein accessionYP_699537 
Protein GI110802266 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0516] IMP dehydrogenase/GMP reductase
[COG0517] FOG: CBS domain 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGAA TATTAAAAAC AGCATATACA TTTGATGATG TATTACTAGT ACCAAACAAA 
TCAGAGGTCC TACCTAATGA GGTTTCTTTA AAAACTCAAT TAACTAAAAA AATTCAATTA
AACATTCCAT TAATGAGTGC AAGTATGGAT ACAGTTACTG AATCTAAAAT GGCAATAGCT
ATGGCTAGAG AAGGTGGAAT AGGAATAATC CATAAAAACA TGACAATAGA AGATCAAGCA
AGAGAAGTTG ATAGAGTAAA AAGACAAGAA AATGGAGTAA TAACAGATCC AATATTCTTA
TCAGAAGATC ACACAGTAAG AGAAGCTTTA GATCTAATGG CTCAATACAG AATTTCAGGA
GTTCCTGTAA CAAGAGAAGG AAAACTAGTT GGAATCATAA CAAACAGAGA CATAGTATTT
GAAACAAACT ACGATAAAAA AGTTTCAGAA GTTATGACAA AAAGCCCATT AGTTACTGCG
AAAGAAGGAA CTACATTAAC AGAAGCATTA GAAATATTAA AGCAACATAA AATAGAAAAA
CTTCCTTTAA TTGATGATGA AAATAATCTA AAGGGACTTA TAACTATAAA AGATATAGAA
AAAGCTAAAG CTTTCCCAAA TGCAGCTAAA GATGAAAAAG GAAGACTTTT ATGTGGAGCT
TCAATTGGTG TAACTAATGA CATGATGGAA AGAGTAGATG CAGTAGTTAA AGCAAAGGTT
GACGTTATTG TTTTAGATAC AGCTCATGGT CATTCAAAAG GAGTTATTGA AGGAGTTAAG
AGAATAAAAG CTAAGTATCC TGAACTTCAA GTAATAGCTG GTAACATAGC TACACCAGAG
GCTGTAAGAG ATTTAGCAGA AGCAGGTGCA GATTGTGTCA AAGTAGGTAT AGGACCTGGA
TCAATATGTA CTACAAGAAT AGTTGCAGGT GTTGGAGTTC CTCAATTAAC AGCAGTTATG
GATTGTGCTG AAGAAGGTAA AAAATTAGGA ATACCAGTAA TAGCTGACGG TGGATTAAAA
TATTCAGGAG ATATAGTTAA GGCTTTAGCA GCAGGAGCTT GTGCAGCTAT GATGGGATCA
ATCTTTGCAG GATGTGAAGA AGCACCAGGA GCTATAGAAA TATATCAAGG TAGAAGCTAT
AAAGTTTACA GAGGAATGGG TTCACTTGGA GCAATGGCAA AAGGATCAAG TGATAGATAT
TTCCAAAATG GTACTAAGAA ATTCGTTCCA GAAGGTGTAG AAGGAAGAAT TGCTTACAAA
GGACATTTAG CAGATACTAT ATACCAATTA ATAGGTGGAA TAAAATCAGG AATGGGTTAC
TTAGGAGCAC CAACTTTAGA TAATTTATAT GAAAATGCTA ATTTTGTTGT TCAAACTTCA
GCAGGATTTA GAGAAAGTCA CCCTCATGAT ATAAACATAA CTAAAGAAGC ACCAAACTAC
AGTGTTAACC AATAA
 
Protein sequence
MARILKTAYT FDDVLLVPNK SEVLPNEVSL KTQLTKKIQL NIPLMSASMD TVTESKMAIA 
MAREGGIGII HKNMTIEDQA REVDRVKRQE NGVITDPIFL SEDHTVREAL DLMAQYRISG
VPVTREGKLV GIITNRDIVF ETNYDKKVSE VMTKSPLVTA KEGTTLTEAL EILKQHKIEK
LPLIDDENNL KGLITIKDIE KAKAFPNAAK DEKGRLLCGA SIGVTNDMME RVDAVVKAKV
DVIVLDTAHG HSKGVIEGVK RIKAKYPELQ VIAGNIATPE AVRDLAEAGA DCVKVGIGPG
SICTTRIVAG VGVPQLTAVM DCAEEGKKLG IPVIADGGLK YSGDIVKALA AGACAAMMGS
IFAGCEEAPG AIEIYQGRSY KVYRGMGSLG AMAKGSSDRY FQNGTKKFVP EGVEGRIAYK
GHLADTIYQL IGGIKSGMGY LGAPTLDNLY ENANFVVQTS AGFRESHPHD INITKEAPNY
SVNQ