Gene CPF_2558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2558 
SymbolguaB 
ID4203566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2820518 
End bp2821972 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content34% 
IMG OID638083425 
Productinosine 5'-monophosphate dehydrogenase 
Protein accessionYP_696948 
Protein GI110800169 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0516] IMP dehydrogenase/GMP reductase
[COG0517] FOG: CBS domain 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGAA TATTAAAAAC AGCATATACA TTTGATGATG TATTACTAGT ACCAAACAAA 
TCAGAGGTCC TACCTAATGA GGTTTCTTTA AAAACTCAAT TAACTAAAAA AATTCAATTA
AACATTCCAT TAATGAGTGC AAGTATGGAT ACAGTTACTG AATCTAAAAT GGCAATAGCT
ATGGCTAGAG AAGGTGGAAT AGGAATAATC CATAAAAATA TGACAATAGA AGATCAAGCA
AGAGAAGTTG ATAGAGTAAA AAGACAAGAA AATGGAGTAA TAACAGATCC AATATTCTTA
TCAGAAGATC ACACAGTAAG AGAAGCTTTA GATCTAATGG CTCAATACAG AATTTCAGGA
GTTCCTGTAA CAAGAGAAGG AAAACTAGTT GGAATCATAA CAAACAGAGA CATAGTATTT
GAAACAAACT ACGATAAAAA AGTTTCAGAA GTTATGACAA AAAGTCCATT AGTTACTGCG
AAGGAAGGAA CTACATTAAC TGAAGCATTA GAAATATTAA AGCAACATAA AATAGAAAAA
CTTCCTTTAG TTGATGATGA AAATAATCTA AAGGGACTTA TAACTATAAA AGATATAGAA
AAAGCTAAAG CTTTCCCAAA TGCAGCTAAA GATGAAAAAG GAAGACTTTT ATGTGGAGCT
TCAATCGGTG TAACTAATGA CATGATGGAA AGAGTAGATG CAGTAGTTAA AGCAAAGGTT
GACGTTATTG TTTTAGATAC AGCTCATGGT CATTCAAAAG GAGTTATTGA AGGAGTTAAG
AGAATAAAAG CTAAATATCC TGAACTTCAA GTAATAGCTG GTAACATAGC TACACCAGAG
GCTGTAAGAG ATTTAGCAGA AGCAGGTGCA GATTGTGTTA AGGTAGGTAT AGGACCTGGA
TCAATATGTA CTACAAGAAT AGTTGCAGGT GTTGGGGTTC CTCAATTAAC AGCAGTTATG
GATTGTGCTG AAGAAGGTAA AAAATTAGGA ATACCAGTAA TAGCTGACGG TGGATTAAAA
TATTCAGGAG ATATAGTTAA GGCTTTAGCA GCTGGAGCTT GTGCAGCTAT GATGGGATCA
ATCTTTGCAG GATGTGAAGA AGCACCAGGA GCTATAGAAA TATATCAAGG TAGAAGCTAT
AAAGTTTACA GAGGAATGGG TTCACTTGGA GCTATGGCTA AAGGATCAAG TGATAGATAT
TTCCAAAATG GTACTAAGAA ATTCGTTCCA GAAGGTGTAG AAGGAAGAAT TGCTTACAAA
GGACATTTAG CAGATACTAT CTACCAATTA ATAGGTGGAA TAAAATCAGG AATGGGTTAC
TTAGGAGCAC CAACTTTAGA AAATTTATAT GAAAATGCTA ATTTTGTTGT TCAAACTTCA
GCAGGATTTA GAGAAAGTCA TCCTCATGAT ATAAACATAA CTAAGGAAGC ACCAAACTAC
AGTGTTAACC AATAA
 
Protein sequence
MARILKTAYT FDDVLLVPNK SEVLPNEVSL KTQLTKKIQL NIPLMSASMD TVTESKMAIA 
MAREGGIGII HKNMTIEDQA REVDRVKRQE NGVITDPIFL SEDHTVREAL DLMAQYRISG
VPVTREGKLV GIITNRDIVF ETNYDKKVSE VMTKSPLVTA KEGTTLTEAL EILKQHKIEK
LPLVDDENNL KGLITIKDIE KAKAFPNAAK DEKGRLLCGA SIGVTNDMME RVDAVVKAKV
DVIVLDTAHG HSKGVIEGVK RIKAKYPELQ VIAGNIATPE AVRDLAEAGA DCVKVGIGPG
SICTTRIVAG VGVPQLTAVM DCAEEGKKLG IPVIADGGLK YSGDIVKALA AGACAAMMGS
IFAGCEEAPG AIEIYQGRSY KVYRGMGSLG AMAKGSSDRY FQNGTKKFVP EGVEGRIAYK
GHLADTIYQL IGGIKSGMGY LGAPTLENLY ENANFVVQTS AGFRESHPHD INITKEAPNY
SVNQ