Gene Cphy_2643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2643 
Symbol 
ID5742810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3228999 
End bp3230090 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content35% 
IMG OID641293735 
Productmicrocompartments protein 
Protein accessionYP_001559743 
Protein GI160880775 
COG category[C] Energy production and conversion
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4577] Carbon dioxide concentrating mechanism/carboxysome shell protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGCAT TAGGAATGAT CGAGGTGATT GGATTGCCAC CTGCAATTGA AGCAGCGGAT 
TCTGCTCTTA AAGCAGCAAA CGTTTCTTTA CTTGCAATAG CAAATGCGGA TGCGGGAATC
CTAACTGTAG AAATCACCGG AGATGTTGGG GCAGTTTTAG CTGCTGTAGA TGCAGGCGCC
GCAGCAGCAG AACGTGTTGG TACGCTTCGT GCAAAACATG TGATACCGCA TGTAGATGAA
AGTCTTACGG ATAAGGTATT AATGAAGGGA ACCAAACTTT TTCAACCAAA GAACAAACAG
GTAATGTCTA GCGGTCAAGG CGTTGATTCT GGGTTTACTA ACTCTCAAGC AGATGGTGCC
TCGAATGCCT CTAAGGCTCA TGATACGAGT GTATTTTCAA TAGATTCTAT TCAAAAAGAA
GGTAGTGTAC AAACAGATGG AATGAATTCT AGAGTAGAAG ATACTCAGAA AAAAGAGAGT
ACTTCAAAGG AAGAGAGTAC TGTAAAGCAA GATAATTCTT TAAAGGAAAA AAGTGCTTTA
TTAGATAATG TTTTGTTAGA AGATAATGTG TTAAAAGAAA ACGTTTTGAA AGAAGATAGT
GCTTTAAAAA ATAATGCTTT CAAAAATAAT TATGTAAAAG AAGATAGTGC GTTAAAAGAA
GAAAGTACTT TAAAAGAAGA AAGTACTTTA AAAGATAGTG TTACAGAAGA AGATAGTGTT
TTAAAAGATA AAAACATTAT TAAAAAGGAT AGCACTATAA AAGAAGGTAG CACTATAAAA
GAAGATGGCA TTATAAAAGA AGATGGCACT AAAAAAGATA GCACTATAAA AGTAGAAAAT
AGAAACGATG AAATAAATGC TGTCTCACAG AACACTACTA ACTTTGGAAG AGATGATATC
TTTGGTGAAA GTACAATCAC AAGTAAAAAG AATGACCAAC AGAACTATAC CGTTAATGAT
TTAAAGAAAA AGAGTAATGA TTCACTTCGT GCAATCTTAC AAAGTAAAGG CGTGGAATTA
ACAGAGCTTC ATAAGAGTGC AAAGAAACAG GAATTAATTC AACTAATTAT AGCGCAGCAA
AATCGTAGAT AA
 
Protein sequence
MQALGMIEVI GLPPAIEAAD SALKAANVSL LAIANADAGI LTVEITGDVG AVLAAVDAGA 
AAAERVGTLR AKHVIPHVDE SLTDKVLMKG TKLFQPKNKQ VMSSGQGVDS GFTNSQADGA
SNASKAHDTS VFSIDSIQKE GSVQTDGMNS RVEDTQKKES TSKEESTVKQ DNSLKEKSAL
LDNVLLEDNV LKENVLKEDS ALKNNAFKNN YVKEDSALKE ESTLKEESTL KDSVTEEDSV
LKDKNIIKKD STIKEGSTIK EDGIIKEDGT KKDSTIKVEN RNDEINAVSQ NTTNFGRDDI
FGESTITSKK NDQQNYTVND LKKKSNDSLR AILQSKGVEL TELHKSAKKQ ELIQLIIAQQ
NRR