Gene Cphy_3022 details

Gene Information

Locus tag: Cphy_3022
Symbol:
ID: 5743348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3690365
End bp: 3692338
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 36%
IMG OID: 641294123
Product: heparinase II/III family protein
Protein accession: YP_001560118
Protein GI: 160881150
COG category:
COG ID:
TIGRFAM ID:
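
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3690365
end_bp = 3692338

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1974 0 657
```

Under these assumptions the result agrees with the listed Gene Length (1974 bp) and Protein Length (657 aa).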


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000040262
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGAG AAAAATTGCA AAAATTCTGG TGTGGTAACA ATGTTAATAA AGCCGCAGAA 
TTTTTTAAAG ATAATTATAG TTTAAGTGAG AAAGAGCTAA TTGAACAGGC TAATTTAGTA
TGTGATAATA CCTTTGTATT TAGAGAACAC TGGGAAATGG AGAGAACAAA CCAACCGGTT
ACATTTAACG GTAATGTGGA ATGGGACTGT ATTCCTTCTG GTGATCCGGA ATGGACTTTC
GCATTAAACC GTCATACCTG TTTTGTAAAT CTAGCAAAGG CATGGAACTG TACTAAAAAT
GAGAAGTATG CTGAAAAATT CATGGAACTT GCGAAGGACT GGATGGAACG GGTACCGCTT
ACTGAGGAAA GTAAGCAAAA TACTTGGAGA AGTATTGAGG CGGGAATACG ATGTGAGAAT
TGGTTAAAGA GCATGATGCT TTTTGCAGAT TGCGGACAGA TACCGGAAAG CTTCTGGGAT
GAATTTGAAA GGATACTTTT CTTACATGGA GAATATCTTA TGTCTGTAAA CGGAGTCTTT
CATAGGCTAA GCAACTGGGG AATTCTTCAA AATCATGGAC TTTTACTTGC AGGAATTTAT
TTTGAAAATA ATGATTGGAT AAAAGAAGCA GTAAAAAGAC TGGAAGAGGA ATTGTATGTA
CAGATCTTTG AAGACGGTAC ACAATGGGAA CAAAGCCCGA TGTATCATGG AGAAGTATTA
CATTGCAGCA TAGACAGTAT AGAACATATG AAACGCTTTG GAATTCAGAT ACCGGCACAG
ATGAGCAAAA AAGTAGAGAA CATGTTATAT GCACTAGCTA TGTGGTGTAA ACCAAATGGG
AACATTCCTT GTCAGTCTGA CAGTGATGAT ATCGATGCCG GTGATTTACT TGTGCATGGA
GCATGTTTCT ATAATGATGG TATATTAAAA TTTTTATCAG ATACTAGATT CAGAGAAGAA
AATATATGGA ATCTTGGGAT AGAGGACTAT TTATTATATG AAAAAATAGA AGCCTTAGCA
CCAAAAGAAA CCTCCTATGC ATTAGCAGAT AGTGGTAATT ATATGCTAAG AAGCGGATTT
GATAAGGATG CAGATTATTT AAGATTTCAT AATGGATGTA TGGGAAGCGG ACATGGACAT
GGAGACCTAT TGCATATTGA TTTATTTTCC CATGGAGAGG ATATTTTAAT TGACACAGGC
CGCTATACTT ATGTAGATTC AAAAATAAGA AGAGATTTTA AGAGCCCTTC TGCCCACAAT
ACCATATGCG TAGATGATGA AGAATTTTCT GTATGTGCAA ACAGCTGGGG ATATGAAAAG
ATGGCTCAAC CGATAAAGGG AGAATACCGT TTTACTGAAA TGGCGGATTT TGTATCCGGT
GCCCATTTAG GTTATATAGA GAAAGGAATC TTTCTATCTA GAAAAGTAGT TTATATTAAG
CCAAGTATAT ACGTAATAAT GGATGAATGC TATGGTAGTG GCACGCATAG ATATATGCAA
AATTGGCATT TCTCTGGAGA TGGAAATATT ATTCCTGACG AAAAACAAGT AACCTTTGAG
GGAAAAAAGG GTAATGCAAA ATTCTTCTTC CTTCATGGAC AAATCGCGCT ATCTAAAAAA
GAATGGTCTG CGGAATATAA CAGTTTAAAA CCTTGTAACC ATGTTTCGGT TTCCATGGAG
GAAACAGGAA GTACTTCAAT GATTACTGTG ATTTCTACTG GGGAGAAGGG CGATAATCAA
GATGTTATAG TAGAGGAAAT TCCTGTCTCA CTAGAAAAGA GCGGGAAGAT ATTAACCGAT
AAACAGGCAG AAGCAATTAG AATTAAAATA AACGAAGAAG AATATGTTGT TGTATTTTTA
CATGAGGAAA TTATTAGCGA AGTGGATTTG ATATTCGCAG GTGGTTATAG TTCTTATGGA
AAGGTGTTAT TATTCTCTAA TAAGAATCCA AGAGGCATCT GTTTACAATA TTAA
 
Protein sequence
MRREKLQKFW CGNNVNKAAE FFKDNYSLSE KELIEQANLV CDNTFVFREH WEMERTNQPV 
TFNGNVEWDC IPSGDPEWTF ALNRHTCFVN LAKAWNCTKN EKYAEKFMEL AKDWMERVPL
TEESKQNTWR SIEAGIRCEN WLKSMMLFAD CGQIPESFWD EFERILFLHG EYLMSVNGVF
HRLSNWGILQ NHGLLLAGIY FENNDWIKEA VKRLEEELYV QIFEDGTQWE QSPMYHGEVL
HCSIDSIEHM KRFGIQIPAQ MSKKVENMLY ALAMWCKPNG NIPCQSDSDD IDAGDLLVHG
ACFYNDGILK FLSDTRFREE NIWNLGIEDY LLYEKIEALA PKETSYALAD SGNYMLRSGF
DKDADYLRFH NGCMGSGHGH GDLLHIDLFS HGEDILIDTG RYTYVDSKIR RDFKSPSAHN
TICVDDEEFS VCANSWGYEK MAQPIKGEYR FTEMADFVSG AHLGYIEKGI FLSRKVVYIK
PSIYVIMDEC YGSGTHRYMQ NWHFSGDGNI IPDEKQVTFE GKKGNAKFFF LHGQIALSKK
EWSAEYNSLK PCNHVSVSME ETGSTSMITV ISTGEKGDNQ DVIVEEIPVS LEKSGKILTD
KQAEAIRIKI NEEEYVVVFL HEEIISEVDL IFAGGYSSYG KVLLFSNKNP RGICLQY
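
The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is a minimal illustration, not the method used to produce the record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
# Codon table built from the usual TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
first_block = "ATGAGAAGAG AAAAATTGCA AAAATTCTGG"
last_codons = "CAATA TTAA"

print(translate(first_block))  # MRREKLQKFW
print(translate(last_codons))  # QY
```

The first ten residues and the final two residues match the start (MRREKLQKFW) and end (QY) of the listed protein sequence, with TAA as the terminating stop codon.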