Gene Cphy_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1800 
Symbol 
ID5743089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2216331 
End bp2218289 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content42% 
IMG OID641292897 
Productglycoside hydrolase family protein 
Protein accessionYP_001558908 
Protein GI160879940 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0348048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGTT TTAAAAAGAT TATAGGGGTA ATATTGTCAT TAGCATTGCT AATAACAATG 
ATACCTGCCG GTACAGGTAT TGTAAAGGCT GCTACAGGAA AGAACATCGT TGCTTACTTC
CCGAACTGGG GAATCTATAA TAGTGCCCAT CGCACTATGA CCGTTGGTAT GATTCCATGG
GACAAGGTTA CAGTAATAAA CCATGCCTTT TTTGAGGTAG ATTCTTCCTT TAAACTGGCA
TCAACAGATT CCTTCGCTGA CTTTGATAAG ATGATGGATC ATTCCGAAGG GTGGGATACT
AATCAATTAA GAGGACATTT CGGAGAGTAT AAATATTATA AGAATCTCTA TCCGAATGTA
AAAGTAATCG TATCCGTTGG TGGTTGGACA AGAGGTCAAA ATTTCCATGC AATGGCTGCT
ACTACTTCCA CCAGAGCGGT TTTTATCCAG AGTGTTATTG ATCTTCTGAG AAAATATCCG
TTTATCGATG GCGTTGATCT TGATTGGGAA TATCCTGGCA TTAATAGGGC ACCAGATCCT
AACGATTCCT ATGACAGAGG CTGCCCAGGA GGACCGGAGG ATAAGCAGAA TTTCACCTCT
TTACTACGTG AAATACGTCA GGCATATAAT AACAATGGCT TAAACGGTAA GTTGTTAACG
ATAGCTGCTC CTTCCGGTTA TGACAAGCTT GCCCTGCAGG AACCAGATAT CTATGCCCAG
TATCTGGACT TCATAAATGT TATGACCTAC GATATGCACG GGGCATGGGA AAATACAACA
AATCATCAGT CTCCACTCTA TGCAAATCCT AATGACCCTT GTCCTACTTC ACCTGTTGAC
ATTAAGAACA GGTATAATAC GGACTCCGCT ATGAAGACCT TACAGCAGGT ATATAAGATT
CCTGCCGAGA AACTGTTGGT AGGTTCCCCT TATTATTCCA GAGGATGGAA AGGTGTTACT
GGTGGAGTCA ACGGGATGTA TGCAAATGCA ACTGGAGCAG CAACCGGTAC CTGGGATAAT
CCACAATCAC CAGGTGGGCA ATATCCTTAT TTCACCTTAA AGACCATGGA GAACCAAAAT
GGATTTGTGA AATATCGCGA CGACACCTAT GCAAAGACGC CTTGGTTATA TAACGCATCT
CAGGGAATTG TACTTAGCTA TGAAGATACC ACTTCCTTAA GTGCAAGATG TGATTATATT
AACAGTAACG GATATGGCGG TTTAATTGTA TGGGAAATCT CAGGGGATAC CAGTAATTTT
GAATTAACAA CACTTGCTTT CCAGAAATTA ATTGGTAGTA CAACACAGAC AGTAGCAACA
CCAGTATTTA GTCCGGCAGG TGGTACTTAC ACTAATGCTC AGACAGTGAC TATTACCTGT
GCAACTTCCG GAGCTGAAAT TCGTTATACT AAAGACGGTA CCGAACCTAC GGCTTCTTCT
GCTCTTTACT CTTCTGCTCT TACAGTAAAT ACTACAACAA CCTTAAAAGC AAAAGCATTT
AAGAGTGGTA TGAACAGCTC GTTAACAACA ACGGTTGTTT ATACAATAGG TAATACCCAG
ACAGTAGCAA CACCGGTATT TAGCCCAGCG GGAGGAACTT ATTCAAGTGC TCAGACGGTA
ACCATTACCT GTGCAACCCA AGGAGCGGAA ATTCGTTATA CTACAGATGG TACGGAACCT
ACAGCTACCT CTGCTTTATA CTCTTCTCCT ATTACGGTAA GTTCCACTAC AACCATAAAG
GCTAAGGCAT TTATGACCGG TATGACTAGC TCCTCTACGG TCACACAAAC CTATACAATC
AGCTCTACAC AAATAACTGC TTGGACAATT GGAACAGCTT ATAAAGCAGG TGATCTGGTA
ACCTATAGCG GAAAGACCTA TAAATGTATC CAGCCACATA CAGCTTTGGC TGGCTGGACC
CCTGATGTGG TACCAGCATT GTGGGGTTTA GTACAATAG
 
Protein sequence
MKRFKKIIGV ILSLALLITM IPAGTGIVKA ATGKNIVAYF PNWGIYNSAH RTMTVGMIPW 
DKVTVINHAF FEVDSSFKLA STDSFADFDK MMDHSEGWDT NQLRGHFGEY KYYKNLYPNV
KVIVSVGGWT RGQNFHAMAA TTSTRAVFIQ SVIDLLRKYP FIDGVDLDWE YPGINRAPDP
NDSYDRGCPG GPEDKQNFTS LLREIRQAYN NNGLNGKLLT IAAPSGYDKL ALQEPDIYAQ
YLDFINVMTY DMHGAWENTT NHQSPLYANP NDPCPTSPVD IKNRYNTDSA MKTLQQVYKI
PAEKLLVGSP YYSRGWKGVT GGVNGMYANA TGAATGTWDN PQSPGGQYPY FTLKTMENQN
GFVKYRDDTY AKTPWLYNAS QGIVLSYEDT TSLSARCDYI NSNGYGGLIV WEISGDTSNF
ELTTLAFQKL IGSTTQTVAT PVFSPAGGTY TNAQTVTITC ATSGAEIRYT KDGTEPTASS
ALYSSALTVN TTTTLKAKAF KSGMNSSLTT TVVYTIGNTQ TVATPVFSPA GGTYSSAQTV
TITCATQGAE IRYTTDGTEP TATSALYSSP ITVSSTTTIK AKAFMTGMTS SSTVTQTYTI
SSTQITAWTI GTAYKAGDLV TYSGKTYKCI QPHTALAGWT PDVVPALWGL VQ