Gene Cphy_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2003 
Symbol 
ID5743031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2471732 
End bp2472706 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content35% 
IMG OID641293100 
ProductNLP/P60 protein 
Protein accessionYP_001559110 
Protein GI160880142 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000114468 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAA CAAAGCAACT ACTGTTAAAA CGTTTGATCG TAATGTTTGG TCTTATATTT 
TCACTTGTTT GTCTTAGTCC TAAAATAGTA ACTTGGGCAG CAAGTAATTC AGAACCAAAC
GTTGGAGTAA GTACCACAAA AGAGACACCA CTGAATATCC GTGCATCTGC CAGTACTTCC
TCTGCAAATG TTTCTTCCTT AAATCCCAAT ACTCCAATTC AAGTAATCGG AAGTTCTGGT
GACTTCTATA AAGTAATCTA TAGTACCAGT GGAAATGTTG GCTATGCACA CAAATCATAT
ATCAACATAT CATCCACCAA ATATGGAACT GTTGTTACCA ATGGAGGAAC CTTAAACTTA
CGTTCATCCG CTTCCACTTC TTCTCAGATA CTCGGTAATA TCCCAAGCCA AACTGTTTTA
CCAATTATCA GTGCAGAGGA TGGATGGTAC AAGGTTGTAT GGGGTAAATC GGTTGGATAT
GTAAGTAGCA CCTATTTTAA ATCTGGTACT TCTTCAGAAA ATTCAGAAAC AAGTAACTCT
TCTTCAACAT CCCCTACCAG AAATGAGATT GTAGAATATG CAAAAACATT TCTAGGTATA
TACTATCAAT GGGGAGGAAA TTATCCGCAA GGAAGTAGTT ACGGTTTAGA CTGTTCTCAT
TATACTTATC AAGTATTTAA GAAGTTTGGT TTAATGAATT CCTATATGGT TTCTGCTGAC
CAAGCTAATT ATGTAAAGAA AATTACACGA AGTGAATTAA AACCAGGAGA TTTAGTATTT
TTTAAATCCA AATCCAGTGG TAATGTAGTA CATGTTGCAA TCTATATTGG AGATGGACAA
ATCATAGGTG CTAATGGTGG AGATTCTAGC GTAAATTCAA TAGAAACCGC AAAGAAAAAG
AATGCAATGG TAAAGATTCA ATCCGTTGAT TATGATTCAA GAGAAAAAAT ATACGGTCGT
ATTCCAGGAC TATAA
 
Protein sequence
MLKTKQLLLK RLIVMFGLIF SLVCLSPKIV TWAASNSEPN VGVSTTKETP LNIRASASTS 
SANVSSLNPN TPIQVIGSSG DFYKVIYSTS GNVGYAHKSY INISSTKYGT VVTNGGTLNL
RSSASTSSQI LGNIPSQTVL PIISAEDGWY KVVWGKSVGY VSSTYFKSGT SSENSETSNS
SSTSPTRNEI VEYAKTFLGI YYQWGGNYPQ GSSYGLDCSH YTYQVFKKFG LMNSYMVSAD
QANYVKKITR SELKPGDLVF FKSKSSGNVV HVAIYIGDGQ IIGANGGDSS VNSIETAKKK
NAMVKIQSVD YDSREKIYGR IPGL