Gene Cphy_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1020 
Symbol 
ID5741856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1289839 
End bp1291104 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content33% 
IMG OID641292127 
Producthypothetical protein 
Protein accessionYP_001558139 
Protein GI160879171 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAGAA AGGGAACAAT ACTTTTTTTA CCAGTATTAA TCTTATGTAG TTTAGTTAGT 
GGCTGTAATA GGAAAGAAAA CAAGATAAAT GAAAATAATG AAGTGTCTAT GGAAAGTGAC
TCCTTGACAT TAGGAGGTGA ATGGAATGTG GAAGATCGTG TGAAACCACT TTGGAGTACA
AAAAGAGTAG AAGAAGAAAG TGTGATGTTA GTTTCAGTTG ATGGTGCAAT GCCTTGCGGT
AATTTGATTT TTAAGCCTAC TAAAATTATA AGTGTGTATA CTTATGATAT GATAATGGGC
ATTAAGAGAG AATGGAAAGA AGGAGAGGAC TTTGTTATAG ATGATAATCA AATCACAGCA
CTCAGTGAGG ATATACCATC TATGACATCA AAACAGGTTT TAGGGGAAGA GCGAATTCCA
GGTTTTAATT ATTCGGATAT TCCTAGTGTT ACAGCGGGAC TGTATTTACC ATTTACGGAG
GGTACTGAAA TTATATCTAA ACAGATTTAT GTAACTTATG AGCATGAAGA TGAATTTCTT
GGTCATGTAC CAACGTATGC TCTGGAAAAA TTACCACGAA CGAAAGAAAA ACTTAGGAAT
AAAGGAGAAC TTCAATTATT TGTCTATGGA GATTCTATTT CAACAGGTGC AAATAGCACA
GGGTATCTGA ATGTCTATCC TTATAAAGAC AGTTGGCCTG ATATGGTTGC AAAAAATCTA
AGCAGACACT ATGATACGGA AGTTGTTTTA TTGAATAAGG CTGTAGGTGG TTGGACAAGT
GAAAATGCTA TTAAAAGTAC AGAGTCTACA GGTTGGGTTA AGGGGAAAGA AATTAAACAA
TCAGGTATTA AGGGAACACT TGGTGAGATG CCAGATTATT ATCCTGATCT AGTTATCTTA
GGTTTTGGTA TGAACGATGC TACACTTGGA GTTGGAAAAC AAGAGTATAA ATTATATATG
AATAAAATAA TTAAAACAAT ACGAGAGCGT AATAAGGATT GTGAATTTAT TTTGTTAGGA
AGTATGTTAG CAAATCCAAT GGCTATCAAT CAATCAAAAA ATCAAATTTC TTATTTTAAA
ATATTAGAAG AAATTGTGGA AGAAAATGAA GGAATTGTTG CTGTTAATAT TGGCCAGATA
CATCAGGATT TATTAGATGC GGGCAAGGGT TATTTAGATA TGACTAGTAA CAATGTCAAT
CATCCAAATG ATTTTTTTGC AAGTTGCTAT GCTATGAGTA TCTTGTCATT ATTAATTAGC
CATTAA
 
Protein sequence
MVRKGTILFL PVLILCSLVS GCNRKENKIN ENNEVSMESD SLTLGGEWNV EDRVKPLWST 
KRVEEESVML VSVDGAMPCG NLIFKPTKII SVYTYDMIMG IKREWKEGED FVIDDNQITA
LSEDIPSMTS KQVLGEERIP GFNYSDIPSV TAGLYLPFTE GTEIISKQIY VTYEHEDEFL
GHVPTYALEK LPRTKEKLRN KGELQLFVYG DSISTGANST GYLNVYPYKD SWPDMVAKNL
SRHYDTEVVL LNKAVGGWTS ENAIKSTEST GWVKGKEIKQ SGIKGTLGEM PDYYPDLVIL
GFGMNDATLG VGKQEYKLYM NKIIKTIRER NKDCEFILLG SMLANPMAIN QSKNQISYFK
ILEEIVEENE GIVAVNIGQI HQDLLDAGKG YLDMTSNNVN HPNDFFASCY AMSILSLLIS
H