Gene Cphy_3669 details

Gene Information

Locus tag: Cphy_3669
Symbol:
ID: 5742693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4507271
End bp: 4509559
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 35%
IMG OID: 641294779
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_001560755
Protein GI: 160881787
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
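
The length fields in this record can be cross-checked against the coordinates. The sketch below is only a consistency check and rests on two assumptions not stated in the record: that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4507271
end_bp = 4509559
gene_length_bp = 2289
protein_length_aa = 762

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)                          # 2289 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 762 True
```

Under those assumptions the coordinates, gene length, and protein length agree with each other.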


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000205913
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGAAA AGAGCAAAAA GTTATTTGGT AATCTCAGAA AAACAAAAAA GGGAAATGTT 
GATTTTGATG AGACTATGAT GGAACAAATG AAACAAGAAA CAAAGAAAAA ACGAAAAGTA
AAATTTAAAC GTAACAAAGA TGAGGAGTTT CAAACAGAGG GAACGAAAAA GAAACAAAAA
TTTAAGAACA AAGAATTAAA GAAGGCGATT AAAGAAGGCA TAGAAAATGA TGTCGATATT
GTTCCTAGAA AAGTAAGGAA TGGTAAGTTT TTTAGAGTTA TTGCTAGTAT TAGAACGCAA
ATGCTTATCG GTTTTACTGT ACCGATTTTA TTTGTCATAA TAGTTGGGGC AATATCATAT
ATAAAAGCTT CGAATGGGTT AGTAAGTGGT TTTGAAGATT CTGCAACAAA AACAATCGGT
ATGGCGGTGA ACTACATTGA TGTTGGAATG AGCACCCTAG AATCTGAGGC CTTTGTGCAG
TCTCAAAATG ATAATATTAC AAGTTATATA TTTAGTAGTA AAAACGAAGA ACAAATAAAA
TTGTATGAGT TGACACAGGC TATTTATGCT CAATTAACAG CTGCACAAGT AGCAAATGAA
TTCATTCAGG ATATACATAT TATACCAAAA GACCATGCAA AAGTGTTATC TACAAAAACC
AGGGGAGTAA TAGGGTTTAG TGATGCCATG AAGGATACCG ATGCCGCATC AATGACAGAA
TCAAAAAATC AAGCTGCTTG GGTTGGTAGT CATGCTATGG TTGATGACGA ATTAAAAACT
TCCTATAATA ATTATGCTTG CACATTTATC CGACAGTTTA AGTCAAAGAA TGGTTATATC
GTAATTGACT TAAGTAAGAA AAAAGTAGTT CAACTCTTAC AGGATATTGA ATTAGCAGAA
GGAAGTTATA TTTCCTTTGT AATGGATGAT GGTAGAGAAA TTTCCGCAAT AAGCGATACT
ATTGCCTTTA GCGATACTAC TTATTATAAA GAAGGTATGG CTAACAACGA AGGTAACTAT
AACACATATG TAAAAGTTGA TAAAAAGGAT TACCTCTTTA TGATGGCAAA GAGTAGTTCA
AAAGGATTCT CTATTTGCGC ATTAGTACCA AAAGCGTCTG TAATGAAAAG TGCTAGTGAC
ATTAAAGGGG TTACGGTTAG TGTCGTTGCA ATAGCTATCC TTGTTGCAGC ATTCGTGGGT
GGATTAATAT CCATTATGAT TGGTAAGAGT ATTCATCGTA TTTCGAAAAA ATTAATCAAA
GTATCTGAGG GTGATTTAAC CATTGATATG GACATCCATA CAAGCAATGA ATTTGGTATG
CTTGCCGGTA ACGTTAAGGA AATGGTAAAT AACACGAGAG ATCTTATTCA CAAAGTAGTT
CAGGTTACGA ATTTAGTTAC AGAGGCTACC AAGAGTCTTT CAGCGACATC AAGGGATATG
ACCGATTCTA GTGAGCACAT TACTACAGCA ATCAATGAGA TAGATATCGG AATTGCACAA
CAGGCTGAAG AAGCTCAGTT ATGTTGTAAC CAGATGGATG AGCTATCGAA TAAGATGGGT
ATCGTAAATA ATAATGTTAA CGAGATTCAA ACTCTTGCTG ATCAGACGCA GGTGATGATT
CAAAATGGTA TCACTACAAT GACCTTGCTA ACAAAACAAT CTCAGACAAC GAATGAAATA
ACACAACAGG TAATGACTGA TATTAAGGCT CTTCAAAAAC AGTCTGCGTC TATTGAACAA
TTCATTGGAA TTATTAATGA TATTGCAAGC CAAACAAATC TCTTATCTTT AAATGCTTCT
ATCGAAGCTG CTAGAGCTGG TGACGCCGGA AGAGGATTTG CAGTAGTTGC TGAAGAAATT
CGTAAATTAG CAGAAGGTTC TGTAAATGCA GCACAAGAGA TTCAGAAAGT TGTTGTTGAT
ATTAAAACAA AAACTGAATC AACAGTACAG ACAGCGCAAA AAGCAGAAAC GGAAGTTACT
TCTCAAGTGA AGTCTGTAGA AACAACTAGG GAAGCATTCC ACAGCATGAG CGAATGTGTA
GATAGTCTAT TAACAAATCT AAAAGAAGTA ATTGAAAATG TTGAAAATAT GAATGAAGAC
AGACAAAAGA CTTTAGATTC AATTGAAAGT ATTTCTGCAG TTTATGAGGA AACCGCAGCT
TCTTCTTCTA TTGTTAATAA CACAGCTCAG ATGCAGTTAG GACTATCCAA AACTCTTGTG
GAAGGTACGC AAGAATTAGA GCTACGTACG GAAGAGTTAA AAGATGCAAT GCGTAAGTTT
ACAGTATAA
 
Protein sequence
MIEKSKKLFG NLRKTKKGNV DFDETMMEQM KQETKKKRKV KFKRNKDEEF QTEGTKKKQK 
FKNKELKKAI KEGIENDVDI VPRKVRNGKF FRVIASIRTQ MLIGFTVPIL FVIIVGAISY
IKASNGLVSG FEDSATKTIG MAVNYIDVGM STLESEAFVQ SQNDNITSYI FSSKNEEQIK
LYELTQAIYA QLTAAQVANE FIQDIHIIPK DHAKVLSTKT RGVIGFSDAM KDTDAASMTE
SKNQAAWVGS HAMVDDELKT SYNNYACTFI RQFKSKNGYI VIDLSKKKVV QLLQDIELAE
GSYISFVMDD GREISAISDT IAFSDTTYYK EGMANNEGNY NTYVKVDKKD YLFMMAKSSS
KGFSICALVP KASVMKSASD IKGVTVSVVA IAILVAAFVG GLISIMIGKS IHRISKKLIK
VSEGDLTIDM DIHTSNEFGM LAGNVKEMVN NTRDLIHKVV QVTNLVTEAT KSLSATSRDM
TDSSEHITTA INEIDIGIAQ QAEEAQLCCN QMDELSNKMG IVNNNVNEIQ TLADQTQVMI
QNGITTMTLL TKQSQTTNEI TQQVMTDIKA LQKQSASIEQ FIGIINDIAS QTNLLSLNAS
IEAARAGDAG RGFAVVAEEI RKLAEGSVNA AQEIQKVVVD IKTKTESTVQ TAQKAETEVT
SQVKSVETTR EAFHSMSECV DSLLTNLKEV IENVENMNED RQKTLDSIES ISAVYEETAA
SSSIVNNTAQ MQLGLSKTLV EGTQELELRT EELKDAMRKF TV
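
The relationship between the two sequences can be spot-checked by translating part of the gene sequence. The sketch below is a minimal illustration, not the method the database used: the helper `translate` and the constant names are hypothetical. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). Only the first and last blocks of the gene sequence are translated here.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence above.
first_blocks = "ATGATTGAAA AGAGCAAAAA GTTATTTGGT"
last_codons = "ACAGTATAA"

print(translate(first_blocks))  # MIEKSKKLFG
print(translate(last_codons))   # TV
```

Both results match the corresponding ends of the protein sequence listed above: it begins with MIEKSKKLFG and ends with TV, followed by a TAA stop codon in the gene.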