Gene Cphy_1035 details

Gene Information

Locus tag: Cphy_1035
Symbol:
ID: 5741871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 1307566
End bp: 1309035
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 37%
IMG OID: 641292142
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_001558154
Protein GI: 160879186
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
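
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the annotated span ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1307566
end_bp = 1309035

# Assumption: the annotated span includes exactly one terminal stop codon.
stop_codons = 1

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - stop_codons

print(gene_length_bp, remainder, protein_length_aa)  # 1470 0 489
```

Under those assumptions the result agrees with the listed gene length of 1470 bp and protein length of 489 aa.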


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTGA GACGAAGAGG TTCTTGCGCT GAGATGGATG GCATCTTACA GTATGTAGAG 
GATGCTATGG CAGGCAAAGA TTGTGGTTGT TGTCCTTCAA GCAATCATGT GATTCATAGT
CGGGTAATTA AAGATTTTAA TACCTTGATT GAGAATGAAA AAAGGATGTC AAAGGCGGCA
AAAGAAGTAT TGGATATTGC AAGTTCCATC AGTAGTTTTG ATGTGGGAAT GTCTTATATT
TCAACGAAAT TAATGGATTT TGCGACTGAA ATGTCTTCTG TCAGTGAATC CAATCTCGCT
ATTGTGGAAG AGACGACTGC AACTATGAAT CAAGTAAATG AAACGATTGA TTATACAGCA
GGTACCTTAG AGAAACTGTC AAATGAATCT GAAATTCTAG CCTCTAAGAA TAATAATAGT
AAAGAATTAT TAGAAGATGT TACTGCACTA AAAGAAAATG TTATTCTAGA TACTAAGATT
ATGAACGACA AAATTGAGCA ACTTGTTGTT TTGGCAACTG AGGTTGGTAA GATTGTTGAA
AGTGTTCAAA CAATTGCAAA TCAAACAAAT TTGTTAGCGT TAAATGCAGC GATAGAAGCA
GCAAGAGCGG GAGAACAGGG AAAAGGTTTT TCTGTTGTAG CAGAAGAAGT TCGTAAGTTA
GCGGATGATA CAAAGCATAA CCTGGAAGGA ATGAGAGCTT TCGTAGATGA TATCCACAAT
GCTTCGAGAG AAGGAAAAGA AAGCATGGAT CGAGCTATGG AATCTACCAG TCAAATGAGT
GATAAGATTG ATATGGTATC CGAGACGATT GGTGAGAATA TCGAAATGCT CCAGGGTGTT
GTGTCTAGTG TTGGGGACAT ACATAATTCA ATGCAAGGAA TTAAACTTGC AGCTAATGAG
ATTAGCAGCG CGATGGAAAC ATCCAGCTCC GATGCGCAAC GTCTTACTGA AATGACACAG
GAAGTTTCTA AGGATGCTCA GGAGAGTGTA AAATATTCAA AGAGTATCTC TGAAATTGAC
GATCGACTAT CACATGAAAT AAGAGAAATG TTTGAAGGAT TGAGTACAGG TAATCAAGCA
GTTACCAATG AAGAGCTTCA ACTTGTGATT GAAAAGGCTG TAAAGGCTCA CTCTGAGTGG
ATGGTTAATT TAAAGAATAT TGTGGATAAC ATGAAGATAG CACCTATACA AACGAATTCA
CATAAGTGTG CATTTGGACA TTTCTATCAT GCACTTGTTA TAGATCATGA AGCAATTGAG
AAGGAATGGA AAGAAATCGA TGGATTCCAT GATCAATTCC ATAGAATGGG AGATAAGGTA
ATAAAAGCTG TAAAAGCACA AGATAGAAAG ATGGCGAATG ATTTATATAA TGAGGCATCT
GTTGTTTCTA CTCAGATACT TGGATTACTG CAGAAAGTTC ACCAAAAGAT AGAACAATTA
AATAAACAAG GAATAAAGAT TTTTGATTAA
 
Protein sequence
MRLRRRGSCA EMDGILQYVE DAMAGKDCGC CPSSNHVIHS RVIKDFNTLI ENEKRMSKAA 
KEVLDIASSI SSFDVGMSYI STKLMDFATE MSSVSESNLA IVEETTATMN QVNETIDYTA
GTLEKLSNES EILASKNNNS KELLEDVTAL KENVILDTKI MNDKIEQLVV LATEVGKIVE
SVQTIANQTN LLALNAAIEA ARAGEQGKGF SVVAEEVRKL ADDTKHNLEG MRAFVDDIHN
ASREGKESMD RAMESTSQMS DKIDMVSETI GENIEMLQGV VSSVGDIHNS MQGIKLAANE
ISSAMETSSS DAQRLTEMTQ EVSKDAQESV KYSKSISEID DRLSHEIREM FEGLSTGNQA
VTNEELQLVI EKAVKAHSEW MVNLKNIVDN MKIAPIQTNS HKCAFGHFYH ALVIDHEAIE
KEWKEIDGFH DQFHRMGDKV IKAVKAQDRK MANDLYNEAS VVSTQILGLL QKVHQKIEQL
NKQGIKIFD
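
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch, not the database's own pipeline. It assumes the sequence is in frame from its first base and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Build the standard codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the last line of the gene sequence, copied from above.
first_blocks = "ATGAGATTGA GACGAAGAGG TTCTTGCGCT"
last_line = "AATAAACAAG GAATAAAGAT TTTTGATTAA"

print(translate(first_blocks))  # MRLRRRGSCA
print(translate(last_line))     # NKQGIKIFD*
```

The two fragments reproduce the first ten and last nine residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the Gene Information section.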