Gene Cphy_3850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3850 
Symbol 
ID5744802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4714233 
End bp4715741 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content38% 
IMG OID641294962 
Productflagellin domain-containing protein 
Protein accessionYP_001560936 
Protein GI160881968 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATAA ATCATAATAT TTCAGCACTT CACGGTAATA ACCAATTAAA AATTAATAAT 
AATGCATTAG ATAAGAGTTT AGAAAGATTA AGCAGTGGTT ACCGAATCAA TCGTGCAGCT
GATGATGCGG CAGGACTTGC TATCTCAAGA AAAATGAAAA CACAGATTGA AGGTTTAGAG
CAATCTTCAA GAAATGCTTC CGATGGTGTA TCAGTTATAC AGACAGCAGA GGGTGCATTA
AATGAGGTTA ATGCAATGTT ACAGCGTATG AGAGAACTCT CAGTGCAAGC TGCAAACGGT
ACCAACACTG CAGAAGATCG CTTGGCAATT CAAAAAGAGA TTAACGCGTT AAACAACGAA
ATCACCCGTA TTTCAACAGA CACTGAGTTC AATACAAAAC CTTTATTAAA TGGAAATTTA
GATTGCCAGA GTTATTCCAA TACCTCTGAT GTGGAAATGA TTTCTCTATC CGATAATGTA
GATGCAAAAG ATTATAACTT TATTATAACT GGGGATGCAA GACAGGCAGT TATGACTGGA
ATGCAATTAG GTGGACTTTC TGATCAAATT GCTGATGATC AGGCAGGCGT TATTAATATT
AATGGTATAG AGATAAAAAT TAACGCTGGT GATACCATGG AGCAGGTATT TGAAAAGCTT
CGTGGAGCCT GTGACACAAT GAATATTAAA GTGTTTGCTC AGGTTGGTAC ATCCGTAAAT
CCAGACTATG ATGGATTTGC TGGCTATGAG AGTGGACCGA TTGATAATGG TTCCCTTGTA
TTTATGACAA AGGAATATGG TTCCAATCAG ACAATTGAGA TGCATTGTGA TAACGATAAA
CTAAGCGGCT TATTAGGTAT TAGCAGTGGT GGTGCGAAAG CTATTGGTGT AGATGCAAAA
GCGACGTTAG GAAATGGTTT TTCCTCTACT GCTACGGCTT CTTGCAGTGG CAACATTATC
ACAGTAACAG ACGGTGACGG CTTTGAAATT AAATTTAAGG CTACTCCTGG AGCAGCTAAA
ACTGCATTTA CTGATCAAAC AGTAAATAAT GATGGAGCAA GCATAACAGA TGGTGCTGGT
TCTGATAATG TTTCTATTAC AGTTTTACAA GCTGGACCTA TGGATCTTCA GATTGGTGCC
AATGAAGGAC AAACGATGGA AGTACGAATT CCTCGTGTAG ATACTTATAC TCTTGGAACA
AATATTGTAA ATGTTTGTAC TCAGGAGGGA GCTTCTAGTG CAATTTCCAT TCTAAGTAAA
GCGATTACTA TGGTAACTGA TATTCGTGCA AAGCTTGGTG CATATCAAAA TCGTTTGGAG
CATGCGATTG CAAACTTGGA TGTTGGAGCT GAAAATATTA CGGAAGCTTT ATCTCGTATC
GAAGATACCG ATATGGCAAA AGAAATGTCC TTATTTACTC AGAAAAACGT GTTAGTACAA
GCAGGCACTG CTATGTTAGC GCAAGCGAAT CAGAGACCAC AGAATATTCT ATCCTTATTA
CAAAGTTAA
 
Protein sequence
MRINHNISAL HGNNQLKINN NALDKSLERL SSGYRINRAA DDAAGLAISR KMKTQIEGLE 
QSSRNASDGV SVIQTAEGAL NEVNAMLQRM RELSVQAANG TNTAEDRLAI QKEINALNNE
ITRISTDTEF NTKPLLNGNL DCQSYSNTSD VEMISLSDNV DAKDYNFIIT GDARQAVMTG
MQLGGLSDQI ADDQAGVINI NGIEIKINAG DTMEQVFEKL RGACDTMNIK VFAQVGTSVN
PDYDGFAGYE SGPIDNGSLV FMTKEYGSNQ TIEMHCDNDK LSGLLGISSG GAKAIGVDAK
ATLGNGFSST ATASCSGNII TVTDGDGFEI KFKATPGAAK TAFTDQTVNN DGASITDGAG
SDNVSITVLQ AGPMDLQIGA NEGQTMEVRI PRVDTYTLGT NIVNVCTQEG ASSAISILSK
AITMVTDIRA KLGAYQNRLE HAIANLDVGA ENITEALSRI EDTDMAKEMS LFTQKNVLVQ
AGTAMLAQAN QRPQNILSLL QS