Gene Cphy_0410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0410 
Symbolrho 
ID5745170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp520544 
End bp522496 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content36% 
IMG OID641291522 
Producttranscription termination factor Rho 
Protein accessionYP_001557536 
Protein GI160878568 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000426411 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATT TGTATGAAAC AAAGTCACTT TCTGAATTGC GTGATATTGC GAAAGAAAAG 
GGTTTGAAAA GTATTAGCGC ATTACGTAAG CAGGAACTGA TAGATGCTTT GAATGCATTG
GAGAAAGGAC AAGGATCAGT TGCCTCCATA AATAAAAGTA CGGATGATAA ACCTATAAAG
CTTGGAACAG AAGAAGTTAA ATTGGCAACA GAGGATACAA AGCAGAGTGT AGAGAATCGT
GGTATGGAAT CTTCTCATAT GCAAGGAAAT CGAAATAACG TGATACGCAG GCTACCAGAG
CAGAATCGTC AACATGAATT GGTAAAAGCA CCTGATCAAT CTAAGGGAAA TGATAATAGC
CGAATGAGTG AGAGCTATCG CAGTAATGAA GGTCGGAACA TGGATAATCG TTTAGAGAAC
AGAAACAACG ATAATAAAAA TAACGATAAT AGAAATAATG AAAATCGTAA TAATGAAAAT
AGAAACTATG AAAATAGTAG AAATTATGAA AATAGAAGTA ATGAAAATAG AAACAATGAA
AATAGAAACA ATGAAAATAG AAACAATGAA AATAGAAGCA ATGAGATGCT AAGAAATTAT
GATAATAACC GCCGTACGAT GGATAATCGC AGACCAATGG ATAACCGCAG ATTAAATGAA
AATCGCGGAT ATAATGAATC GGATCGTGTA GTCAGAATAA ATCCAACAGG GGATATTTCA
GGTAATCAAG CTACTGAGTT AGGCTATAAT ACAACTTCCT CATCGGATCG TTTTCTTTCA
CAAGCAAATC AACAGGAAGA AAAATTGCCA ATTGATATGG AACAATTGGA TTCTGGTGAA
ACAAAAGAAG GAATTTTAGA GGTATTATCC GATGGATACG GATTTATTCG TTGTGATAAT
TTTTTACCTG GGGAAAATGA TGTTTATGTT TCTCCGGCAC AGATTAGAAG ATTTAATTTA
AAAACTGGTG ATATCATTGT AGGAAATACT AGAATACGTA ATCAAAACGA AAAGTTCAGC
GCTCTTCTTT ATATCAAGCT TATTAATGGA TTACATCCTT CTGAAGCAGT AAAACGGAAG
AAATTCGAAG ATTTAACACC GATCTTCCCA AATGAAAGAA TTCATCTTGA AACGCCTGGC
TGTCAGGTAG CTATGCGAAT GGTAGACTTA ATTTCACCAA TTGGTAAGGG ACAAAGAGGT
ATGATTGTAT CCCAGCCAAA AACAGGTAAG ACTACCTTGT TAAAACAAGT TGCTAAGGCT
ATTACAAGAA ACCATCCTGA GATGCATTTA ATCATTTTAT TAATTGATGA GCGACCAGAA
GAAGTTACCG ATATTAAAGA GTCTATAGAA GGTGGTAATG TAGAAGTAAT TTACTCCACT
TTTGATGAAT TACCAGAGAA TCATAAGCGA GTTTCAGAGA TGGTTATAGA ACGTGCTAAG
CGTCTGGTAG AGCATAAAAA AGATGTTGTT ATCTTATTAG ATAGTATCAC AAGACTTGCA
AGAGCCTATA ACTTAACGGT ACAAGCTAGT GGACGTACCT TATCCGGTGG TCTTGACCCT
GCAGCACTTC ACATGCCGAA AAAGTTTTTC GGAGCAGCAA GAAATATGAG AGAGGGCGGA
AGCCTCACTA TTTTGGCAAC AGCGTTAGTT GAAACCGGTA GCCGTATGGA TGATGTTGTA
TTTGAGGAAT TTAAGGGAAC CGGTAATATG GAACTTGTAT TGGATCGTAA TTTATCAGAG
AAACGAATCT TCCCAGCAAT TGACCTTCCA AAGTCTAGTA CACGTCGTGA TGATTTATTA
CTAAATTCCG CTGAGGTGGA AGCAAATTAT CTTATGAGGA AGGCGTTAAA CGGTCTTAAA
TCAGAAGATG CAGTAGAACG AATTATTCAA ATGTTTGTTA ATACAAAAAA CAATGCAGAA
TTTGTCGAAA TGATTAAGAA AACAAAGATA TAA
 
Protein sequence
MSNLYETKSL SELRDIAKEK GLKSISALRK QELIDALNAL EKGQGSVASI NKSTDDKPIK 
LGTEEVKLAT EDTKQSVENR GMESSHMQGN RNNVIRRLPE QNRQHELVKA PDQSKGNDNS
RMSESYRSNE GRNMDNRLEN RNNDNKNNDN RNNENRNNEN RNYENSRNYE NRSNENRNNE
NRNNENRNNE NRSNEMLRNY DNNRRTMDNR RPMDNRRLNE NRGYNESDRV VRINPTGDIS
GNQATELGYN TTSSSDRFLS QANQQEEKLP IDMEQLDSGE TKEGILEVLS DGYGFIRCDN
FLPGENDVYV SPAQIRRFNL KTGDIIVGNT RIRNQNEKFS ALLYIKLING LHPSEAVKRK
KFEDLTPIFP NERIHLETPG CQVAMRMVDL ISPIGKGQRG MIVSQPKTGK TTLLKQVAKA
ITRNHPEMHL IILLIDERPE EVTDIKESIE GGNVEVIYST FDELPENHKR VSEMVIERAK
RLVEHKKDVV ILLDSITRLA RAYNLTVQAS GRTLSGGLDP AALHMPKKFF GAARNMREGG
SLTILATALV ETGSRMDDVV FEEFKGTGNM ELVLDRNLSE KRIFPAIDLP KSSTRRDDLL
LNSAEVEANY LMRKALNGLK SEDAVERIIQ MFVNTKNNAE FVEMIKKTKI