Gene CPF_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2471 
Symbolrho 
ID4203781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2740270 
End bp2741739 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content29% 
IMG OID638083336 
Producttranscription termination factor Rho 
Protein accessionYP_696885 
Protein GI261876159 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.94389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTAAAT TGAATATAGA ACAATATGAA AATATGACTT TAGTTCAGCT AAAAGAGCAG 
GCTAAAGAAT TAGGAATAAA AAATATAAGT AAAAAGAAAA AAAGCGAATT AATTGAGGAA
TTAAAAGCTG AATTCAATAA ACAAAATAGC AATACTAAAG TAATACATAA AGATGGTGTT
ATTTTAAGAG AAAAAATATC TCAAAAAGAG CAAGGTGGAA ATAAAGAAAG TTTTTCTGGT
GAAAGAGTAG TTAGAAATAG CAATAATTAT AATAACAATA ATAGTAATTA TAATAATTCA
AATGATGAAG GAAAAAAAGA GCAATTAAAG GACATGATAA GTTCATCAGA CTCTGCTAAG
GGAATACTTG AAATACTAGA TAATAATAAT TTTGGTTTTT TAAGATGTAG AAATTATTTA
ACAAGTGAAG ATGATATATA TGTTTCTCCA TCACAAATAA GAAGATTTGG ATTAAGAACT
GGGGACGAAG TCCAAGGTAA AGTTAGAATA CCAAAGGATG GAGAAAAGTT TAAAGCTTTA
CTTTATGTAG AGAGAGTAAA TGGTGAAAGT CCTGAAAAAG CAGTTGGAAG AAAAAAATTT
GAAGAGTTAA CACCAATATA TCCAAAAGAG AGATTAAGAC TTGAGACTGA AAATGGAAGA
GATTTAAGTT CAAGACTTAT GGATATAATT TGTCCAATAG GTAAGGGACA AAGGGGGATG
ATAGTTGCAC CACCTAAGGC TGGAAAAACA ACTCTTTTAA AAAGAATAGC TCAAAACATA
TCAAAAAATA ATCCTGAAGT TAAACTTATA GTACTTTTAA TAGATGAAAG ACCAGAAGAA
GTAACGGATA TGAAGAGAAG TATAGATGGA GAAGTAATCT ATTCAACTTT TGATGAAGAA
CCACAAAACC ATGCTAAAGT TTCAAGTATA GTATTAGAAA GAGCTAAAAG AATGGTTGAA
CAAGGAAGAG ATGTAGTTAT CTTAATGGAT TCTTTAACAA GATTATCAAG AGCCTATAAC
TTAACAGTTA CTCCATCTGG AAGAACTTTA TCAGGGGGAC TAGATCCAGG GGCACTTATA
ATGCCTAAGA AATTCTTTGG AGCAGCTAGA AATTTAGAAG AGGGCGGAAG TTTAACAATT
TTAGCAACTT CTTTAGTTGA TACAGGAAGT AGAATGGATG ATATGATTTT TGAAGAGTTT
AAGGGTACTG GAAATATGGA GGTACACTTA GATAGAAAGC TTCAAGAGAG AAGAATATTC
CCAGCTATAG ATATTTATAA ATCTGGTACT AGAAAAGAAG ATTTATTATT TAATGAAGAA
GAAAGAGAAG CTTCATATAA AATAAGAAGA GTATTACAAA AGGAAAATAA TATAGAGGAT
GTAGCTGAAA AGCTTATAAA TTTATTATCA AAAACAAAAA ATAATAAAGA GTTTTTACAA
GTTGTTTTAA AAAGCAATTT AGAAAATTAA
 
Protein sequence
MLKLNIEQYE NMTLVQLKEQ AKELGIKNIS KKKKSELIEE LKAEFNKQNS NTKVIHKDGV 
ILREKISQKE QGGNKESFSG ERVVRNSNNY NNNNSNYNNS NDEGKKEQLK DMISSSDSAK
GILEILDNNN FGFLRCRNYL TSEDDIYVSP SQIRRFGLRT GDEVQGKVRI PKDGEKFKAL
LYVERVNGES PEKAVGRKKF EELTPIYPKE RLRLETENGR DLSSRLMDII CPIGKGQRGM
IVAPPKAGKT TLLKRIAQNI SKNNPEVKLI VLLIDERPEE VTDMKRSIDG EVIYSTFDEE
PQNHAKVSSI VLERAKRMVE QGRDVVILMD SLTRLSRAYN LTVTPSGRTL SGGLDPGALI
MPKKFFGAAR NLEEGGSLTI LATSLVDTGS RMDDMIFEEF KGTGNMEVHL DRKLQERRIF
PAIDIYKSGT RKEDLLFNEE EREASYKIRR VLQKENNIED VAEKLINLLS KTKNNKEFLQ
VVLKSNLEN