Gene Clim_0323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0323 
Symbolrho 
ID6353840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp357766 
End bp359055 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content51% 
IMG OID642667952 
Producttranscription termination factor Rho 
Protein accessionYP_001942396 
Protein GI189345867 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0180092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAACA ATTCGGTTTC CAAGGGTCTG GACATCAATG TACTCCAGAA AAAGAAAGTG 
TATGAGTTGA ATGCTCTTGC AAAAGAAATA GGGGTATCTG CGGCCGGCTT ACGGAAAGAA
GAGCTGATAT TCAAGATAAT AGAGGCACAG TCACAGAAAA ATACGGATCC TGAAGGCGCC
CAGGTGATGG TCAATACCGG AGTTCTGCAG GTTATTCCTG AAGGATACGG ATTTTTGCGT
TCCGCAAATT ACAACTATCT CTCCTCTCCT GACGATATCT ATGTTTCTCC GTCCCAGATC
AAGCGTTTCA ATATGCGAAC CGGTGATACC GTATCCGGTC AGGTGCGGGC TCCGAAAGAG
GGTGAGCGTT TTTTTGCCCT GCTGAAAATC AATACCATCG ACGGAAACGA TCCTGAAATC
ACCAGGGAAC GGCCTTTTTT TGAAAACCTA ACCCCGCTCT TTCCCAATGA ACGCCTGAAG
CTTGAAACCC GCCAGACGGA GTATTGCGGC AGGATCATGG ATATCTTCAC TCCGATCGGC
AAGGGACAGC GCGGTCTGAT CGTCGCACAG CCGAAAACAG GAAAGACCAT GCTGCTGCAG
ATGATCGCCA ATGCGATCAT TAAAAACCAT CCCGAAGTTT TTCTGATCGT GCTTCTGATC
GATGAACGTC CCGAAGAGGT TACCGACATG GCGCGCAGCG TCGAGGCTGA AGTGGTGAGT
TCCACCTTCG ACGAGGATCC CGAGCGTCAC GTCCAGGTTG CCGATATGGT GCTTGAAAAG
GCCAAGCGGC TTGTCGAAGT AGGAAGGGAT GTGGTGATTC TGCTCGATTC CATCACCAGG
CTCGCTCGTG CGCACAATAC CATCATTCCT CACTCCGGCA AGATTCTTTC CGGCGGTATC
GATGCCAACG CGCTCACCAA ACCGAAACGT TTCTTCGGTG CGGCCCGCAA CATCGAGGAG
GGAGGCAGCC TCACCATCAT CGCTACGGCG CTTGTCGATA CCGGCTCCCG GATGGATGAC
GTTATTTTTG AGGAGTTCAA GGGTACCGGT AACATGGAGC TTGTGCTCGA TCGCAGGCTT
TCCGAACGCA GAATTTTTCC GGCCATCGAT ATTCTCCGTT CCGGAACCCG GAAGGAGGAA
CTGCTCTTCA GTCAGGAAGA GCTGTCGAGA ACCTGGCTGC TGAGAAAATA CCTTGCAGAC
AAGAATCCTG TCGAGTGCAT GGAGTTCATG CGCGAAAAAA TGAGTGACAC AAAGGACAAC
AAGGATTTTT TCAAATACAT GAACGCTTGA
 
Protein sequence
MSNNSVSKGL DINVLQKKKV YELNALAKEI GVSAAGLRKE ELIFKIIEAQ SQKNTDPEGA 
QVMVNTGVLQ VIPEGYGFLR SANYNYLSSP DDIYVSPSQI KRFNMRTGDT VSGQVRAPKE
GERFFALLKI NTIDGNDPEI TRERPFFENL TPLFPNERLK LETRQTEYCG RIMDIFTPIG
KGQRGLIVAQ PKTGKTMLLQ MIANAIIKNH PEVFLIVLLI DERPEEVTDM ARSVEAEVVS
STFDEDPERH VQVADMVLEK AKRLVEVGRD VVILLDSITR LARAHNTIIP HSGKILSGGI
DANALTKPKR FFGAARNIEE GGSLTIIATA LVDTGSRMDD VIFEEFKGTG NMELVLDRRL
SERRIFPAID ILRSGTRKEE LLFSQEELSR TWLLRKYLAD KNPVECMEFM REKMSDTKDN
KDFFKYMNA