Gene Paes_0391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0391 
Symbolrho 
ID6459236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp424595 
End bp425884 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content48% 
IMG OID642724389 
Producttranscription termination factor Rho 
Protein accessionYP_002015095 
Protein GI194333235 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000575661 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0124015 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACA ATTCAGTTAC CAAAGGTCTG GACATCAATA TGCTCCAGAA AAAGAAAGTT 
CATGAGTTGA ATGCTTTAGC CAAGGAGATC GGGGTTTCAA GTGCCGGACT GCGCAAAGAG
GAGCTGATTT ACAAAATCAT TGAAGCCCAG TCACAGAAAG GGGGGAGTGC TGAAAATGGA
CAGGTGATGG TCAATACCGG GGTGCTTCAG GTTATTCCTG AAGGCTACGG ATTTCTCCGG
TCGGCAAACT ACAATTACCT CTCATCGCCA GATGATATCT ATGTTTCTCC TTCACAGATC
AAACGCTTTA ACATGCGTAC CGGCGATACT GTATCGGGTC AGGTGCGAGC TCCGAAAGAA
GGGGAGCGGT TTTTTGCGCT GCTTAAAATC GATACTATTG ACGGAAAAGA TCCTGAAATA
ACCAGAATCA GGCCCTTTTT CGATAATCTT ACCCCCCTTT TTCCCACCGA ACGCCTTATG
CTCGAAACCA AGCAGAACGA GCATTGCGGA AGGATTATGG ATATTTACAC TCCGATCGGT
AAAGGCCAGA GGGGCTTGAT TGTCGCTCAG CCGAAGACGG GTAAAACGAT GCTTCTTCAA
ACTGTGGCCA ATGCAATTAT CAAGAATCAT CCGGAGGTGT ATCTGATCGT ACTTCTGATC
GATGAGCGAC CTGAGGAGGT GACCGATATG CAGCGGAGCG TTCCTGCTGA AGTCGTCAGT
TCGACATTTG ATGAAGATCC CGAGCGTCAC GTCCAGGTTG CCGATATGGT TCTGGAGAAA
GCAAAGCGTC TTGTTGAGGT CGGTCACGAT GTGGTGATTC TTCTGGATTC CATTACCCGC
CTTGCTCGTG CCCACAACAC CATTATACCC CATTCAGGGA AGATTCTTTC CGGTGGTATC
GATGCTAACG CACTGACCAA ACCTAAGCGC TTTTTCGGAG CTGCCCGCAA TATCGAGGAG
GGGGGCAGTC TGACTATTAT TGCAACTGCG CTCGTTGATA CCGGTTCGAG AATGGATGAC
GTCATTTTTG AAGAGTTCAA GGGAACAGGA AACATGGAGC TTGTTCTCGA CCGGCGTCTG
TCGGAGCGGC GAATTTTTCC CGCTATCGAT ATTCTCCGTT CGGGTACGCG GAAAGAGGAG
CTTCTGTTCA CCCAGCAGGA ACTCTCCCGA ACATGGCTGC TGCGAAAGTA TCTCGCTGAC
AAAAACCCTA TTGAATGTAT GGAGTTCATG CGTGAAAAGA TGTCTGATAC CAAGGACAAT
AAAGAGTTCT TCAAGTACAT GAACGGTTGA
 
Protein sequence
MSNNSVTKGL DINMLQKKKV HELNALAKEI GVSSAGLRKE ELIYKIIEAQ SQKGGSAENG 
QVMVNTGVLQ VIPEGYGFLR SANYNYLSSP DDIYVSPSQI KRFNMRTGDT VSGQVRAPKE
GERFFALLKI DTIDGKDPEI TRIRPFFDNL TPLFPTERLM LETKQNEHCG RIMDIYTPIG
KGQRGLIVAQ PKTGKTMLLQ TVANAIIKNH PEVYLIVLLI DERPEEVTDM QRSVPAEVVS
STFDEDPERH VQVADMVLEK AKRLVEVGHD VVILLDSITR LARAHNTIIP HSGKILSGGI
DANALTKPKR FFGAARNIEE GGSLTIIATA LVDTGSRMDD VIFEEFKGTG NMELVLDRRL
SERRIFPAID ILRSGTRKEE LLFTQQELSR TWLLRKYLAD KNPIECMEFM REKMSDTKDN
KEFFKYMNG