Gene Cagg_3627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3627 
Symbolrho 
ID7269771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4406926 
End bp4408188 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content51% 
IMG OID643568434 
Producttranscription termination factor Rho 
Protein accessionYP_002464900 
Protein GI219850467 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.2483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.795626 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTAG CTGAATTAGA AAGTAAAACC CTCGCAGATT TGCGCGAGAT AGCACGAAAA 
TATGACATCT CAGGTGTTAG CTCGCTGAAA AAACGTGAAT TAATCGACAA GTTACTCCAG
GTCCAGATGG CAACCGTCGC ACCAACGACA GATACGGAAA CAATTTACAG CGACGGGATT
TTAGACATTA TGCCGGAAGG ATTCGGTTTT TTGCGCGGCA GTCGGATGCT GCCCAGCCCA
GAGGATGTCT ACGTTTCACA ATCACAGATT CGCCGCTTTG CCTTACGGAG TGGTGATCGG
ATCTGGGGAC AGATTCGCCC ACCCAAGGAG AACGAGCGCT ACTACTCACT GTTACGTGTG
GAAAAGATAA ATGACCAAGA CCCCGAAACG GCACGTAAAC GGCCGCTGTT TGATCAGCTC
ACACCGATTT TTCCCAACGA ACAGATCAAG TTAGAAACCG AACCCAATCT TTTACATACT
CGATTAGTCG ATCTGATTGC TCCCATCGGC CGTGGTCAGC GTGGCCTCAT CGTTTCACCA
CCGAAAGCCG GCAAAACAAT GCTGTTGAAG GCAATTGCCA ACGGCATTAC GACCAACTAT
CCTGACATCC ATTTGATGGT ATTGTTGATC GGTGAACGAC CCGAAGAGGT CACCGATATG
CGGCGTTCGG TACGAGGTGA GGTGATTTCT TCGACCTTTG ATGAGCCGGT AGAAAACCAC
ACAAAAGTCG CCGAAATGAC GCTTGAACGG GCGAAGCGGC TCGTTGAGAT TGGTCATGAT
GTCGTGATTC TTATGGACTC CATCACCCGG TTAGCCCGTG CTTACAATGT CGCAATGCCT
CCGAGTGGGC GCACACTATC CGGTGGTATC GACCCAATTG CACTCTACCC ACCCAAACGC
TTTTTTGGCG CCGCACGCAA CATCGAAAAC GGTGGATCGC TCACGATCAT CGCCACCTGT
CTCATCGATA CCGGTTCACG CATGGATGAC GTCATTTACG AAGAGTTTAA AGGCACCGGT
AATATGGAGC TACACCTCGA CCGGAAGTTG GCCGAAAAAC GGATCTTCCC GGCGATTGAC
ATTCAACGTT CCGGCACGCG CCGTGAGGAT CTCTTACTTA ACCCCGAGAC GCTCCGCCAA
GTGTGGACGT TGCGCCGTAT GGTGAGTATG CTCGGTGACA ATGAAGGCAC TGAGCTGATG
CTGACCCGGA TGGCAAAGAC GAAATCGAAT GCCGAATTCC TGCAAACGTT GAGCAAAAGC
TGA
 
Protein sequence
MTVAELESKT LADLREIARK YDISGVSSLK KRELIDKLLQ VQMATVAPTT DTETIYSDGI 
LDIMPEGFGF LRGSRMLPSP EDVYVSQSQI RRFALRSGDR IWGQIRPPKE NERYYSLLRV
EKINDQDPET ARKRPLFDQL TPIFPNEQIK LETEPNLLHT RLVDLIAPIG RGQRGLIVSP
PKAGKTMLLK AIANGITTNY PDIHLMVLLI GERPEEVTDM RRSVRGEVIS STFDEPVENH
TKVAEMTLER AKRLVEIGHD VVILMDSITR LARAYNVAMP PSGRTLSGGI DPIALYPPKR
FFGAARNIEN GGSLTIIATC LIDTGSRMDD VIYEEFKGTG NMELHLDRKL AEKRIFPAID
IQRSGTRRED LLLNPETLRQ VWTLRRMVSM LGDNEGTELM LTRMAKTKSN AEFLQTLSKS