Gene Cagg_1356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1356 
Symbolrho 
ID7268648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1680458 
End bp1681711 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content56% 
IMG OID643566199 
Producttranscription termination factor Rho 
Protein accessionYP_002462699 
Protein GI219848266 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00013646 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000585861 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTTCAG GAATTGCCGA AACCGCCGAG AAGGTTCGTC GTCGGCGTCG CCGGGTGAAT 
GGTGAGGCGA GCGAATCTAC GACCAATGAA ACCGTGGCGG TGACCACTCC ACCCACAACC
ACAACGGTAA TTGAAGAACC GGTGACGATG CAAGGTTCTG GTATCCTCGA AATCGTTCCT
GATGGTCACG GTTTTCTCCG CAATCCTCGC CTGACACCTG GCACCGATGA TGTCTACGTG
GCACAGTCGC AAATTCGTCG TTTTAACCTG CGTACCGGCG ATATGATCGA AGGACGGGTG
CGTCCGCCGA AAGAGGTCGA GCGTTATCCG TCGCTGCTCT ATGTCGAGCG GGTGAATGGC
TTGCCGGCGG AAGCTGCACA AAAACGGCCA CTCTTCGAGC ATTTGACACC GATCCATCCC
AATGTCCAAA TTGTGCTCTC GACCGAGGCG AATATTCTGC CTACCCGGAT TGTTGATGTC
ATTGCGCCGA TTGGGCGTGG GCAGCGCGGG TTGATCGTTG CGCCACCCAA GGCCGGGAAG
ACGATGCTCC TGAAGGCTAT TGCCAACGGT ATTACGACCA ATGCACCCGA TATTCAGTTG
ATCGTGCTGC TGATCGGTGA GCGACCCGAA GAAGTGACCG ATATGCGGCG GTCGGTGCAG
GGCGAAGTGG TGGCCGCTAC CTTTGATGAA CCGGTTGAGC AGCATATTAA GGTTGCTGAA
TTGGTACTGG AGAAGGCAAA ACGACAAGTT GAGCACGGTC GCCACGTGGT GATCTTGATG
GACTCGCTGA CCCGCTTGAC CCGTGCCTAC AATATCGCGA TGCCGCCTAG TGGACGAACA
CTTTCCGGTG GTGTCGATCC CGCTGCCCTC TATCCGCCAA AACGTTTCTT CGGCTCGGCT
CGCAATATTG AAGATGGTGG TTCCCTCACT ATTATTGCGA CCTGTCTGGT CGATACCGGT
TCACGGATGG ATGATGTGAT CTACGAAGAG TTCAAAGGCA CCGGTAACAT GGAACTGCAT
CTCGATCGTA AACTGGCCGA GAAGCGCATC TTCCCCGCCG TTGATATTCA GCGCTCGGGT
ACGCGCCGTG AGGATCTGTT GCTCGATCCG GTAACGCTGC GCCAGAGCTG GATGTTGCGA
CGGATGGTCA GCATGGTTGG TGAGAATGAA GGCGCTGAGC TGATGCTGAC CCGGATGGCG
AAGACGAAGA GCAACGCTGA GTTTTTGGCG TCGCTCGGTA AGGTGGGTTC GTGA
 
Protein sequence
MTSGIAETAE KVRRRRRRVN GEASESTTNE TVAVTTPPTT TTVIEEPVTM QGSGILEIVP 
DGHGFLRNPR LTPGTDDVYV AQSQIRRFNL RTGDMIEGRV RPPKEVERYP SLLYVERVNG
LPAEAAQKRP LFEHLTPIHP NVQIVLSTEA NILPTRIVDV IAPIGRGQRG LIVAPPKAGK
TMLLKAIANG ITTNAPDIQL IVLLIGERPE EVTDMRRSVQ GEVVAATFDE PVEQHIKVAE
LVLEKAKRQV EHGRHVVILM DSLTRLTRAY NIAMPPSGRT LSGGVDPAAL YPPKRFFGSA
RNIEDGGSLT IIATCLVDTG SRMDDVIYEE FKGTGNMELH LDRKLAEKRI FPAVDIQRSG
TRREDLLLDP VTLRQSWMLR RMVSMVGENE GAELMLTRMA KTKSNAEFLA SLGKVGS