Gene Dgeo_1103 details


Gene Information

Locus tag: Dgeo_1103
Symbol: rho
ID: 4058973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 1173619
End bp: 1174896
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 65%
IMG OID: 641230119
Product: transcription termination factor Rho
Protein accession: YP_604570
Protein GI: 94985206
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
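
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1173619, 1174896)
print(length)                  # 1278
print(protein_length(length))  # 425
```

Under these assumptions the computed values agree with the listed gene length (1278 bp) and protein length (425 aa).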


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0575951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00481193
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACCGAAC CCCAGACCGT TCCCCTTCCC TTTCACGAAC TCCAGCAGAA GATCCTGCCG 
GAGCTGCATC TGATCGCCGC TCAGAACGGC ATCGAGAACT ACCGCAAGCT GAAAAAGGAG
GCTTTGGCCC TCGCCATTCT GGAGCGGCAG GCGGAAGCCG AGGGGCAGGT GCTGGCGCGC
GGCTTCCTCG ACATCAGCCC CGACGGCTAC GGATTTCTCC AGTCAGACCT CCTGGACCCC
GCCTCGAGAA GCGTGCTGGT CACGGCTGGG GTGATCAAGC AGTTTCACCT GCGCACCGGC
GACGAGGTGA TTGGCCGCGC GCGTCGCCCA CGCGAGAACG AGCGCTACGG CACGCTGGTG
CAGGTGGAGG CAGTCAATGG CCTGGATCCG GAGACAGCGC GCAAGCGTCC GCGGTTTGAC
GACCTGACGC CCACCTTTCC AGACCGGCAA TTGGTGCTGG AAGACCCGCA GATGGATGAC
GGCCTCTCGC TGCGGGTGGT GGACCTGCTC GTGCCTATCG GGCGCGGGCA GCGGGCGCTG
ATTGTCGCGC CGCCCAAAGC CGGCAAGACG ACCCTGCTCA AAAAGATTGC CAACTCGATC
GTCAAGAACT ATCCCGACGT GACGGTGATG GTGCTGCTGG TCGACGAACG CCCCGAGGAG
GTCACCGACT TTCGCGAGAG CGTGCAGGGC GCGCAGGTGA TCGCCTCCAC CTTCGACGAG
CCGCCGCAGC ACCACGTGCG CGTGGCAGAG TTTGTGCATG AACGTGCCCG GCGCATCGTG
GAGGAAGGCG GCCACGTGGT GATTCTGCTC GACTCGATCA CCCGCTTGGC GCGGGCGAAC
AACCTGGTGA CGCCGCCCAC GGGCCGCACC CTCTCGGGGG GGCTGGACTC CAATGCGCTG
CACTGGCCCA AGCGCTTCCT GGGTGCGGCC CGCAACATCC GCGAGGGGGG TTCGCTCACC
ATCTTGGCGA CCGCGCTGGT CGAGACCGGC TCGCGCATGG ACGACGTGAT CTTTGAGGAA
TTCAAGGGCA CCGGCAATGC CGAACTGGTG CTTTCGCGCC GCCTGGAGGA GCGCCGCATC
TTCCCTGCGC TCGACATCCT GAAGTCCGGC ACCCGCCGCG AGGAACTGCT GCTGCAGCCG
GAAGTCCTGA AGAAGATGTG GCTCCTGCGC AAGGTGATCA GCGATATGGA TCCCGCCGAC
GCGATGGAGA TGCTGCTCTC GCGCATGGGC AAGACGCGCA ACAACGTCGA ATTCCTGCAG
TCGTTGGCGG GCGGCTGA
 
Protein sequence
MTEPQTVPLP FHELQQKILP ELHLIAAQNG IENYRKLKKE ALALAILERQ AEAEGQVLAR 
GFLDISPDGY GFLQSDLLDP ASRSVLVTAG VIKQFHLRTG DEVIGRARRP RENERYGTLV
QVEAVNGLDP ETARKRPRFD DLTPTFPDRQ LVLEDPQMDD GLSLRVVDLL VPIGRGQRAL
IVAPPKAGKT TLLKKIANSI VKNYPDVTVM VLLVDERPEE VTDFRESVQG AQVIASTFDE
PPQHHVRVAE FVHERARRIV EEGGHVVILL DSITRLARAN NLVTPPTGRT LSGGLDSNAL
HWPKRFLGAA RNIREGGSLT ILATALVETG SRMDDVIFEE FKGTGNAELV LSRRLEERRI
FPALDILKSG TRREELLLQP EVLKKMWLLR KVISDMDPAD AMEMLLSRMG KTRNNVEFLQ
SLAGG
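
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC fraction calculation and a basic CDS sanity check. The helper names are hypothetical, and the start codon set is only a common subset of the initiators allowed by translation table 11.

```python
# Stop codons of NCBI translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Common table 11 start codons (assumption: rarer alternative starts are ignored).
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the ends are start/stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


first_line = "GTGACCGAAC CCCAGACCGT TCCCCTTCCC TTTCACGAAC TCCAGCAGAA GATCCTGCCG"
last_line = "TCGTTGGCGG GCGGCTGA"
print(len(clean_sequence(first_line)))                # 60
print(clean_sequence(first_line)[:3])                 # GTG
print(clean_sequence(last_line)[-3:] in STOP_CODONS)  # True
```

The gene starts with GTG, yet the protein starts with M, because an initiator codon is translated as methionine regardless of its usual amino acid. Passing the full joined gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.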