Gene Dshi_3453 details


Gene Information

Locus tag: Dshi_3453
Symbol: rho
ID: 5712511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3634311
End bp: 3635582
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 58%
IMG OID: 641269382
Product: transcription termination factor Rho
Protein accession: YP_001534787
Protein GI: 159045993
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
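
The coordinates and lengths listed above can be cross-checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3634311
END_BP = 3635582


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1272 423
```

Both results agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields of the record.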


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAAG AGCGTCTCGC CCTTGCCGAT CTCAAAGCTA TGAGCCCTGC TGATCTTCTG 
CCCCTGGCCG AGGAGCTGGA GGTTGAAAAC GCATCCACCA TGCGGAAAGG CGACATGATG
TTCGCCATCC TCAAGGAGCG TGCAGAGGAT GGATGGGAAA TTTCTGGCGA TGGCGTTCTG
GAAGTCATGC AGGACGGGTT CGGGTTTCTC CGCTCCCCCG AGGCGAACTA TCTGCCGGGC
CCGGACGACA TCTATGTTTC TCCCGAGAAA ATTCGGCAGC ACTCCCTGCG CACTGGCGAT
ACCGTCGAAG GGGTGATCCA GGCCCCGCGG GAGAACGAAC GCTATTTCGC GATCACCTCC
GTCAGCCTGG TAAACTTCGA AGACCCGGAG ATGGCACGTC ACAAGGTGAG CTTTGATAAC
TTGACGCCTC TTTATCCGGA CGAGCGGCTG AAGATGGAGA TCGAGGATCC GACGATCAAG
GATCGGTCTG CCCGGATCAT CGACCTGGTG GCACCGATCG GGAAGGGGCA GCGGTCGCTG
ATCGTGGCTC CGCCGCGGAC GGGCAAGACG GTCCTGTTGC AGAATATCGC CAAGAGCATC
GCAGCCAATC ATCCTGAATG CTATCTGATG GTGCTGCTGA TCGATGAACG TCCAGAAGAA
GTCACGGACA TGCAGCGTTC GGTGAAAGGG GAGGTGATTT CCTCCACTTT CGATGAGCCG
GCCTCGCGCC ACGTGGCAGT GTCCGAGATG GTGATCGAGA AGGCCAAGCG CCTGGTGGAG
CACAAGCGGG ACGTGGTTAT CTTGCTGGAT TCGATCACTC GTCTTGGTCG GGCTTTCAAC
ACGGTCGTTC CGTCTTCGGG CAAGGTGCTG ACCGGGGGTG TGGATGCCAA TGCGTTGCAG
CGGCCGAAGC GGTTCTTCGG TGCGGCCCGA AACATCGAAG AAGGCGGGTC GCTGACAATC
ATCGCAACCG CATTGATCGA TACAGGCTCC CGTATGGACG AAGTGATCTT TGAAGAGTTC
AAGGGTACGG GCAACTCCGA GATCGTGCTT GACCGCAAGG TGGCCGACAA GCGGGTCTTC
CCGGCGATGG ACATCCTCAA ATCCGGCACG CGCAAAGAGG ATCTCCTTGT CGATGCCAAG
GATCTTCAGA AGACCTTTGT GCTGCGTCGC ATTCTGAATC CCATGGGGAC CACGGATGCC
ATCGAGTTCC TGATCTCGAA GCTGAAGCAG ACGAAGACCA ATTCCGAGTT CTTCGATTCG
ATGAACACCT GA
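
The GC content field of the record can be recomputed from the gene sequence. The sketch below is one possible way to do so: it strips the spaces and line breaks used in the listing and counts G and C bases. Only the first line of the sequence is used as sample input here, so the printed figure covers that excerpt only; pasting in the full sequence would be needed for a comparison with the reported 58%. The helper names are hypothetical.

```python
def clean(seq: str) -> str:
    """Remove the spaces and newlines used to group the listing into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as listed in the record (excerpt only).
first_line = "ATGACCCAAG AGCGTCTCGC CCTTGCCGAT CTCAAAGCTA TGAGCCCTGC TGATCTTCTG"
print(f"{gc_percent(first_line):.1f}% GC over {len(clean(first_line))} bp")
```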
 
Protein sequence
MTQERLALAD LKAMSPADLL PLAEELEVEN ASTMRKGDMM FAILKERAED GWEISGDGVL 
EVMQDGFGFL RSPEANYLPG PDDIYVSPEK IRQHSLRTGD TVEGVIQAPR ENERYFAITS
VSLVNFEDPE MARHKVSFDN LTPLYPDERL KMEIEDPTIK DRSARIIDLV APIGKGQRSL
IVAPPRTGKT VLLQNIAKSI AANHPECYLM VLLIDERPEE VTDMQRSVKG EVISSTFDEP
ASRHVAVSEM VIEKAKRLVE HKRDVVILLD SITRLGRAFN TVVPSSGKVL TGGVDANALQ
RPKRFFGAAR NIEEGGSLTI IATALIDTGS RMDEVIFEEF KGTGNSEIVL DRKVADKRVF
PAMDILKSGT RKEDLLVDAK DLQKTFVLRR ILNPMGTTDA IEFLISKLKQ TKTNSEFFDS
MNT
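
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string that NCBI uses for translation table 11, which assigns the same amino acids as the standard code and differs only in the permitted start codons. It assumes the reading frame starts at the first base and stops at the first stop codon; alternative start codons are not handled, which does not matter here because the gene begins with ATG. Only the first 30 bases and the final 12 bases of the listing are used as sample input.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... matches the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to group the listing into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence, copied from the listing.
gene_start = "ATGACCCAAG AGCGTCTCGC CCTTGCCGAT"
gene_end = "ATGAACACCT GA"

print(translate(gene_start))  # prints: MTQERLALAD
print(translate(gene_end))    # prints: MNT
```

The two results match the first ten residues and the last three residues of the protein sequence shown above.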