Gene Bind_3567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3567 
Symbolrho 
ID6200876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp4046230 
End bp4047498 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content55% 
IMG OID641707523 
Producttranscription termination factor Rho 
Protein accessionYP_001834613 
Protein GI182680467 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.235782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.891828 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGAAA TCAAGCTGCA AGACTTGAAA CTCAAGTCGC CGACCGAATT ATTGGCCTTT 
GCCGAGGAAC ACGAGGTGGA AAACGCCTCG ATCATGCGCA AACAGGAATT GATGTTTGCG
ATCCTCAAAC AATTGGCGAG CCGTGAGGTC GAGATCATCG GCGAGGGCGT CATCGAGGTC
CTGCAGGACG GCTTTGGCTT TCTGCGTTCG CCGGAAGCCA ATTATCTCGC GGGTCCCGAC
GATATTTATG TCTCTCCCTC GCAAATACGG CGCTTTGGCC TGCGCACCGG TGACACCGTC
GAGGGGTTGA TCCGCAGCCC CAAGGAAGGC GAACGTTATT TTGCGCTGCT CAAGGTCAAT
ACGATCAATT TCGAGGACCC GGAGAAAATC CGCCACAAGG TCCATTTCGA CAATCTGACG
CCGCTCTATC CTGATGAAAG GCTGAAGCTC GAAATCGATG ATCCGACCAA GAAGGATCTT
TCGGCGCGCG TCATCGATAT TGTCGCGCCA ATCGGCAAGG GCCAGCGCGC TTTGATCGTT
GCCCCGCCAC GCACCGGTAA AACCGTGCTG TTGCAGAATA TCGCGCAATC CGTGACCGCC
AATCATCCCG AATGCTATCT GATCGTTCTG CTGATCGATG AAAGGCCGGA AGAAGTCACC
GATATGCAGC GTTCGGTGAA GGGCGAGGTC ATTTCCTCCA CCTTTGACGA GCCGGCGGTG
CGCCATGTTC AGGTCGCCGA AATGGTGATC GAAAAAGCCA AAAGATTGGT GGAGCATGGC
CGCGACGTGG TGATCCTTCT GGATTCCATC ACGCGTTTGG GTCGCGCTTA TAATACGGTC
GTGCCGTCAT CGGGCAAGGT TCTGACCGGC GGTGTTGACG CCAATGCCTT GCAACGTCCG
AAACGGTTTT TTGGTGCCGC CCGTAATATC GAGGAAGGCG GTTCGCTGAC CATTATCGCG
ACGGCCCTGA TCGATACGGG TTCGCGCATG GATGAAGTGA TTTTCGAGGA ATTCAAAGGA
ACCGGCAATT CGGAAATTAT CCTCGACCGC AAGGTGGCCG ACAAACGCGT GTTCCCGGCG
ATCGATATCA CGCGCTCCGG CACCCGCAAG GAAGAGCTGC TGGTGCCAAC CGATGTTTTG
AAGAAAATGT ATGTATTGCG GCGTATCCTC AACCCAATGG GTACGGTCGA TGGCATCGAG
TTCCTGCTCG GCAAATTGCG CGAGACCCCA AAGGGCAATG CTACTTTCTT CGAGGCCATG
AATACATAA
 
Protein sequence
MREIKLQDLK LKSPTELLAF AEEHEVENAS IMRKQELMFA ILKQLASREV EIIGEGVIEV 
LQDGFGFLRS PEANYLAGPD DIYVSPSQIR RFGLRTGDTV EGLIRSPKEG ERYFALLKVN
TINFEDPEKI RHKVHFDNLT PLYPDERLKL EIDDPTKKDL SARVIDIVAP IGKGQRALIV
APPRTGKTVL LQNIAQSVTA NHPECYLIVL LIDERPEEVT DMQRSVKGEV ISSTFDEPAV
RHVQVAEMVI EKAKRLVEHG RDVVILLDSI TRLGRAYNTV VPSSGKVLTG GVDANALQRP
KRFFGAARNI EEGGSLTIIA TALIDTGSRM DEVIFEEFKG TGNSEIILDR KVADKRVFPA
IDITRSGTRK EELLVPTDVL KKMYVLRRIL NPMGTVDGIE FLLGKLRETP KGNATFFEAM
NT