Gene Nwi_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0100 
Symbolrho 
ID3674300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp120114 
End bp121379 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content59% 
IMG OID637711636 
Producttranscription termination factor Rho 
Protein accessionYP_316720 
Protein GI75674299 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.581004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.035165 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGAAA TGAAACTTCA AGACCTCAAG GCCAAGACTC CAGCCGAACT CGTCTCGTTT 
GCGGAGGAGC TTGGGGTCGA GAACGCCAGC ACCATGCGCA AGCAGGAGCT GATGTTCGCC
ATTCTGAAGC AGCTTGCCAT TCAGGAAACC GACATCATCG GCGAGGGTGT CGTCGAGGTC
CTCTCGGATG GCTTCGGCTT TCTGCGCTCG CCCGATGCCA ACTATCTGCC GGGTCCGGAC
GATATTTACG TTTCACCTTC GCAGATTCGC CGCTTCGGCC TGAGAACCGG CGACACCATT
GAGGGGCACA TCCGCAGCCC CAAGGAAGGC GAGCGCTATT TTGCACTGCT GAAAGTCAAT
ACGCTCAATT TCGAAGACCC CGAGAAGTCG AAGCACAAGG TCAACTTCGA CAACCTGACG
CCCCTGTTTC CCGATCAGCG GTTTCGCCTC GAACTTGAAG ATCCTACCCG CAAGGATTTG
TCGGCGCGGG TCATCGATAT CGTCGCTCCG ATCGGCAAGG GCCAGCGCGC CTTGATCGTG
GCGCCGCCGC GCACCGGCAA GACCGTGCTG ATGCAGAACA TCGCGCACTC CATCACCGCT
AATCATCCGG AGTGCTATCT GATCGTTCTT CTGATCGACG AGCGCCCGGA GGAAGTCACC
GACATGCAGC GCTCGGTGAA GGGTGAAGTC GTATCCTCGA CTTTCGACGA GCCCGCGGTG
CGTCACGTTC AGGTCGCCGA GATGGTGATC GAGAAAGCCA AACGCCTGGT CGAACACGGA
CGCGATGTTG TCATCCTGCT TGATTCGATC ACGCGCCTTG GCCGCGCCTA CAACACCGTG
GTGCCCTCCT CCGGCAAGGT GCTGACCGGC GGCGTCGACG CCAACGCCCT GCAGCGGCCG
AAGCGATTCT TCGGCGCCGC CCGTAATATC GAGGAAGGCG GGTCACTGAC CATCATCGCC
ACCGCGCTGG TCGATACCGG AAGCCGGATG GACGAGGTGA TCTTCGAAGA GTTCAAGGGC
ACCGGCAATT CTGAGTTGAT CCTCGACCGC AAGGTCGCCG ACAAGCGCAC CTTCCCGGCG
ATCGACATCG CCCGCTCCGG AACGCGCAAG GAAGAACTCA TCACCGATCC GCAACTGCTC
AAGAAGATGT ATGTGCTGCG GCGCATCCTC AATCCCATGG GCACCATGGA TGCGATCGAG
TTCCTGCTGG ACAAGCTGCG CAACACCAAG AACAATTCCG AGTTCTTCGA GTCGATGAAT
ACCTGA
 
Protein sequence
MREMKLQDLK AKTPAELVSF AEELGVENAS TMRKQELMFA ILKQLAIQET DIIGEGVVEV 
LSDGFGFLRS PDANYLPGPD DIYVSPSQIR RFGLRTGDTI EGHIRSPKEG ERYFALLKVN
TLNFEDPEKS KHKVNFDNLT PLFPDQRFRL ELEDPTRKDL SARVIDIVAP IGKGQRALIV
APPRTGKTVL MQNIAHSITA NHPECYLIVL LIDERPEEVT DMQRSVKGEV VSSTFDEPAV
RHVQVAEMVI EKAKRLVEHG RDVVILLDSI TRLGRAYNTV VPSSGKVLTG GVDANALQRP
KRFFGAARNI EEGGSLTIIA TALVDTGSRM DEVIFEEFKG TGNSELILDR KVADKRTFPA
IDIARSGTRK EELITDPQLL KKMYVLRRIL NPMGTMDAIE FLLDKLRNTK NNSEFFESMN
T