Gene Nmul_A2085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2085 
Symbolrho 
ID3786089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2377975 
End bp2379234 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content54% 
IMG OID637812174 
Producttranscription termination factor Rho 
Protein accessionYP_412771 
Protein GI82703205 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTAT CCGACCTCAA AAACTTCCAC GTCACCGAAC TGGTGGAAAT GGCAATTGCC 
AATGAAATTG ACGGTGCTAA CCGGTTACGG AAACAGGATC TGATTTTCGC ATTATTGAAG
AATCAGGCAC GGAAAGGCGA AAGCATTTTC GGCGAAGGCA CCTTGGAAGT GCTGCAGGAT
GGTTTTGGCT TCTTGCGTTC TCCCGACACC TCATATCTCG CCGGTCCGGA TGACATTTAT
GTCTCACCCA GCCAGATACG GCGCTTCAAC CTTCATACGG GGGATTCGAT AGAGGGCGAA
ATAAGGACGC CCAAGGACGG GGAGCGTTAT TTCGCGCTGG TCAAGGTCGA TAAGGTCAAC
GGAGAACCTC CTGAAAACTC CAAACACAAG ATACTGTTTG AAAATCTCAC CCCCCTCTTT
CCCACCGAGC GGCTGCAGCT CGAACGCGAG ATCAAGGCCG AGGAAAACGT TACCAGTCGC
ATCATCGATC TGATCGCCCC CATCGGCAAA GGACAGCGGG GCCTCCTGGT AGCCAGTCCC
AAATCAGGCA AGACGGTCAT GCTTCAGCAC ATCGCCCACG CCATTGCCGC CAATTACCCG
GATGTGATGC TGATGGTGCT GCTGATAGAC GAGCGTCCCG AAGAAGTGAC CGAGATGATC
CGGTCGGTAA GAGGTGAAGT GATTTCTTCC ACTTTCGATG AGCCGGCTGT ACGTCACGTG
CAAGTGGCGG ATATGGTCAT CGAAAAAGCC AAACGGTTGG TGGAACACAA GAAGGACGTG
GTAATCCTGC TGGATTCGAT CACCCGGCTC GCGCGCGCCT ATAACACAGT GGTACCTGCT
TCCGGCAAGG TGCTGACGGG GGGTGTGGAT GCCAACGCGC TGCAACGGCC GAAGCGGTTT
TTTGGGGCTG CTCGCAATAT CGAGGAGGGT GGCTCTCTCA CCATTATTGC CACCGCGCTG
GTCGACACCG GCTCGCGCAT GGACGATGTG ATTTATGAGG AATTCAAGGG TACCGGCAAC
ATGGAAATCC ATCTTGACCG GCGCATGGCA GAAAAACGCA TTTATCCCGC CATCAACGTC
AACCGTTCCG GCACCCGCCG GGAAGAACTC CTGATCAAGC CCGATGTATT GCAGAAAATC
TGGGTACTGC GCAAGCTGCT CTACCCGATG GATGACATGG AGGCAATGGA ATTTCTGCTC
GACAAGATCA AGGCAACGAA GAACAACGCG GATTTCTTCG ATTCAATGCG GCGGGTCTAA
 
Protein sequence
MHLSDLKNFH VTELVEMAIA NEIDGANRLR KQDLIFALLK NQARKGESIF GEGTLEVLQD 
GFGFLRSPDT SYLAGPDDIY VSPSQIRRFN LHTGDSIEGE IRTPKDGERY FALVKVDKVN
GEPPENSKHK ILFENLTPLF PTERLQLERE IKAEENVTSR IIDLIAPIGK GQRGLLVASP
KSGKTVMLQH IAHAIAANYP DVMLMVLLID ERPEEVTEMI RSVRGEVISS TFDEPAVRHV
QVADMVIEKA KRLVEHKKDV VILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALQRPKRF
FGAARNIEEG GSLTIIATAL VDTGSRMDDV IYEEFKGTGN MEIHLDRRMA EKRIYPAINV
NRSGTRREEL LIKPDVLQKI WVLRKLLYPM DDMEAMEFLL DKIKATKNNA DFFDSMRRV