Gene B21_03610 details


Gene Information

Locus tag: B21_03610
Symbol: rho
ID: 8113853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3854734
End bp: 3855993
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 50%
IMG OID: 644849774
Product: hypothetical protein
Protein accession: YP_003001347
Protein GI: 251787043
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
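
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are illustrative and not part of the record):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 3854734
END_BP = 3855993


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS of gene_len bp, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1260 419"
```

Both values agree with the Gene Length and Protein Length fields.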


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000108874
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAAT 
ATGGGGCTGG AAAACCTGGC TCGTATGCGT AAGCAGGACA TTATTTTTGC CATCCTGAAG
CAGCACGCAA AGAGTGGCGA AGATATCTTT GGTGATGGCG TACTGGAGAT ATTGCAGGAT
GGATTTGGTT TCCTCCGTTC CGCAGACAGC TCCTACCTCG CCGGTCCTGA TGACATCTAC
GTTTCCCTTA GCCAAATCCG CCGTTTCAAC CTCCGCACTG GTGATACCAT CTCTGGTAAG
ATTCGCCCGC CGAAAGAAGG TGAACGCTAT TTTGCGCTGC TGAAAGTTAA CGAAGTTAAC
TTCGACAAAC CTGAAAACGC CCGCAACAAA ATCCTCTTTG AGAACTTAAC CCCGCTGCAC
GCAAACTCTC GTCTGCGTAT GGAACGTGGT AACGGTTCTA CTGAAGATTT AACTGCTCGC
GTACTGGATC TGGCATCACC TATCGGTCGT GGTCAGCGTG GTCTGATTGT GGCACCGCCG
AAAGCCGGTA AAACCATGCT GCTGCAGAAC ATTGCTCAGA GCATTGCTTA CAACCACCCG
GATTGTGTGC TGATGGTTCT GCTGATCGAC GAACGTCCGG AAGAAGTAAC CGAGATGCAG
CGTCTGGTAA AAGGTGAAGT TGTTGCTTCT ACCTTTGACG AACCCGCATC TCGCCACGTT
CAGGTTGCGG AAATGGTGAT CGAGAAGGCC AAACGCCTGG TTGAGCACAA GAAAGACGTT
ATCATTCTGC TCGACTCCAT CACTCGTCTG GCGCGCGCTT ACAACACCGT TGTTCCGGCG
TCAGGTAAAG TGTTGACCGG TGGTGTGGAT GCCAACGCCC TGCATCGTCC GAAACGCTTC
TTTGGTGCGG CGCGTAACGT GGAAGAGGGC GGCAGCCTGA CCATTATCGC GACGGCGCTT
ATCGATACCG GTTCTAAAAT GGACGAAGTT ATCTACGAAG AGTTTAAAGG TACAGGCAAC
ATGGAACTGC ACCTCTCTCG TAAGATCGCT GAAAAACGCG TCTTCCCGGC TATCGACTAC
AACCGTTCTG GTACCCGTAA AGAAGAGCTG CTCACGACTC AGGAAGAACT GCAGAAAATG
TGGATCCTGC GCAAAATCAT TCACCCGATG GGCGAAATCG ATGCAATGGA ATTCCTCATT
AATAAACTGG CAATGACCAA GACCAATGAC GATTTCTTCG AAATGATGAA ACGCTCATAA
 
Protein sequence
MNLTELKNTP VSELITLGEN MGLENLARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD 
GFGFLRSADS SYLAGPDDIY VSLSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN
FDKPENARNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP
KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVVAS TFDEPASRHV
QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF
FGAARNVEEG GSLTIIATAL IDTGSKMDEV IYEEFKGTGN MELHLSRKIA EKRVFPAIDY
NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFEMMKRS
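
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one possible way to do that, not the method used to produce this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as starts). The names `translate` and `gc_fraction` are illustrative, and only the first 30 bases of the gene are used as sample input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
first_bases = "ATGAATCTTA CCGAATTAAA GAATACGCCG"
print(translate(first_bases))  # prints "MNLTELKNTP"
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing spaces) gives a quick consistency check, and `gc_fraction` can be compared in the same way with the GC content field.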