Gene Ent638_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4003 
Symbolrho 
ID5110468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4340554 
End bp4341813 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content50% 
IMG OID640494221 
Producttranscription termination factor Rho 
Protein accessionYP_001178709 
Protein GI146313635 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.499603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.016258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATTACTCT CGGCGAAAAC 
ATGGGCCTTG AAAACCAGGC TCGTATGCGC AAGCAGGACA TCATTTTTGC CATCCTGAAG
CAGCATGCTA AGAGTGGCGA AGATATCTTT GGCGACGGTG TACTGGAGAT ATTGCAAGAC
GGATTTGGTT TCCTCCGTTC TGGAGACAGC TCCTACCTCG CCGGTCCTGA TGACATCTAC
GTATCCCCTA GCCAAATCCG CCGTTTCAAC CTCCGTACTG GTGACACCAT TTCAGGTAAG
ATTCGTCCTC CTAAAGAGGG TGAACGCTAC TTTGCGCTGT TGAAAGTTAA CGAAGTTAAC
TACGATAAAC CTGAAAACTC GCGCAATAAG ATCCTGTTTG AAAACTTAAC GCCGCTGCAC
GCGAACTCTC GCCTGCGCAT GGAGCGTGGT AACGGTTCTA CCGAAGACCT GACGGCTCGC
GTTCTGGATC TGGCGTCTCC AATTGGTCGT GGTCAGCGTG GCCTGATCGT GGCACCACCG
AAAGCAGGTA AGACCATGCT GCTGCAGAAC ATCGCGCAGA GCATTGCTTA CAACCATCCT
GATTGCGTAC TGATGGTTCT GCTGATCGAT GAGCGTCCAG AAGAAGTAAC AGAGATGCAG
CGTCTGGTGA AAGGTGAAGT GATTGCATCT ACCTTTGATG AGCCAGCCTC TCGCCACGTT
CAGGTTGCTG AAATGGTTAT CGAGAAAGCT AAGCGTCTGG TCGAGCACAA GAAAGACGTT
ATTATTCTGC TCGACTCCAT CACTCGTCTG GCGCGTGCTT ACAACACCGT AGTTCCTGCT
TCCGGTAAAG TACTGACCGG TGGTGTGGAT GCGAACGCAT TACACCGTCC GAAGCGTTTC
TTTGGTGCCG CGCGTAACGT TGAAGAGGGA GGAAGCCTGA CGATTATCGC AACCGCTCTG
GTTGATACCG GCTCTAAAAT GGATGAAGTT ATCTACGAAG AGTTTAAAGG CACCGGTAAC
ATGGAGCTGC ACCTGGCACG TAAAATCGCC GAGAAGCGCG TCTTCCCAGC GATTGATTAC
AACCGTTCAG GGACGCGTAA AGAAGAGCTG CTCACCACTC AGGAAGAGCT GCAGAAAATG
TGGATCCTGC GTAAGATCAT TCACCCGATG GGCGAAATCG ACGCAATGGA GTTCCTCATC
AATAAGCTGG CAATGACAAA GACCAACGAT GATTTCTTCG ACATGATGAA ACGCTCGTAA
 
Protein sequence
MNLTELKNTP VSELITLGEN MGLENQARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD 
GFGFLRSGDS SYLAGPDDIY VSPSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN
YDKPENSRNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP
KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVIAS TFDEPASRHV
QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF
FGAARNVEEG GSLTIIATAL VDTGSKMDEV IYEEFKGTGN MELHLARKIA EKRVFPAIDY
NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFDMMKRS