Gene Sde_3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3656 
Symbolrho 
ID3966613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4636973 
End bp4638232 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content48% 
IMG OID637922753 
Producttranscription termination factor Rho 
Protein accessionYP_529123 
Protein GI90023296 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.04416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.624434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTA CCGACTTAAA ATCAAAACCC ATTGAAGAAC TTATCGATCT TGCAAAAGAA 
ATGGGCATGG AGAGCCTAGC GCGCTCCCGC AAACAAGACG TCATCTTCAA CATTCTTAAA
CGCCACGCCC GCAGCGGTGA AGATATCTAC GGCGACGGTG TATTAGAAAT ACTGCAAGAT
GGCTTCGGCT TTTTGCGCTC TGCAGGCGCT TCTTACCTAG CTGGCCCAGA CGATATTTAC
GTTTCCCCCA GCCAAATTCG CCGCTTTAAC TTGCGCACCG GCGATACCAT AGCGGGGAAA
ATTCGCCCCC CTAAAGAAGG CGAACGTTAT TTTGCCCTTC TTAAAGTAAA CGAAATTAAT
TTCGACAAGC CAGAAAACTC CCGCAACAAA ATTCTCTTCG AAAACCTTAC TCCGCTATTC
CCTCAGGAAC GCCTCGAGTT AGAAACCGGT AACGGCTCTA CCGAAGACTT AACCGGCCGC
ATCATCGACC TAGTAAGCCC CATTGGTAAA GGTCAGCGCG GTTTGATTGT TGCACCACCT
AAAGCTGGTA AAACCATCAT GATGCAGAAT ATTGCGCAGG CTATTACGCG CAATAACCCT
GAGTGCCATT TAATTGTTTT ACTTATCGAC GAACGCCCAG AAGAAGTAAC CGAAATGCAG
CGCTCTGTGC GCGGCGAGGT GGTTGCCTCT ACCTTCGACG AACCACCTTC GCGCCACGTA
CAAGTAGCCG AAATGGTTAT TGAACGTGCT AAGCGTTTAG TTGAGCATAA AAAAGATGTA
ATTATTCTGC TCGACTCCAT CACTCGTTTG GCGCGCGCTT ACAACACCGT TATTCCTTCA
TCAGGTAAAG TACTTACCGG TGGTGTTGAT GCCCACGCAC TAGAGCGCCC AAAGCGTTTC
TTCGGTGCTG CGCGTAACAT TGAAGAAGGC GGCAGCTTAT CTATTGTGGC TACCGCACTT
ATCGATACCG GCTCTAAAAT GGATGAAGTT ATCTACGAAG AGTTTAAAGG TACCGGTAAC
ATGGAACTGC ACCTCGACCG CAAAATTGCC GAGCGTCGCG TTTACCCTGC CATTAACATC
CGTCGCTCCG GCACCCGTCG CGAAGACCTG TTAATGAAAG AAGAGGAACT ATCTCGCGTA
TGGATTCTGC GCAAACTCCT CCACGATATG GAAGACGTAG CCGCCACAGA ATTCCTTTCC
GACAAGCTAA AAGACTTCAA AACTAACAAC GAGTTTTTCT TGTCTATGCG CTCGAAGTAA
 
Protein sequence
MNLTDLKSKP IEELIDLAKE MGMESLARSR KQDVIFNILK RHARSGEDIY GDGVLEILQD 
GFGFLRSAGA SYLAGPDDIY VSPSQIRRFN LRTGDTIAGK IRPPKEGERY FALLKVNEIN
FDKPENSRNK ILFENLTPLF PQERLELETG NGSTEDLTGR IIDLVSPIGK GQRGLIVAPP
KAGKTIMMQN IAQAITRNNP ECHLIVLLID ERPEEVTEMQ RSVRGEVVAS TFDEPPSRHV
QVAEMVIERA KRLVEHKKDV IILLDSITRL ARAYNTVIPS SGKVLTGGVD AHALERPKRF
FGAARNIEEG GSLSIVATAL IDTGSKMDEV IYEEFKGTGN MELHLDRKIA ERRVYPAINI
RRSGTRREDL LMKEEELSRV WILRKLLHDM EDVAATEFLS DKLKDFKTNN EFFLSMRSK