Gene Haur_3377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3377 
Symbolrho 
ID5735238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4256731 
End bp4257981 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID641280524 
Producttranscription termination factor Rho 
Protein accessionYP_001546141 
Protein GI159899894 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000542218 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCAACC TTGCTGATTT AGAAAACAAA ACGTTGAGCG ACCTCCAAGA AATGGCTCGG 
GAACTCGATA TCTCTGGCTA TAGTCGCCTC AAGAAACAAG ACCTCATTTA CAAATTAATT
CAGGCTCAAA CTGAACAAGC AGGCAATATT TTCAATACGG GGATTCTCGA TATCGTTTCT
GACGGTTTCG GGTTTCTGCG CAGCGACCGT ATGTTGCCTG GCCCCGATGA TGTGTATGTC
TCGCAAACTC AAATTCGCCG CTTTGGCCTC CGTACTGGCG ACCGCATCTC CGGCCAGATT
CGTCCTCCCA AAGAAAGTGA ACGCTATTAT AGTTTGCTGC GGGTTGAATT AGTGAATGGT
ATGGACCCTG AGCAAGCTCG TAAGCGCCCC CATTTTGAAA AATTGACCCC GATTTTTCCC
AATGAACGCT TTATTTTGGA GACTGAGCCG CAAATTCTTT CAACTCGTTT GGTTGATTTG
ATTGCGCCAA TCGGTCGCGG TCAGCGTGGG CTGCTGGTTT CGCCCCCCAA AGCTGGTAAA
ACGATGCTGA TGAAGGCGAT TGCTAATGGC ATCACCACCA ATTATCAAGA TGCCCATTTG
ATGGTCTTGT TGATTGGCGA GCGTCCCGAA GAAGTCACCG ATATGCGCCG CTCAGTCCGC
GGTGAGGTGA TTGCTTCGAC CTTCGATGAG CCAGTTGAAG ATCATACCAA AGTTTCTGAA
ATGACCCTCG AACGAGCTAA GCGCTTGGTC GAAGGCGGCC AAGATGTGGT GATCTTGATG
GACTCTATCA CCCGTTTGGC ACGGGCTTAC AACTTGGATA TGCCACCATC AGGCCGAACC
TTGACTGGTG GGATCGACCC AGTGGCCTTG TACCCTCCAA AACGCTTCTT TGGTGCTGCC
CGTAACATCG AAGGCGGCGG CTCGTTGACA ATCATTGCTA CCTGTTTGGT CGATACTGGT
AGCCGTATGG ACGACGTGAT TTACGAAGAA TTCAAGGGTA CTGGCAACAT GGAATTGCAC
TTGGATCGGC GCTTGGCTGA ACGCCGCACC TATCCGGCAG TCGATATTGC CCGCTCTTCA
ACCCGTCGCG ATGAGTTGTT GCTCTTGCCT GAGCAATTGC GCCAAGTTTG GACGTTGCGC
CGTATGGTCA GTATGTTGGG CGAAAACGAA GGCACTGAAT TGGTGCTGAC GCGCATGTCC
AAAACCCGTA CCAACGACGA GTTCTTGCTC ACTCTCAACA AGAGCCTCTA G
 
Protein sequence
MVNLADLENK TLSDLQEMAR ELDISGYSRL KKQDLIYKLI QAQTEQAGNI FNTGILDIVS 
DGFGFLRSDR MLPGPDDVYV SQTQIRRFGL RTGDRISGQI RPPKESERYY SLLRVELVNG
MDPEQARKRP HFEKLTPIFP NERFILETEP QILSTRLVDL IAPIGRGQRG LLVSPPKAGK
TMLMKAIANG ITTNYQDAHL MVLLIGERPE EVTDMRRSVR GEVIASTFDE PVEDHTKVSE
MTLERAKRLV EGGQDVVILM DSITRLARAY NLDMPPSGRT LTGGIDPVAL YPPKRFFGAA
RNIEGGGSLT IIATCLVDTG SRMDDVIYEE FKGTGNMELH LDRRLAERRT YPAVDIARSS
TRRDELLLLP EQLRQVWTLR RMVSMLGENE GTELVLTRMS KTRTNDEFLL TLNKSL