Gene Hhal_2055 details

Gene Information

Locus tag: Hhal_2055
Symbol: rho
ID: 4710011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2258120
End bp: 2259379
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 65%
IMG OID: 639856528
Product: transcription termination factor Rho
Protein accession: YP_001003621
Protein GI: 121998834
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
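
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the CDS ends with a single stop codon; the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2258120
end_bp = 2259379
gene_length_bp = 1260
protein_length_aa = 419


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record: 1260 bp and 419 aa.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the listed values agree: 2259379 - 2258120 + 1 = 1260 bp, and 1260 / 3 - 1 = 419 aa.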


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.224008
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAATCTCA CCGAGCTCAA GAGAAAGCCA GCCACCGAAC TCCTGGAGAT CGCCCAGTCC 
ATGGGCATCG AGGGCACTGC CCGCTCCCGG AAGCAGGACA TCATCTTCGC CATCCTCAAG
GCTCATGCCA AGAACGGCGA TTCCATCTAC GGCGACGGCG TCCTCGAGAT CCTCCAGGAC
GGCTTCGGCT TCCTGCGCTC CGCGGATGCC TCGTACATGG CCGGGCCGGA CGATATCTAC
GTCTCGCCCA GCCAGATCCG CCGGTTCGCA CTGCGCACCG GCGACACCAT CACCGGCAAG
ATCCGCCCCC CGAAGGACGG CGAGCGCTAC TTCGCCCTGC TCAAGGTCGA CCAGATCAAC
TTCGAGCCGC CGGAGGCGGC CAAGAACAAG GTCCTCTTCG AGAACCTCAC TCCGCTGTTC
ACCCGCGATC GCATGCGCAT GGAGCGGGGC AACGGCTCCA CCGAGGACCT CACCGCCCGG
GTCATTGACC TGGTGGCGCC TATCGGCAAG GGCCAGCGCG GGCTGATCGT CTCGCCCCCG
AAGGCCGGCA AGACGATGAT GCTCCAGAAC GTCGCCCAGA GCATCACCTA CAACTACCCG
GAGTGCTACC TCATCGTCCT GCTCATCGAC GAGCGGCCCG AGGAGGTCAC TGAGTTCGCC
CGCTCGGTGC TCAGCGCCGA GACGGTCTCC TCGACCTTCG ACGAGCCGGC CTCGCGCCAC
GTCCAGGTCG CCGAGATGGT CATCGAGAAG GCCAAGCGCC TGGTCGAGCA CAAGAAGGAC
GTGGTCATCC TGCTCGACTC CGTCACCCGC CTGGCGCGCG CCTATAATAC GGTGGTGCCG
TCGTCCGGTA AGGTGCTCAC CGGGGGCGTG GATGCCAACG CCCTGCACCG GCCCAAGCGC
TTCTTCGGTG CCGCGCGCAA CGTCGAAGAG GGCGGCAGCC TCACCATCCT GGCCACCGCC
CTGGTCGAGA CCGGCTCGCG CATGGACGAG GTGATCTACG AGGAGTTCAA GGGCACCGGC
AACATGGAGC TGCACATGGA CCGGAAGATC GCCGAGAAGC GCATCTACCC GGCCATCCAC
CTGAACCGCT CTGGGACCCG GCGCGAGGAA CTCCTGATGA CCCCCGAGGA GCTGCAGAAG
ACCTGGATCC TGCGCAAGCT CCTGCACAAC ATGGACGAGG TGGCCGCCAT CGAGTTCCTC
CTCGACAAGC TCAAGGACAC CAAGACCAAC ACCGAGTTCT TCGAGGCCAT GAAACGCTGA
 
Protein sequence
MNLTELKRKP ATELLEIAQS MGIEGTARSR KQDIIFAILK AHAKNGDSIY GDGVLEILQD 
GFGFLRSADA SYMAGPDDIY VSPSQIRRFA LRTGDTITGK IRPPKDGERY FALLKVDQIN
FEPPEAAKNK VLFENLTPLF TRDRMRMERG NGSTEDLTAR VIDLVAPIGK GQRGLIVSPP
KAGKTMMLQN VAQSITYNYP ECYLIVLLID ERPEEVTEFA RSVLSAETVS STFDEPASRH
VQVAEMVIEK AKRLVEHKKD VVILLDSVTR LARAYNTVVP SSGKVLTGGV DANALHRPKR
FFGAARNVEE GGSLTILATA LVETGSRMDE VIYEEFKGTG NMELHMDRKI AEKRIYPAIH
LNRSGTRREE LLMTPEELQK TWILRKLLHN MDEVAAIEFL LDKLKDTKTN TEFFEAMKR
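
As a quick consistency check, the two ends of the gene sequence can be translated and compared with the two ends of the protein sequence. The sketch below is not the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs from the standard table only in its permitted start codons); `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code in NCBI ordering (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with the record's spacing.
print(translate("ATGAATCTCA CCGAGCTCAA GAGAAAGCCA"))  # MNLTELKRKP

# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GCCATGAAACGCTGA"))  # AMKR
```

Both results match the corresponding ends of the protein sequence listed above (`MNLTELKRKP` at the start, `AMKR` at the end).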