Gene Hhal_2201 details

Gene Information

Locus tag: Hhal_2201
Symbol:
ID: 4709549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2415278
End bp: 2416375
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 63%
IMG OID: 639856676
Product: hypothetical protein
Protein accession: YP_001003767
Protein GI: 121998980
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
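
The start and end coordinates, gene length, and protein length listed above can be cross-checked against each other. The following is a minimal sketch of that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon; neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2415278
end_bp = 2416375

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1098 0 365
```

Under those assumptions the result agrees with the listed gene length of 1098 bp and protein length of 365 aa.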


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000750597
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAGCTGA TTCGTAGCTG GCTGCGACGC ACATTCAACG ACCCTCAGAT CGCTGCCTTC 
ATCGTCTTGA TGGTGGTGGG CCTCGGGGCG CTGCTGATGC TCGGCAGCAT TCTCGCCCCG
GTGATCGCCG CCGTGGTGAT CGCCTACTTG CTCGAAGGGG TGGTGAGGGG TTTCGAACGC
GCCGGGGTGC CCCGGATGCT GGCGGTGGTG ATCGTCATCC TGTTCCTCAC CACCTTCCTG
GTCCTGGTGC TCTTTGCGCT GATCCCGTTG CTCTACCGCC AGGTGGGCCA GCTGGTGGAT
CAGCTGCCGG CGATCCTCGC CCAGGGGCAG ATGCTGCTGC TGCAGTTGCC CGAGCATTAC
CCGCAGCTCT TCTCCGAGGC GCAGATCCGC GAGATGCTCG ATACCGCCCG GCGGGAGATC
ACCGACCTGG GGCAGCGGGT GGTGGCCTCG GTGACGGTTC AGTCGCTGAT GATCCTCGGC
ACGCTGGTGA TCTACGCGGT GCTGGTGCCA TTTTTGGTCT TCTTCCTGCT CAAGGACAAA
CGGCTGTTGC TGCAGTGGGT CAGCAACCAT ATGCCCCGCC ACCGTGCCTT TGCCTCGGAG
GTGTGGCTGG ACGTCGATCA GCAGATCGGC AACTACGTCC GCGGCAAGTT CATCGAGATC
CTGATCGTCT GGGCGGTCAC GTACATCACC TTCTCCCTGT TGGGGGTGCC GTTTGCCATG
CTGCTCGCGG TGGCCACCGG CTTGTCGGTG ATCATCCCCT ATGTCGGGGC CTTCGTGATG
ACCGTGCCGG TGGCGCTGAT CGCCTACTTC CACTTCGGGG TGAGCCAGGA GCTGGTCTAC
GTCCTGGTGG CCTACACCAT CATCCAGGTG CTCGACGGCA ACGTCCTGGT GCCGCTGCTC
TTCTCCGAGG TGGTGAACCT CCACCCGGTG GCGATCATCG TCTCGATCCT GGTCTTCGGC
GGGATCTGGG GATTCTGGGG GATCTTCTTC GCCATCCCGT TGGCCACCTT TATCCAGGCG
ATCATCAAGG CGTGGGTGCG GCGCCGCAAG CCGCCGGATG ACGAATCCGC AGGGGTCGAG
GAGGAGCTGG TCCCCTGA
 
Protein sequence
MELIRSWLRR TFNDPQIAAF IVLMVVGLGA LLMLGSILAP VIAAVVIAYL LEGVVRGFER 
AGVPRMLAVV IVILFLTTFL VLVLFALIPL LYRQVGQLVD QLPAILAQGQ MLLLQLPEHY
PQLFSEAQIR EMLDTARREI TDLGQRVVAS VTVQSLMILG TLVIYAVLVP FLVFFLLKDK
RLLLQWVSNH MPRHRAFASE VWLDVDQQIG NYVRGKFIEI LIVWAVTYIT FSLLGVPFAM
LLAVATGLSV IIPYVGAFVM TVPVALIAYF HFGVSQELVY VLVAYTIIQV LDGNVLVPLL
FSEVVNLHPV AIIVSILVFG GIWGFWGIFF AIPLATFIQA IIKAWVRRRK PPDDESAGVE
EELVP
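
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of several alternative initiation codons and is read as methionine at the first position. The sketch below translates the first and last stretches of the gene sequence to illustrate this. It is not the database's own pipeline: `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical names, and the codon assignments are the standard ones that table 11 shares with table 1.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses these assignments; it differs from table 1 only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("TTGGAGCTGA TTCGTAGCTG GCTGCGACGC"))  # MELIRSWLRR
print(translate("GAGGAGCTGG TCCCCTGA"))               # EELVP
```

Both outputs match the corresponding ends of the protein sequence. Note that `translate` applies the start-codon rule to the first codon of whatever string it is given, so a fragment taken from the middle of the gene should only be passed in when its first codon is not in `START_CODONS`, as is the case for GAG here.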