Gene Hhal_2278 details


Gene Information

Locus tag: Hhal_2278
Symbol:
ID: 4709469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2499555
End bp: 2501993
Gene Length: 2439 bp
Protein Length: 812 aa
Translation table: 11
GC content: 70%
IMG OID: 639856754
Product: hypothetical protein
Protein accession: YP_001003844
Protein GI: 121999057
COG category:
COG ID:
TIGRFAM ID:
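
The coordinate and length fields in this record are related by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace in the input is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length = gene_length(2499555, 2501993)
print(length, protein_length(length))  # 2439 812
```

Applying `gc_fraction` to the full gene sequence listed under "Sequence" gives the value that the "GC content" field reports in rounded form.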


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCCGAC TCCCCCGTCC CCACGCCTTG CTCACTGCCA GTGCCGCGGC GGCCCTGATG 
GCCAGCGGCT GCGCTGCCCT CGATGAGGAG TACGAGGCCG CCCTACTGCT CAGCGACCTA
GGCGCCGAGG GCCACGAAAA GACCCGCCTG CAGGATCGCG GCGGTGATCC GGAAGTGACC
TCCGTCCGCT ACGACAAGGG CCAGGGTGAG CGCGAGGCCG ACCTCTACCG GCCCGACGGC
GAGGCCCGCG GGAACCTGGT GCTGATCCAC GGGCTGACCG AGCAGGGCAA GGACGATCGC
CGACTGGTGG AGATGGCCGA GAGCCTGACC CGGGTCGGTT ACCGGGTCCT GGTGCCCGAG
GTAGAGGAGC TCCGTGAGCT GCGCTGGGGG CCGGAGAATC GCGACGACGT GGCTGATGCG
GTGCGCCATA TGGCCAACCG GGAGGCGAGC CGCGGTCTGC CCCTGGGGGT CAGCACCCTG
AGCCTGATGA GCGGCCCGGT GCTGCTCGCC ACCGCCGATG ACGAATTGCG TCGGCGGGTC
GAGTTCGTCG CCCTGATTGG CGGCTATTAC GACATCGAGG CGTGGCTGCG CTACGTGACC
ACCGGTCATG ACCCGCTGGC GGATGCCCGG GACGACCCGG AGCCCCGCCC GGAAAGCCGC
TGGGTCCTCC TGCAGGCGCT GGCGGCCCGG GTGGACGAGG ACGACCGGGA GTGGATGCAG
CGCATCGCCG AACGCCGCCT GGACGATCCC GAGGCAGATA TCACCGCCGA ACGGGCGGCG
CTGGGCGAGG AGGCAGCCGC CCTGGTGGAT CTGCTGGTCA ACGACGACCC GGAGGCCTTC
GATGACGACC TGGCGGAGAT CCCCGCGGCG TACCGCGAGG CGCTGGACGA GCTGGATCCC
TCCCGCCACG ACTGGTCGGA CTACCACCCG GAGTTGCTGC TGATCCACGG CACCAACGAC
CCGGTGGTGC CCTTCTCCCA CAGTGAGGCG CTGGAGGCCG CTGCCCCGGA TGCGCACCTG
TACCGCAGCG CCGGACTCGC CCATGTGGAC CTGGAGGGCG GACTGTTCGA CAGCGTCCGG
CTCTGGCGGG CGGCCTCGGC GCTGCTGGAC GTCCGCCTGG ACCCGGAGCA GCCCCTGGCA
TCGGACCCGG GACGGAATGT CTACATCCCG CCGCGCGGTG AACTCGAGCG CACCCTGCGC
GTGGGCGCCG TCTACAACGA TGACGAGATC CAGATCCGTT ACGAGTTCGC CACCGAGGCC
CCGTCCTGGT ACCACCAATA CTGGATCTAC GAGGTGGAGG GGCGCGGGAC CGGCGGCGAG
TGGGTGCAAT ACGGCAGTGG CGGCCCGGAG CCCGATGAGC ACGGCCTCTA CGAGGATCGC
ATCTCCATGC TCCTCGACGA TGGGGATCTG GGGCTCGATC GCTACGGCGG CTTCATGACC
GCCCACGAGG GCATGCGCGG GCTCACCGGG GCCGCCGAGT CCGAGGAGGT CGAGGCCCAT
CCCCACCTGG GCGAGACCCT GGGGCGCTCG GACGTGCGCA AGTACATCCC CCAATCGCGG
GAGGACGGGG ACAACGGTGG CGCGGACTGG CAGGACGTGC GTGATGAGGC CGAGCTCGAG
GCCATGCGCG AGCGGGGCGA GTTCCTCGAC CTCTGGCAGT GGCGCGCCCA CCGTTCCCAC
CCGCTGGGCT ACGCCGACAA CGGCTACGTC CTGGAGTACC GCCACAGCTC CGAGGGGCGG
GGGATGTTCA CCGACAACTG GGACGACGAG GCCGATCAGC CCCGTTGGAT GTACGACCCG
GACGAGGCGG GCTTCCGGGC CCTGGAGCGC GACCGGCTAC TGGATGGTGC CTATGACCAG
GACGATCTCT ACTACCTGAG CGAGGGGCAC GCCACCGACT TCGATCCGGA CCACCACTGG
GAGGACGGCG ACGTCCTGCC CCAGCGCTTC CTGCAGGAGC CCGATGGGAG CCGTGGGGCG
ATCCGCGCCG CTGGCGGCTA TGAGGACGGT GCCTGGCGGG TGCGCCTGAC CCGCTCGCTC
GAGGCGCCGG AGCCCACCGA TAGCCACACC CTGGAGCCGG GCGGGGTCTA CGACGTGGCC
TTTGCCGTTC ACGAGGGCGT GGGCCAGCGC TGGCACCGGG TCTCACTGCC GCAGACGCTC
GCCCTGGCCG AGGAAGCGGC GGATGCGCCG GAGGCCGACA TCGTGGCCAC GCACACGGAG
GGCGATCTGG ACGATGCCGA CGTGGAGTGG ACTGAGGTCG GGTTGATCTA CCCGGGGCAG
ATGACCTGGG ACTGGCTCAC CGATCGCAGC CCCGCCGGCC ACCCCGGTGC CGGTCATGTC
ATCGGTGGTG AGCGCGCCAT CGGCGACGAG CACCGGCTGC CAAGGTTGCA GGACTACCTG
CTCTACGAGG AGCGCCGGCG CATCGATCAG CAGGACTGA
 
Protein sequence
MSRLPRPHAL LTASAAAALM ASGCAALDEE YEAALLLSDL GAEGHEKTRL QDRGGDPEVT 
SVRYDKGQGE READLYRPDG EARGNLVLIH GLTEQGKDDR RLVEMAESLT RVGYRVLVPE
VEELRELRWG PENRDDVADA VRHMANREAS RGLPLGVSTL SLMSGPVLLA TADDELRRRV
EFVALIGGYY DIEAWLRYVT TGHDPLADAR DDPEPRPESR WVLLQALAAR VDEDDREWMQ
RIAERRLDDP EADITAERAA LGEEAAALVD LLVNDDPEAF DDDLAEIPAA YREALDELDP
SRHDWSDYHP ELLLIHGTND PVVPFSHSEA LEAAAPDAHL YRSAGLAHVD LEGGLFDSVR
LWRAASALLD VRLDPEQPLA SDPGRNVYIP PRGELERTLR VGAVYNDDEI QIRYEFATEA
PSWYHQYWIY EVEGRGTGGE WVQYGSGGPE PDEHGLYEDR ISMLLDDGDL GLDRYGGFMT
AHEGMRGLTG AAESEEVEAH PHLGETLGRS DVRKYIPQSR EDGDNGGADW QDVRDEAELE
AMRERGEFLD LWQWRAHRSH PLGYADNGYV LEYRHSSEGR GMFTDNWDDE ADQPRWMYDP
DEAGFRALER DRLLDGAYDQ DDLYYLSEGH ATDFDPDHHW EDGDVLPQRF LQEPDGSRGA
IRAAGGYEDG AWRVRLTRSL EAPEPTDSHT LEPGGVYDVA FAVHEGVGQR WHRVSLPQTL
ALAEEAADAP EADIVATHTE GDLDDADVEW TEVGLIYPGQ MTWDWLTDRS PAGHPGAGHV
IGGERAIGDE HRLPRLQDYL LYEERRRIDQ QD
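
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that translation, not the database's own pipeline; `translate_cds` is a hypothetical helper, and the codon table is built from the standard TCAG ordering rather than taken from a library.

```python
from itertools import product

# Codon table in the conventional TCAG order. Table 11 assigns the same
# amino acids as the standard code; it differs in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


first_line = "GTGTCCCGAC TCCCCCGTCC CCACGCCTTG CTCACTGCCA GTGCCGCGGC GGCCCTGATG"
print(translate_cds(first_line))  # MSRLPRPHALLTASAAAALM
```

The printed residues match the first 20 residues of the protein sequence above.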