Gene Hhal_0741 details

Gene Information

Locus tag: Hhal_0741
Symbol:
ID: 4711337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 829445
End bp: 830503
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 68%
IMG OID: 639855205
Product: hypothetical protein
Protein accession: YP_001002324
Protein GI: 121997537
COG category: [S] Function unknown
COG ID: [COG4913] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
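
The coordinate and length fields above agree with each other, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 829445
END_BP = 830503


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1059
print(protein_length(length_bp))  # 352
```

Both results match the Gene Length (1059 bp) and Protein Length (352 aa) fields.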


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGACC TGCGTGCCGA GCACGCCGGC CTGGAAGAGG AGCTGGAGTC GCTGCGTCAG 
CGCCGCTCCA ACATCCCGGC GCGCATGCTG GAGCTGCGCC AGCGCCTGTG CCAGGAGCTG
AGCCTCGATG AGGAGGCCGT CCCCTTCGCC GGTGAGCTCC TCCAGGTCCA TGAGCAGGAG
ACGGACTGGG AGGGGGCCAT CGAGCGGCTG CTCCACAACT TCGGCCTCTC GCTCCTGGTC
CCGGAGGCGC ACTACCGCGC CGTGGCCGGC TGGGTGGATC GCAGCCACCT GCGTGGCCGG
CTGGTCTACT ACCGGGTGCG CGAGCCGCGC AGCAGTGAGC CGCCCGCGCT GCACCCCGAG
TCCCTGGTGC GTAAGATCGC CATCCGTCCC GACTCGGCGT TCTACGCCTG GCTGGAGCAG
GAGCTGGGTC GGCGCTTCGA CTACGCCTGC ACCCGGGATC TGGAGACCTT CCGGCGCGAG
GATCGCGCCA TCACCCCGGC CGGGCAGATC AAGGCGGGCG GGGATCGCCA CGAGAAGGAC
GACCGCCACC GCATCGACGA CCGCTCCCGC TTCGTCCTCG GCTGGTCCAA CGAGGCGAAG
ATCGCCGCCC TCCAGGAGGA CGACCTGCCC CGCTTCGAGG CCCGGTTCAA GGAGCTGCTC
AACGAGAACA CCATCCGCGA GATCGCCAAC TTCAACGCCC AGCTCAACAA AGAGCGCGCG
CAGATCCGCG AGCGCATCGC AACCATCAAC GCCTCGCTGT TCGATATCGA CTACAACTCG
GGGCGCTACA TCGAGCTGGT CGCCGACACC ACCACCGACC CCGAGGTCCG CGACTTCCGC
GAGCAGCTCC GGGCCTGCAC CGAGGATACG GTGACCGGCT CCGAGGACGC CCAGTACAAC
GAGCGCAAGT TCCTCCAGGT CAGGGCGATC ATCGAGCGCT TCGTCGCCAG TGTCGGCTTC
GTCCACGGCG AAGGCGGGTG CTTCTCGCTG CTGCGCCATC TGAGCATCGA GGCGTACCAC
GCCGAAAAGG CCATCCGGCG CGCGGCAACC AGCGGATGA
 
Protein sequence
MRDLRAEHAG LEEELESLRQ RRSNIPARML ELRQRLCQEL SLDEEAVPFA GELLQVHEQE 
TDWEGAIERL LHNFGLSLLV PEAHYRAVAG WVDRSHLRGR LVYYRVREPR SSEPPALHPE
SLVRKIAIRP DSAFYAWLEQ ELGRRFDYAC TRDLETFRRE DRAITPAGQI KAGGDRHEKD
DRHRIDDRSR FVLGWSNEAK IAALQEDDLP RFEARFKELL NENTIREIAN FNAQLNKERA
QIRERIATIN ASLFDIDYNS GRYIELVADT TTDPEVRDFR EQLRACTEDT VTGSEDAQYN
ERKFLQVRAI IERFVASVGF VHGEGGCFSL LRHLSIEAYH AEKAIRRAAT SG
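
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is built from the standard amino-acid string, whose codon assignments translation table 11 shares (table 11 differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGCGCGACC TGCGTGCCGA GCACGCCGGC"))  # MRDLRAEHAG
print(translate("AGCGGATGA"))                         # SG
```

The two outputs correspond to the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.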