Gene Hhal_1958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1958 
Symbol 
ID4710931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2153118 
End bp2154671 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content69% 
IMG OID639856431 
Producthypothetical protein 
Protein accessionYP_001003524 
Protein GI121998737 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCAGG ATCCCTTTGA CGTACCGCCC CTTGCCGCGG ACCCGGAGGC CAACGGCCCG 
CCGCACATGG AGACCGAAGC CACCGGGCAC GTGCGCCCGC ACCCGGCCAG TAGTGAGCAC
CAACAGGCGC TCGGCTTCCC CGGCGAGATC CCGGAGGACT GGCAGGAGCG CGCCCTGGCC
CGCATGGAGA CCCTGCTCGA GCGCAATCGC TCGCTGCGCG TCTACATGGA CGCCTGCGTG
CGCTGCGGCG CCTGCACCGA CAAGTGCCAC TTCTACCTGG GCACCTCCGA CCCGCAGAAC
ATGCCCGTGG CCCGCCAGGA TCTGATGCGC GACGTCTACC GGCGCCACTT CACCCCTGCC
GGGCGCAACT TCCCCAGCCT GGTCCGTGGC CGCGAGCTGA CCCGCGAGGT GATGGAGGCG
TGGTTCACCT ACTTTCATCA GTGCTCGCAG TGCCGGCGCT GCTCGGTCTT CTGCCCCTAC
GGCATCGACA CCGCCGAGAT CTCCATGGCT GCTCGGGAGA TCCTCGACGC CGCCGGCTTC
GGCCAGAAGT ACACCAACGA GATCATCGGC AAGGTCCACA AGGTGGGCAA TAACCTCGGC
CTCCCCGGGC CGGCACTGGA GGACACCCTT GAGGGTCTGG AGGAGGACCT CAAGGACGAC
ACCGGCCACG ACATCCGCAT CCCCCTCGAC CAGGAGGGCG CGGACATCCT GCTGGTCACC
CCGTCAGCAG ACTTCTTCGC CGAGCCCCAC GTCGACGGCC TGATGGGGTA CGCCAAGGTC
CTGCACCAGG CCGGGCTCTC CTGGACGCTG AGTTCCTACG CCTCGGAGGC GGCCAACTTC
GGGATGTTCA TCGGCAGCTA CGAGCAGATG AAGCAGATCG CCGAGCGCAT CCGCAAGGCC
GCCGTCGACC TGGGCGTCAA GCGCATCGTG GTCGGCGAGT GCGGCCACGC CTGGCGGGTG
GCGTACAGCT TCTGGAACAC CCTGGCCGGC ATCGGCCGCG GGGCCGACGC CGACGACGAG
TACGCGCGGG CCCTGCAGCG CCAGCTCGAT CCGCGTTACC CGGTACCGCA GCACATCTGC
GAACTCACCC AGGACCTGGT CGATCGTGGC GCCATCCGCC TTGATCCGGA GGCCAACAGC
CACTACGGCG GTGTCACCTT CCACGATTCC TGCAACGTCG CCCGCGCCTC CCGCATGGGG
CAGCGCCCCG GGGGGCAGCT GGAGATCCCC CGCCGGCTGC TGCGCGCCAG TGTCCACAAC
TACCAGGACA TGGCCGACGC CACCATCGGC GACCGGACCT TCTGCTGCGG GGGCGGGGGC
GGTCTGCTGA CCGACGAACT CATGGAGCTG CGCGTCAAGG GCGCGCAGCC GCGCGTCTCG
GCGCTGCGCG AGACCATGGA CGAACACGGG GTCGACCGCA TGGTGGCCAT CTGCGCCATC
TGCAAGGCCC AGTTCAGCAA GGTCCTGCCG TACTACGACA TCCCGCGCGA GACCATCATC
AGCCTCCACG AGGTGGTGGG CAACGCGGTG CGGCTGGATC GCAGCGAGGC TTGA
 
Protein sequence
MSQDPFDVPP LAADPEANGP PHMETEATGH VRPHPASSEH QQALGFPGEI PEDWQERALA 
RMETLLERNR SLRVYMDACV RCGACTDKCH FYLGTSDPQN MPVARQDLMR DVYRRHFTPA
GRNFPSLVRG RELTREVMEA WFTYFHQCSQ CRRCSVFCPY GIDTAEISMA AREILDAAGF
GQKYTNEIIG KVHKVGNNLG LPGPALEDTL EGLEEDLKDD TGHDIRIPLD QEGADILLVT
PSADFFAEPH VDGLMGYAKV LHQAGLSWTL SSYASEAANF GMFIGSYEQM KQIAERIRKA
AVDLGVKRIV VGECGHAWRV AYSFWNTLAG IGRGADADDE YARALQRQLD PRYPVPQHIC
ELTQDLVDRG AIRLDPEANS HYGGVTFHDS CNVARASRMG QRPGGQLEIP RRLLRASVHN
YQDMADATIG DRTFCCGGGG GLLTDELMEL RVKGAQPRVS ALRETMDEHG VDRMVAICAI
CKAQFSKVLP YYDIPRETII SLHEVVGNAV RLDRSEA