Gene Hhal_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0149 
Symbol 
ID4710702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp171720 
End bp173387 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content66% 
IMG OID639854607 
Productcytochrome c family protein 
Protein accessionYP_001001745 
Protein GI121996958 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATAGCTC GGCGCTGGTT GGCAGGCCTG CTGGTCCTTG CGGTAGGGGT GAGTTCCGCG 
GCGGGTGCCG AGGATGTCGA ACCGCCCATT CCTGGCGAAG AGGCGGCGGC GGAGCCCGGT
TCCACGGCGG ACCATTCGCA GTTCTCGATC CTTGAGGGGC CTTTCGAGAC CGGCCCCGAG
GTCACTGAGG CGTGCCTGCA ATGCCACACC GAAGCTGCCA AACAGGTCCA CAGCAGCATC
CACTGGACCT GGACCTACGA GCAGCCCGAG ACCGAGCAGA CCCTCGGTAA GCGTTACGTG
CTCAACAACC TGTGCATGGG GATCGCCGGC AGCTATGAGC GCTGCTCCTC CTGCCACGTC
GGGTATGGTT GGGAAGATCG CGACTTCGAC TTCACCGCCG AGGAGAAGGT CGACTGCCTG
GTCTGTCACG ACACCACCGG CGATTACGTC AAGTTCCCCA CCGCCGCCGG CCATCCGCCC
TACGAAGACA CCGAGTTCCG CGGCACCCTG TTCGAGGCGC CGGACCTGGC CCATGTGGCA
CGGAACGTCG GCGATACCAG CCGTGCCACC TGCGGCAGTT GCCATTTCGA GGGGGGTGGG
GGCAACGCCG TCAAGCACGG CGATCTCGAC AGCTCGTTAC TCGACCCGCC GCGCTCCGTC
GACGTGCACA TGACCCCGGA CGGCGCTGAC TTCAGCTGCT CGAACTGCCA CGAATTCACC
GGCCACATCC AGAGCGGCAG CCGTTACCAC CTGACGATGC CGGACACCGA CGATGCCCCG
GTGCCGGCGC GTCCGCAGGA CAAGCCGGCC TGTGTCGCCT GCCACGGCAG TGAGCCCCAT
GAAGGGCGCA TCCACGACAA GCTCAACGCC CATGGCGAGT TCATCGCTTG CCAGACCTGC
CACGTGCCGG AGATCGCCCG GGGCGGCTAT CCGACCAAGA CCCTGTGGGA CTGGTCCGAG
GCCGGCCGAC TCGACGATGA CGGTCAGCCC ATTGTCGAGA AGGACGACGA GGGGCGGGTG
GTCTACGACG GCATGAAGGG GGTGTTCGAG TGGGACGAGG ACTACCCGCC GGATTATCGC
TGGTTCGACG GCAACATGGT CTACACACTG CCCGACGACA CCATCGATCC GGACGACGAG
GTTCCCGTCA ATCGGCCCCA GGGGCAGCCG GGCGAGGAGG GCGCCAAGAT CTGGCCGTTC
AAGATCATGT ACGGCCAGCA GCTGTACGAT GCTGAGCATC ATACGCTGCT GGTGCCCCAG
CTCTTCGGCA AGGAAGGCGA TGAAAACGCC TACTGGCAGA ACTACGACTG GGACCGCGCC
ATCGAGGCCG GCATGGAAGA GGCGCGTGCC GTGGGACAGA CCGAGATGGT ATACAGCGGT
GAGTACGGCT TCGTCGAGAC TCGGATGTAC TGGCCGGTTA ATCACATGGT GGCGCCGGCC
GAGGAGAGCG TCGCCTGCGT CGACTGCCAC AGCCGGGATG GCCGCATGGC CGGGCTGGAT
GGGGTCTACG TGCCGGGGCA GGATCGCCAT CCGCGGATCG AGGCTGTGGG CTGGACGGCG
GTCTGGCTGA CCCTGTTCGC GGTGCTGGGC CACGGCGGCG TGCGCTACTA CCTCTACCGT
CGAGAGCGCC TCGGCAATCG CCATGGCGAG GAGGGCTCGA GCTCGTGA
 
Protein sequence
MIARRWLAGL LVLAVGVSSA AGAEDVEPPI PGEEAAAEPG STADHSQFSI LEGPFETGPE 
VTEACLQCHT EAAKQVHSSI HWTWTYEQPE TEQTLGKRYV LNNLCMGIAG SYERCSSCHV
GYGWEDRDFD FTAEEKVDCL VCHDTTGDYV KFPTAAGHPP YEDTEFRGTL FEAPDLAHVA
RNVGDTSRAT CGSCHFEGGG GNAVKHGDLD SSLLDPPRSV DVHMTPDGAD FSCSNCHEFT
GHIQSGSRYH LTMPDTDDAP VPARPQDKPA CVACHGSEPH EGRIHDKLNA HGEFIACQTC
HVPEIARGGY PTKTLWDWSE AGRLDDDGQP IVEKDDEGRV VYDGMKGVFE WDEDYPPDYR
WFDGNMVYTL PDDTIDPDDE VPVNRPQGQP GEEGAKIWPF KIMYGQQLYD AEHHTLLVPQ
LFGKEGDENA YWQNYDWDRA IEAGMEEARA VGQTEMVYSG EYGFVETRMY WPVNHMVAPA
EESVACVDCH SRDGRMAGLD GVYVPGQDRH PRIEAVGWTA VWLTLFAVLG HGGVRYYLYR
RERLGNRHGE EGSSS