Gene Hhal_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1103 
Symbol 
ID4709963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1195530 
End bp1197551 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content71% 
IMG OID639855575 
Productradical SAM domain-containing protein 
Protein accessionYP_001002681 
Protein GI121997894 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.204136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGACT CGAAGACCCC CTCGCTATTC TCCTACCGCC CTTACTGGGC GGCCCGCTTC 
GGCGTGGCCC CGTTTCTGCC CATGTCCCGG CGCGAGATGG GCGAACTTGG CTGGGACAGC
TGCGACGTCA TCCTGGTCAC CGGCGACGCC TACGTCGACC ACCCGAGCTT CGGGATGGCG
ATCATCGGCC GCCTGCTCGA GGCGCAGGGC TTTCGCGTCG GCATCATCGC CCAGCCCGAC
TGGCGCAACA CCAGCGACTT CCAGGCCCTG GGGGCCCCGA ACCTGTTCTT CGGCGTCACC
GCCGGCAACA TGGACTCCAT GGTTAACCGG TATACCGCCG ATGGCCGGCG CCGCAGCAAC
GACGCCTACA GCCCCGACGA TGAGGGCGGC CGCCGGCCCG ACCGCGCCAC CATTGTCTAC
AGCCAGCGCC TGCGCGAGGC CTACCGCGAC ACCCCCATCC TGCTCGGCGG CATCGAGGCC
AGTCTGCGCC GCCTGGCCCA CTACGACTAC TGGTCCGACA AGGTCCGCCG TTCGGTGCTG
CTCGACGCCA AGGCCGATCT GCTGCTCTAC GGCAACGCCG AGCGCGCCCT GGTGGAGGCC
GCCCACCGCA TCGCCGGCGG CGAGCCGGCC CGAGAGATCC ACGACGTGCG GGGCATCGCC
TACGCCCGCA CGGCCCCGGG CGCCGACGAA CCGGATCCGG CCCGGGCCGA AGGTGTGCTG
CGGGTCCCGG ACTGGGAGCA GGTACGCCAG GATGCGGACC GGTTTGCCGA GCTGGCACGC
ATCACCCGCC AGGAGACCAA TCCGCACAAC GCCCGCACCC TGGTCCAGGC CCACGGTGCC
CAGGAGGTGT GGATCACCCC GCCGCCGTTG CCCCTGAGCA CCGAGGAGCT GGACGCCCTC
TACGAGCTGC CCTACGCCCG GGCGCCCCAC CCGGCCTACG CCGGGCGCCC CATCCCCGCC
TGGGAGATGA TCCGCTTCTC GGTGAACATC ATGCGCGGGT GCTTCGGCGG CTGCAGCTTC
TGCTCCATCA CCGAGCACGA GGGGCGCCTG ATCCAGTCAC GCTCCCAGGA ATCGGTGCTG
CGTGAGATCG ACGAGATCGC CGAGCGCACC GAGGGCTTTA CCGGGGTGAT CTCGGACCTC
GGGGGACCCA GCGCCAACAT GTACCGCATG GGCTGCAAGG ACCCGAAGGC CGAGGCCCAC
TGCCGGCGCA TGTCGTGCCT CTACCCGCGC GTCTGCCGCA ATCTCGATAC CGACCACGAG
CCACTGATCA ACCTTTACCG GCAGGCCCGT CAGCGGCCCG CGGTGAAGAA GGTGCTGGTG
GCCTCCGGTG TACGCTACGA CCTGGCCATC CACTCCAAGG CGTACGTGCG GGAGCTGGTC
AGCCACCACG TCGGCGGGTA CCTGAAGATC GCCCCGGAGC ACACGGAGCC CGGGCCACTG
GCGCAGATGC TCAAACCGGG CATGGACACC TACGAGCGCT TCAAGGCGCT GTTCGAGGAC
TACAGCCGGC GAGCGGGCAA GGAGCAGTAC CTGATCCCGT ATTTCATCGC CGCGCATCCG
GGCACCACCG ACGAGGACAT GCTCGAACTG GCGTTGTGGC TCAAGCGCAA CGGCTTTCGG
CCCAACCAGG TCCAGGCCTT CCTGCCCACG CCCATGGCCC TGGCCACCGC CATGTATCAC
AGCGGCCGCG ACCCGCTGCG GCCCGTGGGG TCGGACGGCG GCACGCCGGT GCGCACCGCG
CGAGGTCTGC GCACGCGCCG GCTCCACAAG GCCTTCCTCC GTTACCACGA TCCGGAGAAC
TGGCCGCTGC TGCGCCGGGC GCTGCAGCGG ATGGGGCGCG AGGATCTGAT CGGCAACAGC
AAGCGCCACC TCATCCCCGC CCACCAGCCG CCGGGCACAG GGACCTCCGG CGAGGGGCGG
CGGACCCCGG ACGGCAAGCG CCGCCAGGGG CAGAAACCCC CGACGGCCGA CAACCGGCCC
CGGCGCGGCC CGCGCCGGCC GCGGGGCAAG GCCCGGCGCT GA
 
Protein sequence
MDDSKTPSLF SYRPYWAARF GVAPFLPMSR REMGELGWDS CDVILVTGDA YVDHPSFGMA 
IIGRLLEAQG FRVGIIAQPD WRNTSDFQAL GAPNLFFGVT AGNMDSMVNR YTADGRRRSN
DAYSPDDEGG RRPDRATIVY SQRLREAYRD TPILLGGIEA SLRRLAHYDY WSDKVRRSVL
LDAKADLLLY GNAERALVEA AHRIAGGEPA REIHDVRGIA YARTAPGADE PDPARAEGVL
RVPDWEQVRQ DADRFAELAR ITRQETNPHN ARTLVQAHGA QEVWITPPPL PLSTEELDAL
YELPYARAPH PAYAGRPIPA WEMIRFSVNI MRGCFGGCSF CSITEHEGRL IQSRSQESVL
REIDEIAERT EGFTGVISDL GGPSANMYRM GCKDPKAEAH CRRMSCLYPR VCRNLDTDHE
PLINLYRQAR QRPAVKKVLV ASGVRYDLAI HSKAYVRELV SHHVGGYLKI APEHTEPGPL
AQMLKPGMDT YERFKALFED YSRRAGKEQY LIPYFIAAHP GTTDEDMLEL ALWLKRNGFR
PNQVQAFLPT PMALATAMYH SGRDPLRPVG SDGGTPVRTA RGLRTRRLHK AFLRYHDPEN
WPLLRRALQR MGREDLIGNS KRHLIPAHQP PGTGTSGEGR RTPDGKRRQG QKPPTADNRP
RRGPRRPRGK ARR