Gene Hhal_1137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1137 
Symbol 
ID4710121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1236536 
End bp1237846 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content72% 
IMG OID639855609 
Productamine oxidase 
Protein accessionYP_001002715 
Protein GI121997928 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.858264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGGG AGAATCCGCA AGTAGTTGTC ATCGGCGCCG GCCTGGCCGG CCTCGCCGCA 
GCACGTGATC TAGCGGCCGG CGGCGCCCGT GTGGAGTTGC TGGAAGCCGG CGACGAGGTA
GGCGGACGCG TTCGCACCGA CCGCCTGCGG CTGGACGGCA GCCCGGCGAG CGGCGAGGAG
CCGGCCTTCC AGCTCGACCG CGGTTTCCAG GTGCTACTGA CCGCCTACCC GGAGCTGCGC
AGCCGCGCCG ACCTGGACGC GCTGCAGCTG CGCCGGTACG CCCCCGGCGC CCTGATCCGC
ACCGAGGGCG GACTGCACCG GCTCAGCGAT CCGTTCCGCG CCCCGCAGGC GCTGCTGAAA
ACCCTACAGG CCCCGGTGGG CAGTCTCGGC GACAAGCTAC GCATCGCCCG CCTGCGGGCC
CGCCTGCGCC GCGGCGATGC CGAGCGTCCC CTGTACGGGC CGCAGCAAAG CAGCGCCGAA
GCCTTCGCCG CGGAGGGCTT CTCGGCACGG ATGGTCGAGC GCTTCCTGCG GCCGCTATTC
GGCGGCGTTC TCCTCGACCC CCAACTCCAG ACCTCGGCCC GACTGCTGAA CTTCGTCTTC
CGCATGTTCG CCGAGGGCGA TGCAGCCATC CCCGCCGGCG GCGTCGGCGA CCTGCCGCGC
CAGCTGGCTG CTCAACTCCC CGCCGACCGG GTGAGGCTGC GCCTTGGGAC TGCCGCACAG
GGCATTGAGC AGGGCCCCAT CGTATCGCTG GCGGGTGGCG AGCAACTGAG CGCCGATGCC
GTCGTGGTCG CCACCGACGG GCCAGCCTTC ACCCGTCTGA CCGGGCACCC CACCGCCGCG
GGCCGGCCAG TAACCTGCCT GCAGTTCGCC GCGCCGGAAC CGCCGGTGAC CGAACCGTTG
ATCGTACTCA ACGGCGAGGG GGAAGGGCCC ATACTGCACC TGGCGGCACC CAGTGTGGTG
GCCCCCGAAT ATGCGCCGCC TGGCTGGCAC CTGGTCAGTG CCACTGTGCT CGGCGAGGGG
CAGGACCGGG ACGACCCCTC CCTACAACGC GAGGCCGTCA GACAGTTGCG CGACTGGTTC
GGGCCGGGGG TGGATCACTG GCGCCCGCTG CGCCTGGAAC GCATCCCCTA CGGACAGCCG
GTTCAGACGC CGCCGGCGCT GACCCACCCG TACCAGCCGG CGCGGCTCGG AGGTGACATC
TACGCCTGCG GCGATCACCG CGCCCACGGA TCCCAGCACG GCGCATTACG CTCGGGCGCA
CTCGCCGCCG ACGCGGTGCT TGCCGATCAG GGAGGGGGAT CTGCCGCCTA G
 
Protein sequence
MPRENPQVVV IGAGLAGLAA ARDLAAGGAR VELLEAGDEV GGRVRTDRLR LDGSPASGEE 
PAFQLDRGFQ VLLTAYPELR SRADLDALQL RRYAPGALIR TEGGLHRLSD PFRAPQALLK
TLQAPVGSLG DKLRIARLRA RLRRGDAERP LYGPQQSSAE AFAAEGFSAR MVERFLRPLF
GGVLLDPQLQ TSARLLNFVF RMFAEGDAAI PAGGVGDLPR QLAAQLPADR VRLRLGTAAQ
GIEQGPIVSL AGGEQLSADA VVVATDGPAF TRLTGHPTAA GRPVTCLQFA APEPPVTEPL
IVLNGEGEGP ILHLAAPSVV APEYAPPGWH LVSATVLGEG QDRDDPSLQR EAVRQLRDWF
GPGVDHWRPL RLERIPYGQP VQTPPALTHP YQPARLGGDI YACGDHRAHG SQHGALRSGA
LAADAVLADQ GGGSAA