Gene Hhal_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1424 
Symbol 
ID4709973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1538070 
End bp1539407 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content68% 
IMG OID639855891 
Producthypothetical protein 
Protein accessionYP_001002993 
Protein GI121998206 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.294946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGG CAAGCGTCGT CGGCGCCCGG TCCGGGCGCC GTGTCCGCCC GCCCCGTTGG 
ATCGCGGGCC TGGCCTTGGC CGCCGGGGTC GGCTCGCCGG CCCTGGCCGA CTGGGACGAC
GATCCGTGGG CGGACGACCC CTGGGACGAG GAGGAGCAGT GGCTCCCTTT TGAGGTGGAC
GGCTTCGTGG AGATCGCCGG CGGCCACCAC ACCCGGGATA ACCAGGTCCT GGACAAGGAC
TACAACCTGG CCGAGGCCCG CCTGCGCCTG GAGGCGCGGG GGGACTGGCG CCGATTCGAT
TTCCGGGTCC GCGGTGACGG CGTGGCCGAC CAGGTCAAGG AGGAGATCCG CGGCGAGCTG
CGCGAGGCTC GGGTTGCCTT CCCCGTTGGG CAGCGGCTGG ACATGCGGGT CGGTCGTCAG
GTACTGGCCT GGGGGACCGG CGATCTGCTC TTCATCAACG ATCTCTTCCC CAAGGATTTC
AACTCGTTCC TCACCGGCCG CGACGAGGAC TACCTGCAGG GTCCCTCCGA TGCGGTGCGC
GGCACCTGGT ACGGCGACAA CGTGACCCTG GACCTGGTCT GGACCCCCGT CTTCGAACCC
GACGATTATC CGAACGGGGA GCGGCTGAGT TACTTTGACC TCCGTGAGGA GCGCCAAACC
GAGCAATCGC CACCCGCCGA CGACCCGGAC AGCTTCCCGG ACGACGGTGA GCTGGCGGCG
CGGCTGACCC ACCGCATCGG CAGCGCCGAG CTGGCCGGGT ACTTCTACCG CGGCTTCTTC
CCGCAGCCGG AGGAGCAGGC CAACGACCGC CTCACCCACG CCCGGCTCAA CGCCTACGGC
GCCAGCATCC GGGATCGGCT CGGCCCGGGC ATTGCCAACG CCGAGGTCGG CTACTACGAC
TCGGTGGACA ACCGCGACGG CGATCGCAGC GTGCAGGTCC CGAACTCGGA GTTCCGGGCG
TTGCTGGGCT ACACCTGGGA GGCGGCGACA AACTTCGACG TGGGTCTGCA GTACTACCTG
GAGTGGCTAC AGGATTACGA CGACCTGGAG GCCCGATGGC AGGCGGACGA CGACCTGCTC
CCCGAGGAGT ACCGCCAGGT GCTCACCACC CGGCTCACCT ACAGCGTGTG GCGGGACAAC
CTGATCGGCT CGCTGTTCGC CTTCTACTCG CCGGACGACG AGGATTACTA CCTACGGCCG
TCGGTGCGCT ACCGCGCCTC CGATGCATTG AGCTATTCGG TGGGAGGAAA CCTGTTCGGC
GGCGACAGCG ACCACACCTT TTATGGGCAG TTCAAGCGGG ATTCCAACCT CTACGCCCGA
GTCCGCTATC GCTTCTGA
 
Protein sequence
MSTASVVGAR SGRRVRPPRW IAGLALAAGV GSPALADWDD DPWADDPWDE EEQWLPFEVD 
GFVEIAGGHH TRDNQVLDKD YNLAEARLRL EARGDWRRFD FRVRGDGVAD QVKEEIRGEL
REARVAFPVG QRLDMRVGRQ VLAWGTGDLL FINDLFPKDF NSFLTGRDED YLQGPSDAVR
GTWYGDNVTL DLVWTPVFEP DDYPNGERLS YFDLREERQT EQSPPADDPD SFPDDGELAA
RLTHRIGSAE LAGYFYRGFF PQPEEQANDR LTHARLNAYG ASIRDRLGPG IANAEVGYYD
SVDNRDGDRS VQVPNSEFRA LLGYTWEAAT NFDVGLQYYL EWLQDYDDLE ARWQADDDLL
PEEYRQVLTT RLTYSVWRDN LIGSLFAFYS PDDEDYYLRP SVRYRASDAL SYSVGGNLFG
GDSDHTFYGQ FKRDSNLYAR VRYRF