Gene Hhal_0742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0742 
Symbol 
ID4711306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp830613 
End bp831851 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content70% 
IMG OID639855206 
Producthypothetical protein 
Protein accessionYP_001002325 
Protein GI121997538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTACC ACCACCTCGA CACCCTGCGC CGTCACCACC CGGCCTGGCG GCTGCTCGCT 
GCCGATAGCG CGCCGCTGGT GCTCGGCTTC CTGCATCGTG CCTTCGTCGA GCCCAATTGC
CGGGCGCGCG CCCAGGGGGA GCTGACCAAC GCCCTGGAGG ACTACCTGCA CGGCCTGCGC
GAGGAGTTGG GCGAGGCGGT CTATCCACGC TCGGCGGCGG AGTATCTCAA TGACTGGGCG
GCGGACGAAC GGGGCTGGCT GCGCAAGTTC TACCCGCCGA ATACCGACGA GCCGCACTTC
GATCTGACCC CGGCGACGGA GACGGCGCTC GGCTGGCTCG ACGAGCTCCA GGAGCGGCAG
TTCGTGGGGG CGGAGTCGCG GCTGATGACG GTCTTCGATC TGCTCCGTCA GGTGGTCCAG
GGGGCGGAGA CCGACCCCGA GGCGCGGATC GAGGCCCTGG AGCGCCGGCG TGCCGAGATC
GACGCCGAGA TTGCCGCCAT CCGCGATGGG CAGCTGGCGC TGCTGGACGA GACGCAGATC
AAGGAGCGCT TCTACCAGGT GGAGCGCACG GCCCGCGGGC TGCTCTCCGA CTTCCGGCAG
GTGGAGCAGA ACTTCCGTGA ACTCGACCGC CAGGTGCGCG AGCGCATCAC CACCTGGGGC
GGCAGCAAGG GGGCGCTGCT CGGTGAGGTC TTCGGGGAGG CCGACGCCAT CGCCGACTCG
GACCAGGGCA AGAGCTTCCG GGCCTTCTGG GGGTTTCTCA TGTCGCCGGC GCGTCAGGAG
GAGCTCACCG ATCTGCTCGA GCGCGTCCTG GAACTCGAAC CGGTCCAGAC CCTGGGGCCG
GACCGCCGGC TGGCGCGCAT CCACCACGAC TGGTTGGATG CCGGCGAGGA GACCCAGCGC
ACCGTCTCGC GGCTGTCGGC GCAGCTGCGC AAGTTCCTCG ATGACCAGGC CTGGCTGGAG
AACCGCCGCA TCATGGACCT GATCCAGGGG GTGGAGCAGC ACGCCCTGGC CGTGCGCGAT
CACCCGCCTG GAGGATCGGA TCGCCCAGCT CGACCGGACC TACGGCGAGC AGCAGGGCCA
GCGCGACGAG CTCAAGCAGG CCATCGCCGA CAATGGCGGG GATCGCCTGG AGCGGCTCAA
GGCGGAGATC GCCCGCAAGA GCGAGGAGAA GGAGCGGCGC AGCGCCAGTG CCCGCACCTA
CAACGAGCTG GCCCGCGCCC TGGGGCTGCC GCTGGCTGA
 
Protein sequence
MDYHHLDTLR RHHPAWRLLA ADSAPLVLGF LHRAFVEPNC RARAQGELTN ALEDYLHGLR 
EELGEAVYPR SAAEYLNDWA ADERGWLRKF YPPNTDEPHF DLTPATETAL GWLDELQERQ
FVGAESRLMT VFDLLRQVVQ GAETDPEARI EALERRRAEI DAEIAAIRDG QLALLDETQI
KERFYQVERT ARGLLSDFRQ VEQNFRELDR QVRERITTWG GSKGALLGEV FGEADAIADS
DQGKSFRAFW GFLMSPARQE ELTDLLERVL ELEPVQTLGP DRRLARIHHD WLDAGEETQR
TVSRLSAQLR KFLDDQAWLE NRRIMDLIQG VEQHALAVRD HPPGGSDRPA RPDLRRAAGP
ARRAQAGHRR QWRGSPGAAQ GGDRPQERGE GAAQRQCPHL QRAGPRPGAA AG