Gene Hhal_1507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1507 
Symbol 
ID4709120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1629989 
End bp1631521 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content63% 
IMG OID639855974 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_001003076 
Protein GI121998289 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.577803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGTTTAG AGCAACTCTA TCAGCACATT ATCCAGCAGC TGCGCGCCTC CTGGCGGCGC 
CGGTGGTGGC TGATCCCGGT CGCGTGGGCG GTCTGCCTGG TCGGCTGGGC TTACATCAAC
ACCTTACCCG ATGTCTATCA GTCTTCAAGC CGCGTCTACG TAAACTCTCA GACCGTGCTC
GAACCCCTGT TACGCGGTAT GACCGTCCGT CCGGACACAG AGCAGCGCGT CCGCATGATG
ACTGTAACCC TTCTCAGCAA CGACAACCTC AGGGAGATCG CTCGCCAAGC GGATCTCGAT
GTCCTGCTCA ACCAGGATAA TGAACAAGCG TTGATCGGCA CCTTGCGCGG CGGCATTCAA
TTGGACGGTG GCCGGCGCGA TAACATCTAC ACCATCGCCT TTTCCCACCG GGATCCCGAG
GTCGCCTATC GGGTCGTCCG CGAGACCTCC AATCTGTTCA TGGAGCGCGG CCTTGGCGAC
TCGCGGGTTG ATCTCGCCTC CTCGCAGACG TTCATCGAGC GACAGCTCCA ACGTTACGCC
AGTCAGCTGC AGAACAAGGA GGCTGAGCTT GAATCCTTCA AGCGTGAGAA CCATTCGCTG
CTCAGCGCTG GCGGCAACTA TTACACCCGG CTGGAGCGCG CTCGCGACGC CCTTGAGCAG
GCACAGCTCG AGCGGGACGA ACACGCCCAA CGTTTAGAAA CTCTGCAGGC CAGGCTCGAA
AAAGACAGAC AATCCCCCAT TGCCGAGGAC GCGCATTTGA GTAACCCCCG GCTGGACCAA
CGGATCAGCC GGCTGGAATC CCAGCTCGAT GAGATGCGGC GCCACTTCAC GGATGCCCAC
CCGGACGTCG CCCAAACTCG ACGCATCCTC AAGGAGCTTG AGGAGCGGCG CCGTGAAGAA
ACCCGCATGG CTCTCGCAGA TCCCGCTCGG TCCGTCGAGG GCGTTGTCGG CAGCCCGCTC
CGGTTGGCAT TGGTTGACGC AGAAAGCCAT GCTGCTTCGC TGGAAACCCG CGTTCAAGAG
CATAAGCGGC GCGTGGAGAA CATCGCCGCG CTTGTCGATC AGGTTCCTGC CATCGAATCC
CGATTCAATG CCTTAAAGCG CGACCACGAG GTGCTGCAGC AAAGCTATCG CCAACTGCTG
ACCACCCGCG AGCGGGCCGC CATGACCGGG TCCGTCGAGA CCGAGACGGC TGCCGTCGAT
TTCCGGGTCC TGGAGCCGCC GACACGGCCG AGTTCGCCGT CCGCACCCGA TCGCCCCCTG
CTTGCGAGCG GTGTCCTCCT GCTAGGCCTC GGCGCCGGCT CCGGACTGGC CTACCTGCTC
GCTCAGTTGC GCGGCACCGT CACCTCCACC GCACGCCTGG CGGAGATCAC CCGTCGCCCC
GTACTGGGCT CGGTCACCCG GGTCCCGACC CCGAACCGGC GGCGGCGTCA ACGGCTCGAG
CTCATGATCT TTGCAGGGAT CCTCGGAACC CTCTTTGTGG CCTATCTGAT GGTGTTGGCG
TACTACGGCG GCGGGGGGTT GTGGCCGTTC TAG
 
Protein sequence
MGLEQLYQHI IQQLRASWRR RWWLIPVAWA VCLVGWAYIN TLPDVYQSSS RVYVNSQTVL 
EPLLRGMTVR PDTEQRVRMM TVTLLSNDNL REIARQADLD VLLNQDNEQA LIGTLRGGIQ
LDGGRRDNIY TIAFSHRDPE VAYRVVRETS NLFMERGLGD SRVDLASSQT FIERQLQRYA
SQLQNKEAEL ESFKRENHSL LSAGGNYYTR LERARDALEQ AQLERDEHAQ RLETLQARLE
KDRQSPIAED AHLSNPRLDQ RISRLESQLD EMRRHFTDAH PDVAQTRRIL KELEERRREE
TRMALADPAR SVEGVVGSPL RLALVDAESH AASLETRVQE HKRRVENIAA LVDQVPAIES
RFNALKRDHE VLQQSYRQLL TTRERAAMTG SVETETAAVD FRVLEPPTRP SSPSAPDRPL
LASGVLLLGL GAGSGLAYLL AQLRGTVTST ARLAEITRRP VLGSVTRVPT PNRRRRQRLE
LMIFAGILGT LFVAYLMVLA YYGGGGLWPF