Gene Hhal_0160 details

Gene Information

Locus tag: Hhal_0160
Symbol:
ID: 4710589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 186157
End bp: 187671
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 65%
IMG OID: 639854618
Product: hypothetical protein
Protein accession: YP_001001756
Protein GI: 121996969
COG category: [S] Function unknown
COG ID: [COG3517] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03355] type VI secretion protein, EvpB/VC_A0108 family
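
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the helper names `span_length` and `coding_codons` are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 186157
END_BP = 187671
GENE_LENGTH_BP = 1515
PROTEIN_LENGTH_AA = 504


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 187671 - 186157 + 1 gives 1515 bp, and 1515 / 3 - 1 gives 504 aa, which matches the listed lengths.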


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.586065
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAGC AGACGAGCGC AGATAGGGCC GCAGAAGCCA CGGCCCCGGC GGCAAGCTAT 
GCGCACCTGT GCCAGCTGGC CGAGGTCGAG CCGGTCTCCG GCGCCCTGGA GATCGCCACC
TTCCAGGACT CGGCGGTCAT GGCGGACATC CCCTCGGAGA GTCGATTGAC CGCTGCCCTG
CAGGTGTTCC TGGATCTCGC CAGTCAGGAC GGCGAGCTGG TCGAGCGCAT CGACAAGGCG
TTGCTCGACG AGTATATCGC CCGCATCGAC GCGGCGGTGA GCGAGCAGCT CGACGCCGTC
CTGCACCATC CGGAGTTCCA GCGGGTGGAG TCGGCCTGGC GCAGCCTGCG TTTCCTCGTC
GAACGCAGCG ATCCCAAGGC GAACATCAAG CTGGAGCTGC TCGATGTCTC CAAGGAAGAG
CTGGCCGGCG AGCTCGAGGA TGTCACTGAC ATCACCCAGT CCGGCCTGTA CCAGCACGTC
TATGTGCAGG AGTACGACAC CCCGGGCGGG GAGCCCTTAG CCGCCATGGT CTCCAACTAT
GAGTTCGACT GCTCGGCGGC GGACATCAAC CTGCTGACCG AGGTATCCCG GATTGCGGCG
GCGGCCCACT GTCCCTTCCT CGGTGCCGTC GGCCGGGACT TCTTCGGCAA GGCCTCCCTG
GATGAGGTGG TCCGGATCCC GGATATCGCC AGCTATCTAG ATAAGGCCGA GTACGCCCGC
TGGCGCGGGT TCCGCGACAC CGAAGACGCT CGGTACGTCG GCCTCACCCT GCCGCGGTTC
CTGCTGCGCC TGCCCTACGG GGCCGACAAC CCCACCCGTG CGTTCGACTA CCGCGAGAAC
GTCACCGGGG TCGATCACGA TCGCTACCTT TGGGGGAATG CAGCCTTCGC CTTTGCCGCC
AACATGGCCC GTTCCTTCAA GGCCTACGGG TGGACGGTCA ATATTCGCGG GCCGGAGTCC
GGCGGCAAGC TCGAGCAGCT GCCGATCCAC GTCTTCGACC TCGGCCGTGG GGCGCAGACC
AAGACCCCCA CCGAGGTGCT CATCTCCGAG AACCGCGAAA TCGAGCTGGC CGAGGCCGGA
TTCATCCCGC TGAGCTTCTA CAAGAACAGC GATTACGCCT GCTTCTTCTC GGCCAACTCG
GCGCAGCGTC CGGCCCGCTA CAACAGTCCC GCGGCGACGG CGAATGCCCG GATCAACGCC
CGGTTGCCGT ACATCTTCCT GGTCTCCCGT CTGGCTCACT ATCTCAAGGT GCTGCAGCGG
GAGAATATCG GCTCGGCCAA GAGCCGGCAG GACCTGGAAA ACGAGCTCAA CGATTGGCTG
CAGGGGCTGG TGACCAAGAT GCAGAATCCG GATCCCGATC TGGTCGCTAC CCGCCCGCTA
CGCGAGGGGG TGGTGGAGGT CGAGGAGGTC CCCGAAAACC CGGGCTTCTA CCGGGTCAAC
ATGTCGGTGA TGCCGCACTT CCAGATCGAG GGTATCGACC TGAAGCTCTC GCTGGTGTCG
CAGTTGCCGA CCTGA
 
Protein sequence
MSEQTSADRA AEATAPAASY AHLCQLAEVE PVSGALEIAT FQDSAVMADI PSESRLTAAL 
QVFLDLASQD GELVERIDKA LLDEYIARID AAVSEQLDAV LHHPEFQRVE SAWRSLRFLV
ERSDPKANIK LELLDVSKEE LAGELEDVTD ITQSGLYQHV YVQEYDTPGG EPLAAMVSNY
EFDCSAADIN LLTEVSRIAA AAHCPFLGAV GRDFFGKASL DEVVRIPDIA SYLDKAEYAR
WRGFRDTEDA RYVGLTLPRF LLRLPYGADN PTRAFDYREN VTGVDHDRYL WGNAAFAFAA
NMARSFKAYG WTVNIRGPES GGKLEQLPIH VFDLGRGAQT KTPTEVLISE NREIELAEAG
FIPLSFYKNS DYACFFSANS AQRPARYNSP AATANARINA RLPYIFLVSR LAHYLKVLQR
ENIGSAKSRQ DLENELNDWL QGLVTKMQNP DPDLVATRPL REGVVEVEEV PENPGFYRVN
MSVMPHFQIE GIDLKLSLVS QLPT
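
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below, which is not the database's own method, strips the whitespace, computes the GC fraction, and translates a coding sequence with the codon assignments of translation table 11. The function names are illustrative. It does not handle alternative start codons; this gene begins with ATG, so that limitation does not matter here.

```python
from itertools import product

# Codon assignments of NCBI translation table 11, in TCAG order.
# These amino-acid assignments are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases and last 15 bases of the gene sequence listed above.
    print(translate(clean("ATGAGTGAGC AGACGAGC")))  # MSEQTS
    print(translate(clean("CAGTTGCCGA CCTGA")))     # QLPT
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and the complete protein.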