Gene Hhal_0799 details

Gene Information

Locus tag: Hhal_0799
Symbol:
ID: 4711504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 879860
End bp: 881257
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 55%
IMG OID: 639855258
Product: O-antigen polymerase
Protein accession: YP_001002377
Protein GI: 121997590
COG category:
COG ID:
TIGRFAM ID:
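
The length fields in this table can be cross-checked against the coordinates. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the record states neither. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 879860
end_bp = 881257

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1398
print(protein_length_aa)  # 465
```

Under those assumptions, both values match the Gene Length and Protein Length fields.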


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00494175
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTTAACA CACACAAAGC GTACTGGCGA TCTATCCTTC CCCCCTTTGA GCCTTGGGCA 
AATGTCCGAC TCGCGATCGC TGATCGCATT GGCTTCCTTG CCTTGCTCCT CATAGCCTTC
ACAGGGCTGT GGATCGGCGA TTTATACCGT CTTGGCCTGG CATTGGTGGT GGTTGCTTTC
TTGATAGCCG CCGTGGACCT GTGGCCAAGC ATGAAGCGCA GCGCTCTGTT TTGGGTCGGC
GTCGCCTTCG TGCTCTTCGC TACGATACGC CACTGGGTGG CTGTGGTGGA GTTGGGCATC
GATATGGTGG ACTCGCCACC CAAGACCTCA CAAATGATCC GCACCTCTCC GCTCCTGGTC
GCTTTCGCGG CGATATGGCT CCGTGGGGAC GAACAACGTC TAGTGTGGTT TCTGTCGGCG
ACGTTACTCG GTGCAATAGC TTGGCTTATT ACGTCGACTC ATTGGGACGC GTTTATTGAA
ATGTTCCGCA ACTGGGACTG GAGCAACGCG CGTCGAGAGC TGTACGAAGG GAGCACCAAC
CGCACCCCAT TTATTTATCT AGTTCTCAGC CTGGCTCTTA TTACAATCGG CACTGGCTTT
GTCGTACGAT GCACCCATGT GTGGTGGCGT GCTGGCCTTT TCATCGCTTC CCTGACAATG
GCCGTTGGAT TCTTGTCGCT TACCTTGACC ATCGAGACGC GTGGTGCCCA AATCGCAGCA
TTTGTGGCTT ATGGGTTGCT TGCTGCGGCA ATGGTCAGTA AGGGGCTCCG CCACCTCGGC
ATCATTAACC GGGGGACCCG CTGGGCATTA ACGGCCACAG GGGGCGCAAC CATCATTGCT
ATCGGAATCG CGCTCTGGAT CACCATCGGC ACTAGTAGTG AGCGACTGCA CAATACGATG
GAGGCAGGAA AAGCGTTAGC ACAAAATCCG GACTATCTCC ACGAGCCTTT CGAAGCGGCA
CAGGAGCATG ACGCCATTCG AGCCGGGAGC GTAGTCCAGC GCCTCAACCT GGTAAGCTTG
GCTGCTGATG CGATCGCGGA ACGACCATTG GTCGGATGGG GAGGAGGCAC TAGTCATAAA
TGGGTACGCG AGTACGGACG TACTGATTTT CACAACTGGT ATCTTGATGT AACGGTTGCT
TTTGGTCTGA TCGGCGCCGC CCTTTATTTC GGCGGCTTCG TCTATATCCT AGGAAGCAGC
ATTAGGGCCC GAATGATTCA TCGACTTGAT CCTCATGTCG CCCTGTTTGC CTTCAGCGCT
ACGGCAGCTT GGTTGACAAC ACAGCTATTC ACCACTTGGA TCAGCGCCGC ACAAGGACGT
TTTACATTAG TATTTATTGC GACATTGTTG GCGTTCGCTC ACACTGCGCG CTGGCTGCCG
GAGCTACGAA AAAAATAA
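
The gene sequence is printed in space-separated groups of ten bases, so the grouping has to be stripped before the sequence is measured. The sketch below shows one way to compute a G+C fraction from such a block. The helper name `gc_fraction` is hypothetical, and this is not necessarily how the GC content field in this record was derived.

```python
def gc_fraction(seq_block: str) -> float:
    """Return the G+C fraction of a sequence written in space-separated groups."""
    # Remove the spaces and newlines used for display grouping, normalise case.
    seq = "".join(seq_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The final line of the gene sequence, used here as a short example.
last_line = "GAGCTACGAA AAAAATAA"
print(round(gc_fraction(last_line), 3))  # 0.278
```

Passing the full sequence block to the same function would give the gene-wide fraction, which could then be compared with the GC content field.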
 
Protein sequence
MFNTHKAYWR SILPPFEPWA NVRLAIADRI GFLALLLIAF TGLWIGDLYR LGLALVVVAF 
LIAAVDLWPS MKRSALFWVG VAFVLFATIR HWVAVVELGI DMVDSPPKTS QMIRTSPLLV
AFAAIWLRGD EQRLVWFLSA TLLGAIAWLI TSTHWDAFIE MFRNWDWSNA RRELYEGSTN
RTPFIYLVLS LALITIGTGF VVRCTHVWWR AGLFIASLTM AVGFLSLTLT IETRGAQIAA
FVAYGLLAAA MVSKGLRHLG IINRGTRWAL TATGGATIIA IGIALWITIG TSSERLHNTM
EAGKALAQNP DYLHEPFEAA QEHDAIRAGS VVQRLNLVSL AADAIAERPL VGWGGGTSHK
WVREYGRTDF HNWYLDVTVA FGLIGAALYF GGFVYILGSS IRARMIHRLD PHVALFAFSA
TAAWLTTQLF TTWISAAQGR FTLVFIATLL AFAHTARWLP ELRKK