Gene Haur_4125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4125 
Symbol 
ID5735986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5272330 
End bp5273958 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content51% 
IMG OID641281279 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001546885 
Protein GI159900638 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCACTAC AATTCCTAAG CCTACCAACA GTTTCATTGC GAAGCTGGGT TTTGGTAGGC 
TTATTCAGTT TGGGGCTGGT AGGCTTATTG GGCTTAACCT TGGCAGGCAT TCGCCGTTGG
AGCAAGCCAC GCCAAGCAAC TGACGACCAT TTGAGCACGA TTGGCCGCAA CACTAGTATT
CCCTTTGCTC TACAAATGGC CAGTCGCATG CTTGATTTGG TCTTTGCGAT GATTCTTTAT
CGCTTTTTGG CTGCCGAAAC CGTTGGAGCC TACGACTTTG CGGCAGTGAT TGTCGTCAAT
TATTTTGGCA CAATTGCCGA TTGGGGCTTA ACGGTTTTGG CAACGCACGA AATTGTGCGC
CAGCCAAGCC AAGCGCCCCA AACATTTCGC ACAACGCTCT GGCTCCGTTT GCGTTTTGCA
ATTTTAGCCT TGCCAATTGC CGTGATTTTT GTGCTGATCT ACAACGGCTT GGCGCAGGCT
GAGATTACGG CGGTTGGCCT GACCAGTCAG CAAATTACGG TGATCACAAT TTTGATGCTG
ACCTTGTTTC CGGCGGCACT TTCGGCCAGC GTCACCGCTT GGTTGCAAGG CCACGAGCGC
TTGGTCGCGG CTGCGGTCGT CAATCTTTTG ACCAATATTG GGAGTGCAGC ATTTCGCTTA
ACTGCCTTGA TTTTGGGCTT TGGCATTATT GGGATTGCTA GCGGAGCCTT GGCGGGAGCA
TTGCTCAGCG CCCTCTTATT TTGGCTGGCG ATGCGGCGTT TCTTCCCCGA AGTAGCGTGG
TTTGGCCCAA CCTTACCCGC CAAACCCTTG CTCAAAGAGG GCTACCCGCT CTTGCTCAAT
AGTTTGTTGA TGACGATCTT TTTTCGTTTC GACACCATTT TGTTGAGCGC CTTCCACGGC
TTTGTGGTCT CGGCAACCTA TGGCGTAGCC TATAAACTGA TTAATTTCAC CCAAATTGTG
CCGCCAATTG TGGTTAACGC GATTTTCCCG ACGCTAATTC GCCGTTCCGG CGATGATCGA
GCTGGAATGA GTCGGGCTTA TGCTGGCACA TTGCGTATGT TGCTGAATTT AGCGTTTGGC
ATCGCCGTTG TGGCTACAAT TATCGCTGTG CCACTAACCA CATGGCTCGC CGATCGGCCT
GAATATTTGC CAGGCAGCGT CTATGCCTTG ATGATTACGA TTTGGTATTT ACCAGGCAGC
TATCTGAATG GCCTGACTCA ATATGTGATT ATCGCGCTGG GCAAGAAACA GGCAATTACT
AAGGCTTTTG GTTTAACTGC AATGGTCAAT TTGGGCTTGA ATATTTGGTT GATTCCACGC
TATAGCTATT TTGCCGCCGC CGCAATCACG ATTGTTTCTG AGCTTGTGTT ATTTTTGCCG
CTCTGGCTGG TACTACGCCG CGAACAGATT AACATCAACT TGGCGAGTTT ATTTTGGCGG
CCTGCGCTGG CAGCATTGCT GGCTGGTGGT ATCGGCTGGT TGTTGCTCAG CATCAATGTG
TATTTGGCAG GAGTCGTAAC CGGATTAATC TATGGCGCTG GCTTATGGTT CAGCGGCAGC
ATCGGCCAAA CAGAACGCGA ATTGGCTGCG CGGATGTTTA AAAAGTTACG CCCCCAAGCA
TCAAGCTGA
 
Protein sequence
MALQFLSLPT VSLRSWVLVG LFSLGLVGLL GLTLAGIRRW SKPRQATDDH LSTIGRNTSI 
PFALQMASRM LDLVFAMILY RFLAAETVGA YDFAAVIVVN YFGTIADWGL TVLATHEIVR
QPSQAPQTFR TTLWLRLRFA ILALPIAVIF VLIYNGLAQA EITAVGLTSQ QITVITILML
TLFPAALSAS VTAWLQGHER LVAAAVVNLL TNIGSAAFRL TALILGFGII GIASGALAGA
LLSALLFWLA MRRFFPEVAW FGPTLPAKPL LKEGYPLLLN SLLMTIFFRF DTILLSAFHG
FVVSATYGVA YKLINFTQIV PPIVVNAIFP TLIRRSGDDR AGMSRAYAGT LRMLLNLAFG
IAVVATIIAV PLTTWLADRP EYLPGSVYAL MITIWYLPGS YLNGLTQYVI IALGKKQAIT
KAFGLTAMVN LGLNIWLIPR YSYFAAAAIT IVSELVLFLP LWLVLRREQI NINLASLFWR
PALAALLAGG IGWLLLSINV YLAGVVTGLI YGAGLWFSGS IGQTERELAA RMFKKLRPQA
SS