Gene Haur_3355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3355 
Symbol 
ID5735225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4232558 
End bp4233829 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content49% 
IMG OID641280502 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001546119 
Protein GI159899872 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAGG AATCTACGCC AACGCAAGCC AGTGTGGCCG AAGCCCCTGC CCCTGCCAAA 
TTGCCACGCG CTCGGCTTTT TCAGATCAGC TTTGGGGTTG GCGGGGTCAT GCTGGCGCGG
GTGTTGGGGA TTTTCACCCA AATTATTTTG GCACGACGCT TGGGTGCTGA GCAATATGGC
TTGTTTAACT CGCTGTATGC CTTGCTCAAC CCACTGATTA TTTTAGCCAA CCTAGGCTTG
GATAATTGGC TATTGCGTCA AAGCACGGAT CGTACAAAGC TCAACAAATC GGTAAGTCGG
GTGTTTGCCT TACGGGGGTT TGTGCTGGCA GGCCTGTTAT TGGCCGCTGC ACCATTTGTC
AGTATGCGCA GCCCTGAATT TACCCCGCAC CTCATTGCCT TGGCTTGTGC CGGAATTATT
GGCGATTTAT TGGTAGGCAC CGCTGATACC GCCTTGCGAG CCTCAATTCA AGTGCTCAAA
GCCTCAAGTT TGCAAATTGT GGTTGCGCTA AGTTTGTTTG TGCCGATTTG GTTTACCCCC
AACTTGCCAT TTTCCACCGT GGTGATCTAT CGTTGTATCG CGGTAGCCTT GGGCTTTGGC
CTGAGTGTCT ACTACTTGCA TGGCATTTTG CGCTGGGTTT GGCAACCACG CCAATGGATC
AATACAATTA GCGAGGCGCG TTTCTACTTT ATCTCCGAGG TGATGGCCTA TTTGACCTTG
CGGGTTGATC TGTTTTTGGT GGCTTTGCTG CTACCAACGA TCAATTCAGG GATTTATGCG
CCACCACTAG CAATTATCAA TAATACCTTT TTAGTGCCCA GTATTACCGC CCAAATCTTG
CTACCGATGA TCGTCAAGCA TCCGCCGCGC TCACCACTCT ATCGCCGCGT GATTAGCTTG
GCAATTGGCT TGAGCATTCT CTACGGCTTG GCATGGTTGG CGCTGCTCAC CTGGCATTCC
GATTGGATTA TCAATCTGAT GTATGGGCCA CAATATGCCG ATGCCATCCC GTTGTTGCAA
ATCATGGCCA TCATTCCATT GCTCAAAAGC CTGAATTTCT GCTGGGCGAC CTTTATGGTT
TCCAACGATC AGCAGCCGTT ACGAGTTAAA TTGCAAACCA TCGGGGCCAC AATTAGCATT
CTCGGCAATT TTGTGGTGAT TCCGTTGTAT GGTCTGCATG GCGTAGCGAT TATTAACATG
ATCACCGAAA TTGCCCTGTT TATCTCGTAT GGTTATGGAG CTTGGCTGAC CTATAAGAAA
GTGATGCGAT GA
 
Protein sequence
MIKESTPTQA SVAEAPAPAK LPRARLFQIS FGVGGVMLAR VLGIFTQIIL ARRLGAEQYG 
LFNSLYALLN PLIILANLGL DNWLLRQSTD RTKLNKSVSR VFALRGFVLA GLLLAAAPFV
SMRSPEFTPH LIALACAGII GDLLVGTADT ALRASIQVLK ASSLQIVVAL SLFVPIWFTP
NLPFSTVVIY RCIAVALGFG LSVYYLHGIL RWVWQPRQWI NTISEARFYF ISEVMAYLTL
RVDLFLVALL LPTINSGIYA PPLAIINNTF LVPSITAQIL LPMIVKHPPR SPLYRRVISL
AIGLSILYGL AWLALLTWHS DWIINLMYGP QYADAIPLLQ IMAIIPLLKS LNFCWATFMV
SNDQQPLRVK LQTIGATISI LGNFVVIPLY GLHGVAIINM ITEIALFISY GYGAWLTYKK
VMR