Gene Haur_3580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3580 
Symbol 
ID5735441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4501316 
End bp4502782 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content50% 
IMG OID641280729 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001546344 
Protein GI159900097 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAC AGCCTGAAAC GATTGTCCAG AGTGCGGCTC GTAATACATT ATGGTCGTAT 
GCCGCCACCT ATGGCGGCAA AATTCTGATC TTTATCACCA CGGTGATTCT GGCTTATTTC
GTGACCAAAG CTGAATACGG CTTGGTCGGC ATGGGCTTGA CGGTTATCGC CTTTCTTGAG
GTGATGCAAG ATCTAGGGAT CGGCTCGGCA GTAATTTATC GCGAAGGCGA GGATGCTGCT
AACACCGCTT TTTGGCTTGG CTGTGGAATC GCCTTGACGC TCTTTAGCAT AACTTGGTTG
GCAGCACCCT TAGTTGCCAC CTATTTCTTG CAAAATAATC CTGATGTGAT TCCGATTATT
CGGGCCTTGG GCTTGAGCTT TCCCTTGTTT GCCATGCGCA ATATTCACGA TGCCTTGTTG
CGCAAACGCA TGCAATTCAA AAAACGCTTT ATTCCCGATC TGATTCAGGT CAGCAGCAAA
GGCGCAATTT CGATTGGCTG TGCCTTGGCG GGCATGGGCG CTTGGAGCTT GGTTTGGGGC
CAATTGGGTG GCTCGGCCTG TGGCGTGATT GCCTATTGGA TGGTCAATCC ATGGCGACCT
AACTTCAAGA TTGATCGAGC CGTGATCAAG CCGCTGTTGA GCTTTGGTAG CAATATCGTG
GCAGTCAATG GTTTGGCTGT CATGTTGGCG CAAACCGATT ATCTGCTAGT TGGACGCTAT
CTTGGCGATG CAAACTTAGG TATTTATACC AATGGCTTTC GCTTCCCTGA CCTGATGGTG
ATGCAAGTTT GTATTGCCTT GGCCAATGTG CTGTATCCCT TGTACAGTCG TTTGCGCGAT
GATCCTGCTG GGTTGCAACA GGGCTTTTTT ATTGCCTCGC GGTATGTCGC CTTAGTGACT
GTGCCGCTTT CATTGGGAAT TGCAATTACG GCTGAGCCGT TTGTTTTGAC CTTGCTAAGC
GTAAAATGGA TTGAAACTGT GCCGATTATG CGCACAATTG CCATTTACAC CCTGTTTCTT
TCGCTGGGCT ACAACGTTGG TGATGTCTAC AAGGCGCAAA ATCGCACCAG CATTTTGACC
AAGCTTTCAA TCGCTCGTTT AATTGTGTTA GTTCCGGCGT TATGGTTTGC CGCTGCCCAG
TTCAAATCGT TGCAGGCAGT CGGCATTACC CATGCAATTG TGGCATTTTT GGGCAGCACA
CTTAATTTGG TGGTTGCCGC CAAAGTGCTG AACATGCCGA TTCGCCGAAT TTTGAGCGCC
TTCCAACCAG CCATAATTGC TGGCACGGTG ATGAGTACTT GTTTGATTTT GACCCTGCTG
GCCTTGCAAA ACACTCCTGC ATGGCTGCAA CTCTTGGCGG CAGTCGTGGT GGGCATCGGC
TCCTATGCTG GAACTTTGTG GTTTGGTCAG CGCGAAGTTC TGACCCAAGC AGGCCAAACC
TTACGTGGTG CGGTGGCACG GAGATAA
 
Protein sequence
MAKQPETIVQ SAARNTLWSY AATYGGKILI FITTVILAYF VTKAEYGLVG MGLTVIAFLE 
VMQDLGIGSA VIYREGEDAA NTAFWLGCGI ALTLFSITWL AAPLVATYFL QNNPDVIPII
RALGLSFPLF AMRNIHDALL RKRMQFKKRF IPDLIQVSSK GAISIGCALA GMGAWSLVWG
QLGGSACGVI AYWMVNPWRP NFKIDRAVIK PLLSFGSNIV AVNGLAVMLA QTDYLLVGRY
LGDANLGIYT NGFRFPDLMV MQVCIALANV LYPLYSRLRD DPAGLQQGFF IASRYVALVT
VPLSLGIAIT AEPFVLTLLS VKWIETVPIM RTIAIYTLFL SLGYNVGDVY KAQNRTSILT
KLSIARLIVL VPALWFAAAQ FKSLQAVGIT HAIVAFLGST LNLVVAAKVL NMPIRRILSA
FQPAIIAGTV MSTCLILTLL ALQNTPAWLQ LLAAVVVGIG SYAGTLWFGQ REVLTQAGQT
LRGAVARR