Gene Haur_4465 details

Gene Information

Locus tag: Haur_4465
Symbol:
ID: 5736316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5709685
End bp: 5711124
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 53%
IMG OID: 641281628
Product: O-antigen polymerase
Protein accession: YP_001547225
Protein GI: 159900978
COG category:
COG ID:
TIGRFAM ID:
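
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5709685
END_BP = 5711124
GENE_LENGTH_BP = 1440
PROTEIN_LENGTH_AA = 479


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks print True for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 5711124 - 5709685 + 1 = 1440 bp, and 1440 / 3 - 1 = 479 aa.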


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000989096
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGCAA CGTTTGATCA GCGGCGGCGG CGTGAATTTG GCTTGATTGT TGGCGGCACA 
TTAGTTGGCA TGGGCTTAGG CGCTGCTGCC GCGTTTGTGC CGAGCTTTTT GGTCGTCGCT
GGTTTGGTGG CGTTGTTGGT CGGGGCATGG TTTGCCCGTT CGATCCACTC GATGCTGACG
GCCACCGTGT TGGTGGCAAC GCTCTTACCA TTTGGCACCT TGCCTTTCAA AGTTGGGCTA
ACTCCAGCCT TGCTCGAACT AGCATTATTG GCGCTGTATG CCATGTTGGT GGTGCGCAGC
TTGGCCGACC CTGAGCGTAC TTGGCGTTGG GGCAGCCTAG CCCCATGGGT GATTTTGCTG
CTAGCTAGTT CGTTTTTCTC CTTCATCATT GGCTCGAATG GCTCGCCCGA TAGTTTGCTG
CTGCATAATT ATTTCAAATT GCTGCTAGGG ATTTGCTTGT TTTTGGGGGT GCAAAATGCG
CTTGATTCAC TAGAACAGGC GCGTTGGTGG CTGCGCTTGC TGATTTTGGC TGGCTGGGCG
GCGGCATTGT TAGGCATTGG TTTGCGCTTT GCCCCCGACG CTATGGCTTT GCGCTTTTTG
ACCGCACTTG CTCCGCTGGG TTATCCGGCG AGTGGTCGGG TGTTACGCTA TGTTGAAGAT
GACCCCAGCG GCTTTGAGCG AGCAATTGGT ACTTCGGTTG ATCCCAATGG CTTTGGTGGG
ATGATGGCCT TGCTAGGAGC GATTGCGCTT GGTCAAGCCT TGGCCCAGCG CCCAGTCTTA
GGCCGTAAAT GGCTATGGCT GATCACCGCT AGTTTTGCTT TGGCTGTATT TTTGACATCC
TCACGGGCTG CCTTGGGTGG CTTTATGATC GCCGGCTTAT TTTTGGCAAC CGTGCGCTAT
CGCCAATTGT GGTGGCTGAT TGGCGCTGGC GGTCTTGCTG GCGCAATCGC GATTGTGGGC
TTGGGCAAGG GTGGCGATTT TGTCGAGCGG ATCGTCGAAG GCATTCAATT CAAAGATCAA
GCCAACCAAA TGCGCTTGGC TGAGTTTCGC AATGCAATCG CGATTATACG CGAGTATCCG
GTGTTTGGGG TGGGTTTTGG TCGCGCACCC AACATTGATC TCACAACTGG TGTGAGTAGT
GTCTATTTGG CGCTTGGCTC GCGTATGGGT TTGGTTGGCT TAGGCCTCTA TATTTTAACT
GCACTGGCCT TTTTGGTGCT TACCACTCAG GCTGCACGCC GCTGTGAACG CTCGGTAAGC
GATGCAATTA TTGGTTTGCA GGCAGCAATT TTGGCGGCGC TAGCAGTTGG TTTGCTCGAT
CATTATTTCT TCAATATTGA GTTTCCGCAT ATGGGGACGC TATTTTGGGG GGTGGTTGGC
TTGGCGATGG TGTTTATGCG CGAGGTAAAG AATGATCAGC TAACTTCATC ATTGAAATAA
 
Protein sequence
MFATFDQRRR REFGLIVGGT LVGMGLGAAA AFVPSFLVVA GLVALLVGAW FARSIHSMLT 
ATVLVATLLP FGTLPFKVGL TPALLELALL ALYAMLVVRS LADPERTWRW GSLAPWVILL
LASSFFSFII GSNGSPDSLL LHNYFKLLLG ICLFLGVQNA LDSLEQARWW LRLLILAGWA
AALLGIGLRF APDAMALRFL TALAPLGYPA SGRVLRYVED DPSGFERAIG TSVDPNGFGG
MMALLGAIAL GQALAQRPVL GRKWLWLITA SFALAVFLTS SRAALGGFMI AGLFLATVRY
RQLWWLIGAG GLAGAIAIVG LGKGGDFVER IVEGIQFKDQ ANQMRLAEFR NAIAIIREYP
VFGVGFGRAP NIDLTTGVSS VYLALGSRMG LVGLGLYILT ALAFLVLTTQ AARRCERSVS
DAIIGLQAAI LAALAVGLLD HYFFNIEFPH MGTLFWGVVG LAMVFMREVK NDQLTSSLK
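
The sequences above are laid out in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence for comparison with the protein sequence. It is not the database's own method. The helper names are hypothetical, and the codon table is written out by hand: its amino acid assignments are those of the standard code, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino acid assignments are the same in translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = "ATGTTTGCAA CGTTTGATCA GCGGCGGCGG CGTGAATTTG GCTTGATTGT TGGCGGCACA"
    print(translate(clean_sequence(first_line)))  # MFATFDQRRRREFGLIVGGT
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.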