Gene Haur_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3356 
Symbol 
ID5735226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4233826 
End bp4235133 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content52% 
IMG OID641280503 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_001546120 
Protein GI159899873 
COG category[S] Function unknown 
COG ID[COG2327] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTT TATTATTGAA TGCCCATTCG CCGCAAAATG CTGGCGATTT GGCCTTACTT 
GAGCAATCGT TGGCCCATTT ACGAGCCGCG TTTCCCCATG CCGATTTGAG CTATGTGATC
AATCAGCCTG ATCCGCCGCA ATGGCTGCCT GCTGATGTTC CCTATATTTT GTCGATCCAT
GAGCATAAAA CCAATTTGAT CAAAGATGCA CCCAAGCCAC GGCGTAAATG GCTGATGCTG
GGGCTGGCAA TCTGGTTGGT TGGCATCAGC CTGATCTATC GCTGGACAGG CATCAAACTC
AAGCCAGCCA AAACCAGCGA TTGGCGTGAA TTACTTGAGC ATTATTTTGA GGCCGATGTT
ACGGCAGCGA TTGGCGGCGG CTATTTGTAT GCGACCAAAG CCTTCGACCT GAACTACATT
TGGGTGTGGT TGGGCGTGGC CTTGCCCGTG CTCATGGGCA AGCCGCTAGT AATGTTGCCG
CAATCGTTTG GGCCGGTCAC TGGCAAAATC AACCAACTGC TGTTGGCATG GTTGGTCAAT
CGCTCAGTCC AAGCCTATGC CCGCGAAGAA CGTTCGCAGC ACTATCTCTA TAGCATCGGG
GTTAATCAAT CAGTTCAGGT TGTGCCTGAT GTCGCCTTCA ACTGCCCAAC CGTCGAGCCA
GCCCAAGCCG AGCACTATTT GGCCCGTTGG TGGCAACCAG CCAAACGCCC TGCTTTAGTC
GTTGGCATCA CGGCGATGGA TTTTGGCATT CAGCACCCAG GCTCGGGGTT TAGCCATGGC
CAACGTTATG AGCAAGCCTT GCTCGATTTG ATTCAACATA TCGGCCAGTG TGATGATGTG
CATATCTTGC TGTTTGCCCA ATGCGTTGGG GCCAGCGAGG CTGAGGATGA TCGCGTGGTG
GCGCGGCGCT TAATCGCTCA ACTGCCAGCC ACTACGCCAA TCACTTTTAT TGATGATCGC
CTGCAACCAG CCATGCTCAA ACAACTCTAC AGCCAGATTG ATCTACTGAT TGCAACGCGT
TTGCATTCGG CAATTTTCGC CTTGAGTACG GCCACACCCA GCTTTGTGAT TGGCTATTTG
CACAAATCGG CAGGCGTGAT GCACATGCTT GGCTTGGCTG ATTACCAAAT TCCGATTGAA
TCGATCGATA GCAGCAACAT TATTAGTGCC TTTGAACAAA CCCTCGCTGC CCGTGGCTCG
ATTAAAACCA TGATGCGCAG CAGCATTCCC GCCTTGCAAA GCCAACTTGA ACGTTTGCCC
AAGCAGATTC GGGCGGCGGT TGGCGATTGG TTACGAGGAG CAAAGTAG
 
Protein sequence
MKVLLLNAHS PQNAGDLALL EQSLAHLRAA FPHADLSYVI NQPDPPQWLP ADVPYILSIH 
EHKTNLIKDA PKPRRKWLML GLAIWLVGIS LIYRWTGIKL KPAKTSDWRE LLEHYFEADV
TAAIGGGYLY ATKAFDLNYI WVWLGVALPV LMGKPLVMLP QSFGPVTGKI NQLLLAWLVN
RSVQAYAREE RSQHYLYSIG VNQSVQVVPD VAFNCPTVEP AQAEHYLARW WQPAKRPALV
VGITAMDFGI QHPGSGFSHG QRYEQALLDL IQHIGQCDDV HILLFAQCVG ASEAEDDRVV
ARRLIAQLPA TTPITFIDDR LQPAMLKQLY SQIDLLIATR LHSAIFALST ATPSFVIGYL
HKSAGVMHML GLADYQIPIE SIDSSNIISA FEQTLAARGS IKTMMRSSIP ALQSQLERLP
KQIRAAVGDW LRGAK