Gene Haur_3898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3898 
Symbol 
ID5735759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4889680 
End bp4890981 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content53% 
IMG OID641281049 
Productglycosyl transferase group 1 
Protein accessionYP_001546660 
Protein GI159900413 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATAA CCCCAGTCGC CTCAATCGAT CACCAATTAC GTGTCGCCAT GCTTTGTCGG 
GCGGTTTTTC CCTTGCATGG CTTTGGCGGC ATCGAGCGCC ATGTCTTTCA TTTGGTGACC
CATCTCAGCG ATTTGGGAGT TAAGCTCGAT CTTTGGACGC AGACAATTCC CCACGATGTG
CCAACTGCTG GCGAGGCCTA TGCTCGTTTG TGTCAAAACC CGTTGATTGA ACTGCATGAA
ACCCGTTATG ATCGCACCAG CCCATGGTTG CGACCGAATA GCATTATCGG GCGACAGTTT
AATTATCCGA TTTTTACTTG GCAACAAGCC AGCGCTGTGG CCAAAACTGC TCAGCAAGGC
CAGATTGATA TTGTGCATAC TCAAGGTTTA TGTGCATTGG GCTGGGGCTT GGTACGTCAA
CAGCAACCCA GTTTGCGGCG GATTCCTCAA TTGGCCAACC CCCATGGCAT GGAAGAATAT
AAGAATGTTG ATTGGCGCAA GCAACTGGCC TATGCCCCGT TTCGTGCCCA ATATTCGTGG
AGTCATCGCC AAGCTGATTG TGCAATTGCC ACCGATGCCT GTACCGCCGA CGATCTACCA
AATTTGTTGG GCGTTGATCC GACGCGGGTT GCGGTGTTGC CCTCGGCGAT TGATGTGGCT
GAGGCCTTGG GCCAGGTTGA TGAGCAATTG GGCAACGAGT TGGTGCAGCG CTTGCAGCTC
GCCGACCACG ATCTGGTTTT TTTGACCGTC AGCCGTTTGG AGCGCAACAA GGGCTATCAT
CTGCTCTTGG CGGCCTTGGC CGAATTGCGC GATCTGCTGC CTGCAAGCTG GCGTTTGTTG
ATGGTTGGCA CTGGCAAAGA GCAAGCAGCG CTTGAACAGC AAGCCCAAAG TCTAGGCTTG
GCGCAACATG TCAGCCTGCT TGGTCGTCTG AGTGATCGTG AATTGCATTC ACTGTATGAA
CATGTTGATT TGTTCATTCA TCCAACCTTG TATGAAGGTT CGTCGTTGGT CACACTCGAA
GCCATGATTC ATCGCTTGCC AGTTGTGGCA ACTGCGGCTG GTGGCATTCC CGATAAAGTT
ATCAGCGGCC ATAATGGCTT GCTTGTGCCA GCCAACAATC AGCGGGCCTT GGTCAATGCG
CTGCGGTTAG CCCTCGATTT GCGCGAATAT TGGCCGCAAT GGGGTGCTGC TGGCGCAGCG
ATTGTACGGC GCAGCTTCGA TTGGCCCGTT GTGGCGCGAC AAACCCTCGC CACCTACCGC
GAACTATTGC AATCTCGCTC TTTGCGTGGA GGATTCCAAT GA
 
Protein sequence
MAITPVASID HQLRVAMLCR AVFPLHGFGG IERHVFHLVT HLSDLGVKLD LWTQTIPHDV 
PTAGEAYARL CQNPLIELHE TRYDRTSPWL RPNSIIGRQF NYPIFTWQQA SAVAKTAQQG
QIDIVHTQGL CALGWGLVRQ QQPSLRRIPQ LANPHGMEEY KNVDWRKQLA YAPFRAQYSW
SHRQADCAIA TDACTADDLP NLLGVDPTRV AVLPSAIDVA EALGQVDEQL GNELVQRLQL
ADHDLVFLTV SRLERNKGYH LLLAALAELR DLLPASWRLL MVGTGKEQAA LEQQAQSLGL
AQHVSLLGRL SDRELHSLYE HVDLFIHPTL YEGSSLVTLE AMIHRLPVVA TAAGGIPDKV
ISGHNGLLVP ANNQRALVNA LRLALDLREY WPQWGAAGAA IVRRSFDWPV VARQTLATYR
ELLQSRSLRG GFQ