Gene Haur_4880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4880 
Symbol 
ID5736957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6216253 
End bp6217434 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content55% 
IMG OID641282046 
Productglycosyl transferase group 1 
Protein accessionYP_001547638 
Protein GI159901391 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATTG TTGCGCTGGC ACCGTTTGGC TTGCGCCCCA AAGCCACATT AAGCCGTCGG 
GCCTTGCCGA TGCTCCAAGC TGCCGCCACG CGTGGTGATG ATGTGCATGT GCTGGCTCCA
AGCGATCTGT ACCCCGCTGA TGCTGGCTCA ACCAGCGTGA TCAAGCAGAT TACGGTTGAG
CATGGCCCAG CTTTTGGTCA AGGCAGTGCG GCGATGCTGC GCTCGGTGAG TTGGATGCTC
AAGCGCTGTT TGGCGCTTCA GCCCGATCTG GTTCACCTGT TTAAACCCAA AGGCTATGGT
GGCATGGCCT TGCCCTTGAT TCGTCGCTTA CGCCCTAAAT TACCCATTTT TGTGGATACC
GATGATTGGG AAGGCACTGG CGGCTGGAAT GATCGGCTCG ATTATCCGCG CCATATCAAA
ATGATGATCG ATTGGCAAGA ACGTAATCTA CCTAAATTGG CCGATTGCGT CACGGTGGCC
TCGCAAACCT TGGCCAACCA AGTGATTTTG TTTGGCTTGC CGACCAACAA GCTGCTTTAT
CTACCGAATG GAGTTGATTT GCCGCGTCGC CAATTGCCTG AGCGCAGCCT CGCCCGTGCC
CAACTTGGCC TCACCCAAGA CCCAATTATT TTGCTCTACA GTCGTTTTTG GGAATTTCCG
GTGAGCGATG TTGTGGCGAT GATGGTTGGC GTGTTGGCCC AGATTCCTAC GGCGAAGCTC
TTGGTGATTG GGGCTGGCGA GCATGGCGAG GAGCAGCAAT TAACACTGTT GGCTCAGCGG
GCAGGCATCA ACCAGGCGCT GGATAACCGT GGTTGGAGCG AGCAAAGTAC GATCGATGCG
GCGCTAGCTG CTGCCGATAT AGCGCTCTAT CCAATGGACG ATACCCTGCT AAATCGCGCC
AAGTGTTCGG CCAAACTAAC CGAACAAATG CAAGCTGGCC TACCGATTGT GGCCGCAGCG
GTTGGCCAGG TAGCCGAATA TCTCGACCAA ACCAGCGCTG TGTTGGTTGA ACCAAGCAAT
AGTGGAGCTT TGGCACAGGC CGTGATTCAG CTCTTGCAGC AGCCACAACA ACGCCAAAGT
TTAGGGCGAG TTGCTCAGGC TCGCATCGAA AAACTCTTTA ATTGGCCTGC GCAAAGCCAA
ACCCTACTCC AACGCTACGA TGCCTGCTAC AAGGCACAAT AA
 
Protein sequence
MRIVALAPFG LRPKATLSRR ALPMLQAAAT RGDDVHVLAP SDLYPADAGS TSVIKQITVE 
HGPAFGQGSA AMLRSVSWML KRCLALQPDL VHLFKPKGYG GMALPLIRRL RPKLPIFVDT
DDWEGTGGWN DRLDYPRHIK MMIDWQERNL PKLADCVTVA SQTLANQVIL FGLPTNKLLY
LPNGVDLPRR QLPERSLARA QLGLTQDPII LLYSRFWEFP VSDVVAMMVG VLAQIPTAKL
LVIGAGEHGE EQQLTLLAQR AGINQALDNR GWSEQSTIDA ALAAADIALY PMDDTLLNRA
KCSAKLTEQM QAGLPIVAAA VGQVAEYLDQ TSAVLVEPSN SGALAQAVIQ LLQQPQQRQS
LGRVAQARIE KLFNWPAQSQ TLLQRYDACY KAQ