Gene Haur_2957 details


Gene Information

Locus tag: Haur_2957
Symbol:
ID: 5734829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3731113
End bp: 3732420
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 53%
IMG OID: 641280101
Product: glycosyl transferase group 1
Protein accession: YP_001545723
Protein GI: 159899476
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
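
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop
    codon (if present) does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(3731113, 3732420)
print(length)                     # 1308
print(protein_length_aa(length))  # 435
```

Both results agree with the Gene Length and Protein Length fields reported for Haur_2957.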


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGATTT TACACGTTAT TCAGCGCTAT TACCCGTATA TCGGCGGCTC AGAGCAGGTT 
TGCCAAGTGA TTGCTGAACG GTTGGCAGCC GAAGGGCATA TGGTCGATGT TTGGACAAGC
AATGCATGGG ATCTTGACCA TTTTTGGGCT AGCGGGCGAC GCACCATCGA TCAGCCAACT
GAGCAACACA ATGGCGTAAC GATTCGGCGC TTTCCGGTCG TGCGTGCACC TGGCCCAGCC
TTGGTCTACC CAATTTTGCG ACGGGCAATG CTCGAATTTG GGCGTTTGCC AGCCACAGGC
AGCTTGTTGA TGGCGCTCAG TCAAATTACG CCGCGCATGC CAACCTTCAA TCGTGCCCTA
AGCCAAGAAG TTGGGAAATT TGATCTGATC CATGCAGCCA ACATTACCTT AGATTTTATG
CTGATTCCGG CGCTAAAATA TGCCAAGCAA GCTAAAATTC CGTTTGTACT TTCGCCATAT
GTCCATTTAG GCGTGCCTGG CGATCGCTCG TTAGTGCGCT ATTACACGAT GCCGCATCAT
ATTAAGTTAA TGCAGCAAGC TGATCGAGTG ATCGTACAAA CGCCGCTTGA GGCCGATTAT
CTGGCTGATT GTGGCATTTC GCGCTCAACC CTACGTTGCA TTGGGGTGGG AGTTGAGCCA
CATGAATTGG CAGGCGGCGA TGCTGAGCGT TTTCGCCAAG AAACTGGCAT TCAACAACCG
TTTGTGCTGT ATATCGGCAC CTTGGCCAAA GAAAAAGGCG CGTTCGACCT GATTCGTGCG
ATGGAGCAGC TTTGGGCAAG CGGGCGTAGC GAGCATCTGG TGATGGTTGG CACGCCGATG
GCCCATTTTG AGCAGCTTTG GGAAACGCTT GATCCTGTCA GCAAGCAGCG GATTCATGTG
TTTGCGCGAG CGCCGCAAGC ACGCAAACGC GATGCCTTGG CCGCTGCCAC GCTGTTCGCC
ATGCCCTCAC GCACCGATTC ATTTGGAATT GTCTATCTTG AGGCTTGGTT GTATCGCTTG
CCCGTGATTG GGGCCAGAGC TGGTGGAGTG CCAGCGGTTA TTCGCGAGAA CGAAACAGGC
TTGTTGGTTG ATTATGGCAA TGTTGCTCAA CTTATAGCAG CCTTGACCAA GTTGCTGACC
AACCCTGATT TAGCCCAACA ACTTGGCCAG CAGGGCTATA TGCGAACCTT GGCCGAGCTA
ACGTGGGAGC GCAAATATGC CCAAATTCGG GCGGTGTACG CAGAACTTGT AGAAACTGGG
GATCGGCTGT CGAGGGCTGG GGATCGAGAG TCAGAGATTA GACTATAA
 
Protein sequence
MRILHVIQRY YPYIGGSEQV CQVIAERLAA EGHMVDVWTS NAWDLDHFWA SGRRTIDQPT 
EQHNGVTIRR FPVVRAPGPA LVYPILRRAM LEFGRLPATG SLLMALSQIT PRMPTFNRAL
SQEVGKFDLI HAANITLDFM LIPALKYAKQ AKIPFVLSPY VHLGVPGDRS LVRYYTMPHH
IKLMQQADRV IVQTPLEADY LADCGISRST LRCIGVGVEP HELAGGDAER FRQETGIQQP
FVLYIGTLAK EKGAFDLIRA MEQLWASGRS EHLVMVGTPM AHFEQLWETL DPVSKQRIHV
FARAPQARKR DALAAATLFA MPSRTDSFGI VYLEAWLYRL PVIGARAGGV PAVIRENETG
LLVDYGNVAQ LIAALTKLLT NPDLAQQLGQ QGYMRTLAEL TWERKYAQIR AVYAELVETG
DRLSRAGDRE SEIRL
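
The sequences above are printed in blocks of 10 characters, 60 per line. The following is a rough sketch of how that layout might be stripped and the GC percentage of a nucleotide sequence computed. The helper names are hypothetical, and the demonstration uses a short made-up string rather than the gene itself.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same blocked layout (not taken from the record).
toy = clean_sequence("ATGC GGCC\nAATT")
print(len(toy))         # 12
print(gc_percent(toy))  # 50.0
```

Applying `clean_sequence` to the gene and protein sequences above and taking `len` of the result offers a quick check against the reported Gene Length and Protein Length fields.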