Gene Haur_2689 details


Gene Information

Locus tag: Haur_2689
Symbol:
ID: 5734570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3446848
End bp: 3447843
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 55%
IMG OID: 641279832
Product: glycosyl transferase family protein
Protein accession: YP_001545455
Protein GI: 159899208
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
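
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but it is the one that reproduces the listed 996 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(3446848, 3447843)
print(length)                     # 996
print(protein_length_aa(length))  # 331
```

Both values agree with the Gene Length and Protein Length fields of this record.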


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0468436
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTGTT CTATCATTAT TTTAAATTGG AATGGCCGAG CACTGCTCGC TGATTGCCTC 
AACGCCTTAT TGCCCCAATG CGATGCTTCA ATCGAAGTGT TGGTGGTGGA TAATGGCTCA
CATGATGGCT CGGCGGCGTG GCTGCATCAA CATTATCCCC AAGTGCGCTT GTTGGCGCTC
ACCAACAATC GTGGATTTAG CGGTGGGGTC AATGTTGGCT TGCATGTGGC GCGTGGCGAT
GTGCTGTTGT TGTTGAATAA TGATGCAATC GTTGAGCCAA ATTTTATCTC GGCGATTCTC
GCGCCGTTTC AGCACCAACC AACGCTTGCT GCTAGCGCTG GTCTGATGAC GTTCGCGCAT
CGACCTGAAA TCATCGCCTC AGCCGGAATT CAGCTCTATC GCGATGGCGT GGCAACTGAT
GCTGGGTTGT TGCAGCCAGT TGCTCAATTA GCCAGCCAAC CAAGCCCAAT TTGGGGTGGC
AGCGGTGGAG CGGTAGCCTA TCGACGTGCC GCTTTAGCCG ATGTCGGCAT GTTCGATGAA
GGCTATTTTG CCTATTTGGA AGATGTCGAT TTGGCTTGGC GTTTGCAGTT GCGCGAATGG
CAAACCGTGC TAGCCCCGCA GGCCGTCGCT CGGCATATCT ACTCGGCCAC TGGCGGCGAA
GGTTCGCCCT TTAAGGATTG GCTGATTGCG CGTAATCGCT GGCGGGTAAT TTTGCGTTGC
TGGCCAACGC CGTTGTTGGC CCGCGCTCTC CCATTAATGC TAGCTTACGA TGGTTTGGCT
TGTGCACAGG CTATCGTGCG CCGCCGCTGG ACAACGGTCA GCGGGCGTTT GCATGCCTTG
CGCCAACTGC CCCAACTACG CCAACAGCGC CAAGCAATTC AAGCTCGCCG CACGGCTAGC
ATCGCTGAGC TTGATCATTG GATTAAACCA GCCCGTTCAC CACTGGCAAT TTGGCGTGAA
AATCAAGCAC TCGGCCAATT GATCGCTCAA CGCTAG
 
Protein sequence
MNCSIIILNW NGRALLADCL NALLPQCDAS IEVLVVDNGS HDGSAAWLHQ HYPQVRLLAL 
TNNRGFSGGV NVGLHVARGD VLLLLNNDAI VEPNFISAIL APFQHQPTLA ASAGLMTFAH
RPEIIASAGI QLYRDGVATD AGLLQPVAQL ASQPSPIWGG SGGAVAYRRA ALADVGMFDE
GYFAYLEDVD LAWRLQLREW QTVLAPQAVA RHIYSATGGE GSPFKDWLIA RNRWRVILRC
WPTPLLARAL PLMLAYDGLA CAQAIVRRRW TTVSGRLHAL RQLPQLRQQR QAIQARRTAS
IAELDHWIKP ARSPLAIWRE NQALGQLIAQ R
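
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the gene sequence so it can be compared with the protein sequence. It rests on a few assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the input contains only A, C, G and T, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs from the standard table in its permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAATTGTT CTATCATTAT TTTAAATTGG AATGGCCGAG CACTGCTCGC TGATTGCCTC"
print(len(clean(first_line)))  # 60
print(translate(first_line))   # MNCSIIILNWNGRALLADCL
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.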