Gene Haur_4273 details

Gene Information

Locus tag: Haur_4273
Symbol:
ID: 5736132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5455175
End bp: 5456398
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 53%
IMG OID: 641281433
Product: glycosyl transferase group 1
Protein accession: YP_001547033
Protein GI: 159900786
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
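
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the last codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5455175
end_bp = 5456398

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)       # 1224, matching "Gene Length"
print(gene_length % 3)   # 0, the length is a whole number of codons
print(protein_length)    # 407, matching "Protein Length"
```

Under these assumptions the listed coordinates, gene length, and protein length agree with each other.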


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATTT TGATTGTTGG GCTAGGCGGG GTAACCGCAA CCTTTCGCAA CTGGCCTGAG 
CGAATTGTGG CGCTCGGTTT GGCTCAACGC GGTCATGCCG TTCGCGCGAT TGGAACCCAC
GATCCCAAAC GACCTGCCTT AGCTGCCCGT CATGAAATCA TCGAGGGCGT AACAGTGCAA
CGCGTGCATT CGGGCTACGC GCCAAATCGT GAGTTGCAAC AAGCCTTGGA GCATGGTCCA
AGGCCAGATT TGATTCATTT TATGCATCCG CGCAATGTGC TGGCCGCCCA AACGAGTGCC
TGGGCCAAAC AGCACAAGAT TCCCACGGTT TATACATGGT TGGGGCCGTA TCACGATGCC
TATTTGGTTG ATGATCGTGA GCGTCCATTT GAAACAACCA TCCACTATCA GCGGCCAATT
TGGACGAAAC AACAATTTTG GCAACGCCTC AAATCGGCCC GTTCGTGGCG CACAATCCGC
GACCATCTAC GCAATTGGCG CTTGCATCGC CCATTGTGGG AAGCCGATCA GCTAATTCCA
TGCTCGCAAT TCGAGGCCGA TGAACTTAAA CGCATGGGAT TGCAGCAAGA ATCAAGTGTG
ATTCCCTTGT GGATTGACGA TTCGGCGATT CAGACTACCC CGGTTGTTTT ACCTGATTTG
AAGGTAAGCC GCCCATGGAT TTTATTTGTT GGGCAATTGA CTCCGCGCAA GGGCTACGAT
TTGGCCTTGC GGGCCATGCC AGCAATTTTG CAGCAATATC CCAATGCCAA TTTATTGATG
GTATCGGGGA TTAACCACGC TGAACGGGCC GAAGTTGACC GAATCGCCCA AGAACTGAAT
ATTCAACCAC AGATTCATTT TTTGGGGCGG GTTGATGATG CAACCTTGGT CAACCTTTTT
CGCCATTGCG ATGTCTATCT CACGCCGACC CGTTATGAGG GCTTTGGCTT GACCTTGCTC
GAAGCCATGG CGGCGGGTGC GCCCTTGGTT GCCAGCGATA TTCCGGTGGT TAACGAAATT
GTGCGCCACG GCGAAAATGG CTTGCTAGCA CCCTACAACA ATCCCGAAGC CTTGGCCGCA
GCCGCCAATT TGATTCTTGG GCAGCCGCGT TTAGCTGCCA AACTCCGCAG CGGTGGGCAG
CAAACATGCG AGGTATGGTA TAATCCGGCC CGATGGACGA CCGCCTTAGA GCAGGTCTAT
ACGCGAGTAA TCAATGAGCA CTAA
 
Protein sequence
MQILIVGLGG VTATFRNWPE RIVALGLAQR GHAVRAIGTH DPKRPALAAR HEIIEGVTVQ 
RVHSGYAPNR ELQQALEHGP RPDLIHFMHP RNVLAAQTSA WAKQHKIPTV YTWLGPYHDA
YLVDDRERPF ETTIHYQRPI WTKQQFWQRL KSARSWRTIR DHLRNWRLHR PLWEADQLIP
CSQFEADELK RMGLQQESSV IPLWIDDSAI QTTPVVLPDL KVSRPWILFV GQLTPRKGYD
LALRAMPAIL QQYPNANLLM VSGINHAERA EVDRIAQELN IQPQIHFLGR VDDATLVNLF
RHCDVYLTPT RYEGFGLTLL EAMAAGAPLV ASDIPVVNEI VRHGENGLLA PYNNPEALAA
AANLILGQPR LAAKLRSGGQ QTCEVWYNPA RWTTALEQVY TRVINEH
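
The gene and protein sequences above can be related to each other with a short script. The following is a minimal sketch, not the method used to produce this record: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the GC calculation (G plus C over total length) is an assumption, since the record does not state how its GC figure was derived. Translation table 11 assigns the same amino acid to each codon as the standard code (it differs only in which codons may act as start codons), so the standard codon table is used here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G). Table 11 uses the
# same codon-to-amino-acid assignments; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)

def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence, copied with their original grouping.
first_blocks = "ATGCAAATTT TGATTGTTGG GCTAGGCGGG"
print(translate(first_blocks))  # MQILIVGLGG, the start of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared against the nucleotide data.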