Gene Haur_1212 details

Gene Information

Locus tag: Haur_1212
Symbol:
ID: 5733105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1396331
End bp: 1397686
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 52%
IMG OID: 641278352
Product: glycosyl transferase group 1
Protein accession: YP_001543988
Protein GI: 159897741
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
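
The coordinate and length fields in the record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions, not something the record states, and the function name is illustrative.

```python
# Values copied from the record above.
START_BP = 1396331
END_BP = 1397686


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


gene_length = span_length(START_BP, END_BP)

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the result agrees with the reported gene length of 1356 bp and protein length of 451 aa.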


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.360145
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCGTG TGGCTTTTTG CACTCCAGTC AATCCAGTTG AATCGGGTAT TTCGGATTAT 
AGCGAGGAAT TATTGCCCTA TTTGGGGCAG TATGTTGATC TGACGTTGGT GGTTGATGCT
GAGGTTCAGC CCACTAATCA ACAATTGCTC GCCAAACTGC CGATTATCCG CATTGGTGAT
TTAGCCAAGC AGCATGCACG GCAACCTTTC GATGCAATTA TCTATCATAT GGGCAATAGC
CCCGCCCACA GTCGTTTTTG GCAAAGTTTG CAAAGCTTGC CGGGAATTGT GGTGCTGCAC
GATTATGTGC TACATCATTT AATGCTGTGG CATGCCGCCA ATCGCTTGAA AAATGTGGCG
AGCTACCGCC AATTGATGCA GCACTATTAT GCTGAACAAG GCTCAAGCAT TGCCCAACGT
ATGGAGCGTG GCCAGCTTGG CGATGCAGTG TTCGATTTTC CACTTTCTGA GCCAGTGATT
GCCCAAGCCA GCAGCCTAAT TGCTCATAGC CAATATGTGC TTGAGCGGGT GCAGCCACAG
CGCCCCAACT TGGCGACTTC GCTAGTGCCA ATGGGTGTGC CCTTACTACC AGCCCCTGAT
CGTTTGGCGG CTCGGCAAGC GCTCCAATTG CCCGCTGAAA TTCCGATTTG GGCTAGTTTT
GGTCATATCA ATCCCTACAA ACGGATTGAA CAGGCGCTGC AAGCTTTTGC CCAATTTCGC
CGTACTTATC CCGATGCACG GTATATATTG GTTGGCAGCG TCTCGCCAAG CTACGATCTC
AAGGCCTTGC TCCAACGCTT GCAGCTTGGC GAGAGCGTCC AAGTCACGGG CTATGTTGAT
CATGCTGATT TTAATCGCTA TGTTGCTGCC GCTGATCTGT GTTTCAATGG GCGCTACCCT
TCGGCTGGCG AGACTTCGGC CAGTTTGTTG CGCTTGTTGG GTGCTGGCCG CGCAGTTTTA
GTCAGCGATA TTGCTACCTT CAGCGAATTG CCCGCCGATG TGGTGGCTCA TGTGCCCGTT
GATCGCGATG AAGTTGCTTT AATTGCGGCC TATGCTCAGC GTTTATGGGC CGATGTGGCG
CTACGCGAAG CCATGGAAAC CAATGCCCGC CGCTATGTGA CTGAAAAACA TAGTTTGCCC
TTGGCCGCGC GAGGCTATGC CGATCATCTG AGCCGCGTGC AGGGCTGGCC GCGTTTGGAG
CCACAACGTG AGCCATTGTG GGATATTAAT GCTGTCACCA TCCAGCATTC AATCGCCCAA
ACGATTGGCC GTAAAGCCGC TCAACTTGGC TTAGTTGATG ACGATGCGCC GTTGCTTGAT
CGTTTGGCTG CACGGCTACG AAATTTATTG ACATAA
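
The gene sequence is printed in blocks of ten bases, so it has to be joined before any calculation. A minimal sketch of one way to do that and to compute a GC percentage comparable to the reported GC content field follows. The helper names are hypothetical, and whether the database rounds the percentage the same way is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the first two blocks of the gene sequence above; paste the
# full listing into clean_sequence() to check the whole gene.
first_blocks = clean_sequence("ATGCAGCGTG TGGCTTTTTG")
print(len(first_blocks), gc_percent(first_blocks))
```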
 
Protein sequence
MQRVAFCTPV NPVESGISDY SEELLPYLGQ YVDLTLVVDA EVQPTNQQLL AKLPIIRIGD 
LAKQHARQPF DAIIYHMGNS PAHSRFWQSL QSLPGIVVLH DYVLHHLMLW HAANRLKNVA
SYRQLMQHYY AEQGSSIAQR MERGQLGDAV FDFPLSEPVI AQASSLIAHS QYVLERVQPQ
RPNLATSLVP MGVPLLPAPD RLAARQALQL PAEIPIWASF GHINPYKRIE QALQAFAQFR
RTYPDARYIL VGSVSPSYDL KALLQRLQLG ESVQVTGYVD HADFNRYVAA ADLCFNGRYP
SAGETSASLL RLLGAGRAVL VSDIATFSEL PADVVAHVPV DRDEVALIAA YAQRLWADVA
LREAMETNAR RYVTEKHSLP LAARGYADHL SRVQGWPRLE PQREPLWDIN AVTIQHSIAQ
TIGRKAAQLG LVDDDAPLLD RLAARLRNLL T
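
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares; table 11 differs in its set of permitted start codons, which does not matter here because this gene begins with ATG. The function name is illustrative, and the code is a plain stand-in for what a library such as Biopython would do.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
print(translate("ATGCAGCGTG TGGCTTTTTG CACTCCAGTC"))
print(translate("CGTTTGGCTG CACGGCTACG AAATTTATTG ACATAA"))
```

The two fragments translate to MQRVAFCTPV and RLAARLRNLLT, matching the first block and the last eleven residues of the listed protein.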