Gene Haur_4385 details


Gene Information

Locus tag: Haur_4385
Symbol:
ID: 5736235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5601904
End bp: 5603040
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 47%
IMG OID: 641281547
Product: glycosyl transferase group 1
Protein accession: YP_001547145
Protein GI: 159900898
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
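
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


gene_len = feature_length(5601904, 5603040)
print(gene_len)                  # 1137
print(protein_length(gene_len))  # 378
```

Both values match the Gene Length and Protein Length fields of this record.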


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.610092
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGGATTG GGATTAGTGG CACATTTTGG GCTGAACCAA TGGTGGGCAG CGGTCAGTAT 
TTACATCATT TAATTGAACA TTTGCCGCAT GTTGGTCCTC AGCACGAATA TGTGTTGTTT
TTGCCAGCCT ATACCCAAGC CGAAATTCCT CAAATTCCGC ATATTGCGGT TGATCGTGTG
CCAACGCCGT TTGATAAACT ACACCCTAAA TTGGCCAAGC TTTGGTATGA GCAAATTGAG
TTGCCCCGCG CCGCCTTGCG CTTAGCCGTT GATCTGTTGC ATGTGCCCTA TTATGCGCCG
CCACGTCGCC AATTAGTGCC GACTGTGGTA ACAGTCCACG ACATTATTCC ATTAATTTTG
CCTGAATACC GTGGCTCGTT GGCGATGCGA GCCTATACTG CCCTGGCAAC GAGTGCTGTG
CGCCGTTGTC GTCAATTGGT GGCAGTCTCC GACCATACCC GCGATGATAT TATTGATGTA
TTGAATATTA ATGCATTACA CGTGCATACA ATTTACGAAG GCGTTGCACC TGATTATCAA
CCGCAGACAG ATGAACAGAT TAGCCAAACC TTGCAACGTT TTAATCTTAA TCAGCCTTAT
TTTTATTATA TCGGCGGCTT TGATGTGCGC AAAAATCTCA CGACATTGCT GCGGGCATTT
GGGCGGGTGC GTCGCCGAAT TGAGCAACCA ATTAAATTAG TGATTGCTGG CAGCCGTCCT
AAGGCTAATT CGCCATTTTT TCCTGCTTTA GAAACCACAA TTCTTGATGA AGATTTGGCT
GCTGATATTA TTTTTACTGG GCGCGTCACG AATGCTGAAA ACGCCGCGCT ATTTGCTGGA
GCCAGTGCAT TTGTTTGGCC CTCGACCTAT GAAGGTTTTG GCTTGCCCCC ATTAGAAGCG
ATGAGTTGTG GTACGCCCGT GATTTCTTCG AATACCAGCA GCATGCCCGA AATTGTCGGC
GAGGCTGGTA TTTTGCTGCC GCCACACGAT ACCGAGGCTT GGGCGATGGC AATGTTGCGC
ATGTTGAATG ATGCTGAATT AAATAACGAA TATCGCCAAC GTGGTTTACA ACGAGCCAGC
CAATTTAATT GGCAACACTT TACCGCCCAG ATGCTTAAGG TTTATGAGAA AGCCTAG
 
Protein sequence
MRIGISGTFW AEPMVGSGQY LHHLIEHLPH VGPQHEYVLF LPAYTQAEIP QIPHIAVDRV 
PTPFDKLHPK LAKLWYEQIE LPRAALRLAV DLLHVPYYAP PRRQLVPTVV TVHDIIPLIL
PEYRGSLAMR AYTALATSAV RRCRQLVAVS DHTRDDIIDV LNINALHVHT IYEGVAPDYQ
PQTDEQISQT LQRFNLNQPY FYYIGGFDVR KNLTTLLRAF GRVRRRIEQP IKLVIAGSRP
KANSPFFPAL ETTILDEDLA ADIIFTGRVT NAENAALFAG ASAFVWPSTY EGFGLPPLEA
MSCGTPVISS NTSSMPEIVG EAGILLPPHD TEAWAMAMLR MLNDAELNNE YRQRGLQRAS
QFNWQHFTAQ MLKVYEKA
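
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes NCBI translation table 11, which uses the standard codon assignments but accepts alternative start codons such as GTG, read as methionine at the first position. The helper names `translate_cds` and `gc_percent` are hypothetical, and the code is an illustration rather than the method the database used to produce the record.

```python
from itertools import product

# Standard codon assignments in NCBI order (T, C, A, G); table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons of translation table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def _clean(seq: str) -> str:
    """Remove the whitespace used to group bases into blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = _clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = _clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGGATTG GGATTAGTGG CACATTTTGG"))  # MRIGISGTFW
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` is one way to cross-check the Protein Length and GC content fields.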