Gene Haur_3231 details


Gene Information

Locus tag: Haur_3231
Symbol:
ID: 5735099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4091173
End bp: 4092123
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 53%
IMG OID: 641280377
Product: glycosyl transferase family protein
Protein accession: YP_001545996
Protein GI: 159899749
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
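
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes (this is not stated in the record) that the start and end coordinates are 1-based and inclusive, and that the final codon of the gene is a stop codon that does not contribute an amino acid. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4091173
end_bp = 4092123
gene_length_bp = 951
protein_length_aa = 316


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under those assumptions, 4092123 - 4091173 + 1 gives 951 bp, and 951 / 3 - 1 gives 316 aa, which agrees with the listed gene and protein lengths.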


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000094491
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTATC TATCGTTAAT TTGTACAGTT AAAAACGAGG CTGATAATAT CGCCGATTTG 
CTAGATTCGA TGTTGGCACA AAGCCGCCAA CCTGATGAAA TTGTGGTCAA TGATTGTGGC
TCAACCGACT CAACCGCCGC GATTGTCCAA ACCTATATTG AGCGTGGTGC ACCAATTCGC
TTGGTCCATG GTGGTTTTAA CATCTCTTCT GGTCGGAATA ACGCCATTGT GCATGCCCAA
GGCGACTTAA TTGCCTCGAC CGATGCTGGC TTAGCGCTCG ATCGCACATG GCTCGAACGG
ATTATCGCGC CACTCGAAGC AGATCAGGCC GATTTGGTGG CTGGCTTTTA TCAAGCAGCG
CCGCGCAGCG ATCTGGAAAC CGCAATTGGT TCGACCAACT ATCCGCTGGC TGAAGAAGTT
GATCCAAGCC GATTTTTGGC GGCTGGGCAA TCGGTGGCCT TTCGCAAAGT TGTGTGGCAA
ACCGTGGGTG GTTACCCCGA ATGGCTCGAC CATTGCGAAG ATTTGGTGTT TGATCGGGCA
GCAGTAGCGG CGGGCTTTCG CAGCACAGCG GTGCTCGATG CAGTTGTGCA TTTTCAGCCG
CGCTCCAGTT TTCGTGCCCT CTTTCGCCAA TATTTCTTCT ATGCACGGGG CGATGGGGTT
GCCAACCTTT GGCCGTTACG CCATGCGATT CGCTATGCCA CCTACCTCGG CCTACTCCTT
TTGATCCGCA ACCTGCCCCA ACGCCCATGG CTTCTCGGTG TTTTAGGTTT GGGTATTGCT
GGCTACACTC GCAAACCCTA TCGACGGTTG TGGCGAGCAA CCAAAGGCTG GTCATTCACT
CGCCGCAGCA AAACCTTGGG TTTACCGCCA TTAATTCGCA TGGTTGGCGA TCTTGCCAAA
ATGCTCGGCT ACCCGGTTGG CTGGCTGGTA CGTCTGCGCA AACGTCGATA A
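
The gene sequence above is laid out in blocks of 10 bases, so it needs to be joined before any calculation. The following is a minimal sketch of how a GC percentage, such as the GC content field of this record, might be computed from such text. The helper names are hypothetical, and the database may round or count differently.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCTATC TATCGTTAAT TTGTACAGTT AAAAACGAGG CTGATAATAT CGCCGATTTG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full 951-base sequence to `gc_percent` instead of `first_line` gives a value to compare against the listed GC content.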
 
Protein sequence
MTYLSLICTV KNEADNIADL LDSMLAQSRQ PDEIVVNDCG STDSTAAIVQ TYIERGAPIR 
LVHGGFNISS GRNNAIVHAQ GDLIASTDAG LALDRTWLER IIAPLEADQA DLVAGFYQAA
PRSDLETAIG STNYPLAEEV DPSRFLAAGQ SVAFRKVVWQ TVGGYPEWLD HCEDLVFDRA
AVAAGFRSTA VLDAVVHFQP RSSFRALFRQ YFFYARGDGV ANLWPLRHAI RYATYLGLLL
LIRNLPQRPW LLGVLGLGIA GYTRKPYRRL WRATKGWSFT RRSKTLGLPP LIRMVGDLAK
MLGYPVGWLV RLRKRR
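
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI amino-acid string, whose codon assignments are shared by translation table 11. It assumes the gene is given in its coding orientation and starts with ATG, as it does here. Table 11 also allows alternative start codons, which this simplified version does not handle. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a whitespace-formatted DNA string up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        # Unknown codons (for example those containing N) become X.
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGACCTATC TATCGTTAAT TTGTACAGTT"))  # MTYLSLICTV
```

The printed residues match the first block of the protein sequence above. Translating the complete gene sequence and comparing it with the protein sequence, after removing whitespace, extends the same check to the whole record.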