Gene Haur_4314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4314 
Symbol 
ID5736173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5507373 
End bp5508578 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content51% 
IMG OID641281474 
Productglycosyl transferase family protein 
Protein accessionYP_001547074 
Protein GI159900827 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGATTG ATGCTGCAAT TATTGTTGTA ACCTACAACC ACGCCCGCTA TATTGGCGAT 
TGCCTCAGCT CGTTGCTGGC GCTTGATCCT GCGCCCAGCG AATTGGTGGT AGTTGACAAT
GCCTCGCGTG ATGCAACGGC CAAGATCGTC AAAAATCAAT TTCCTCAAGT GCGCTTTGTG
CAAACCGGAG CCAATTTGGG CTTTGCAGGT GGTTGCAATC AAGGCGCTCG ATTAACCTCG
GCTGAATATA TTGTGCTGGC TAACCCCGAT TTGATTGTAC AGCCCGATTG GCTTGAGCAG
CTTATTGCGC CGTTGGAGCG TTGGCCGAGC GTTGGTGCGG TTGGCGGCAA ATTGTTATAT
CGCGATGGCA CGACCATTCA ACATGCAGGC GGAGTTTTGC GCTTGCCATG GGGTTTAGGC
CATCATCGTG GGGTTGGCGA GCATGATCAT GGCCAATACA ACGCACTTGA AACTGTCGAT
TATGTAACTG GGGCGGCCTT TGCTTGTCGG CGGAGCACTT GGGATGTGCT CGATGGCCTT
GATGAGCAAT TTTACCCGGC CTATTACGAA GAAGTTGATT TTTGTACTCG CCTACGGCGT
GCTAGCCTTG ATGTGCTATA CACGCCCCGC GCAGTCGCAA CCCATATCGA AGGCTCCAGT
GTGGGCCATC GCAGCGCCGT TTATTTGCAA CTGTACCATT TTAATCGGCT ACGCTACCTT
TTTAAATACT TCAACAATAC CTGGCTGATG CGTACATGGC TGCCTGCCGA GATGGGCCAT
ATTCGGGCTT GTGCCAGCGA TGACGAGATT CAAGCGCTGA AAATCGCCTA CTTGGCCTAT
CAATCGGCCT TTTTAAACCA TGAATCCCAG CCCGTCATCA GCGAACTGGA TATTTTTCCC
GATGAGACCG CTGATGGTGG CGAGACCGAA TTACAATGGA TTGAACGCCA ACTTCGTGCT
AAAGTGAAGG TCGAACCAGC GCCGATTCCA GCACGGCAAC GTTGGTTAGG GGCGATTCGC
AATGGCCTGC TACGCTTGGC CACCCGTGAT TTTATTGTGC CGATAGTACA AGCACAAAAT
GATACTAACG CCGCCTTGTT AGAATCGATT CAAGCATTGA GCCGCCAACG CCGTGCAGCC
GATGCAACCA TCCTACTACA AGGCATGTTA TTGGCCAAAA GTTTGGATCA ACAACCAAAG
GCCTAA
 
Protein sequence
MTIDAAIIVV TYNHARYIGD CLSSLLALDP APSELVVVDN ASRDATAKIV KNQFPQVRFV 
QTGANLGFAG GCNQGARLTS AEYIVLANPD LIVQPDWLEQ LIAPLERWPS VGAVGGKLLY
RDGTTIQHAG GVLRLPWGLG HHRGVGEHDH GQYNALETVD YVTGAAFACR RSTWDVLDGL
DEQFYPAYYE EVDFCTRLRR ASLDVLYTPR AVATHIEGSS VGHRSAVYLQ LYHFNRLRYL
FKYFNNTWLM RTWLPAEMGH IRACASDDEI QALKIAYLAY QSAFLNHESQ PVISELDIFP
DETADGGETE LQWIERQLRA KVKVEPAPIP ARQRWLGAIR NGLLRLATRD FIVPIVQAQN
DTNAALLESI QALSRQRRAA DATILLQGML LAKSLDQQPK A