Gene Haur_4878 details

Gene Information

Locus tag: Haur_4878
Symbol:
ID: 5736955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6213291
End bp: 6214400
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 52%
IMG OID: 641282044
Product: glycosyl transferase family protein
Protein accession: YP_001547636
Protein GI: 159901389
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase
TIGRFAM ID:
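
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 6213291
end_bp = 6214400

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the final codon is assumed to be the stop codon
# and is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```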


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.777733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCTGC GATTTGCGCT GGTTTTAGCC ATTGGCACCA GCATCACTTT TATTTTGACT 
CCCTTAATTC GGGCTTGGTG TATTCGCAAG GGTTGGTATG ATCTGCCTGA GGCGCGGCGT
GTGCATCAGA TTCCGACTCC ACGACTGGGC GGGGCGGCCA TATTTGCCGG CTTTATGGCG
GCGTTGGCGG CGGCAGTCGT CGTGCCTTGG GGTGTGCCGC AAATGCAGCG CTTCCCGATT
GAAAGCTTTC GCTTGGGCTT GCTGGCTGCG GGTGCAACCC TGATGTGGGT CGTGATGACC
ATCGACGATC TCAAAAAACT TTCGGCTCGT TTCCGCTTGA TCATTCAAAT CCTAGCGGCA
TTGATTGCGG TTGGTCCCTA TTTATGGGAA TGGACACTGC ATCCAGCGGT TAATGGGATT
GATGTGGGGG CGCGAGGGAT TATTGCAACG GCCTTTAACA CGCCGTTTAT GCAAGTGAAT
TTTCATGAAA TATGGCCGCC CTTGGCAATC GGCTTCACAA TTTTTTGGAT TGTAGGTATG
ACCAATGCGC TCAACTGGAT CGATGGCTTA GATGGTTTGG CGGCGGGCGT GACGTTTATT
GCGGCGATTG TGCTCGCGAT TCATACCTAC TCGCTGGGCC AATATTCCTT GGTACTTGTG
CCCTTGGCTT TGGCTGGAGC CTGCTTGGGC TTCTTACCGC ATAATTTCCA CCCGGCCAAA
ATTTTTATGG GCGATGGTGG CGCGATGGTA ATTGGCTATA CTTTGGCGAT TTGCTCGATC
ATCGGTGGAG CCAAGCTTGC CACAGCCTTG TTGGTGTTGG GCGTACCCTT GCTCGATGGC
GTGTGGATGA TCATCTGGCG GCGAGTGCGC GGAGCAGGGG CTAGCGTCTC AGATCGCGGC
CATTTGCACC ATCGTTTGCT TGATTTAGGC CTCTCGCAGC GCCAAGTTGT GGCGTTTTAC
TACACAGTCA GCAGCTTATT TGGTAGCTTG GGCTTGTTAT TACCCGATAG CTGGTGGAAA
TTGGGAGCTT TGGCGATTTT GACAGGCTTG ATGATTGGGC TGCTTTTCTA TTTGGCCCGC
AAACAACCCC AAACCCAAAA TTCACATTAA
 
Protein sequence
MILRFALVLA IGTSITFILT PLIRAWCIRK GWYDLPEARR VHQIPTPRLG GAAIFAGFMA 
ALAAAVVVPW GVPQMQRFPI ESFRLGLLAA GATLMWVVMT IDDLKKLSAR FRLIIQILAA
LIAVGPYLWE WTLHPAVNGI DVGARGIIAT AFNTPFMQVN FHEIWPPLAI GFTIFWIVGM
TNALNWIDGL DGLAAGVTFI AAIVLAIHTY SLGQYSLVLV PLALAGACLG FLPHNFHPAK
IFMGDGGAMV IGYTLAICSI IGGAKLATAL LVLGVPLLDG VWMIIWRRVR GAGASVSDRG
HLHHRLLDLG LSQRQVVAFY YTVSSLFGSL GLLLPDSWWK LGALAILTGL MIGLLFYLAR
KQPQTQNSH
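
The sequences above are printed in blocks of ten characters. As a rough sketch of how they might be processed, the Python below strips the block spacing, computes a GC fraction, and translates DNA to protein. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool. The codon table is the standard amino acid assignment also used by translation table 11. The sketch does not handle table 11's alternative start codons, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# Codon -> amino acid map in TCAG order (assignments shared by
# translation tables 1 and 11); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGATTCTGC GATTTGCGCT GGTTTTAGCC ATTGGCACCA GCATCACTTT TATTTTGACT"
print(translate(clean(first_line)))  # MILRFALVLAIGTSITFILT
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` allows the GC content and protein sequence fields to be compared against the record.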