Gene Haur_4116 details

Gene Information

Locus tag: Haur_4116
Symbol:
ID: 5735977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5262850
End bp: 5264148
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 48%
IMG OID: 641281270
Product: hexapeptide repeat-containing transferase
Protein accession: YP_001546876
Protein GI: 159900629
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID:
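
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(5262850, 5264148)
print(length)                     # 1299
print(protein_length_aa(length))  # 432
```

Both results agree with the Gene Length (1299 bp) and Protein Length (432 aa) fields.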


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGGA TTGTTCTCCG CGATCCTACG CTTATCGCTC CCTTTGGAGA ACCAGCGCGT 
GATTTGCGGA TTCTCAATAA GCCGCTGTGG CTGCTACACC GTGATCTGCT TGCACGCCAC
TGCCAAAGTG TTGCGGAAGT AGACGATTGG TCTGAAATTT CACCAAGTAG TGATGAACTC
TTGGTGCATA AAGATAATCT CTATTTCAAC CGCGATTTCA TTGAGACGTT TATCGCTGAG
GCACGCGCCA CTGGGCAACC TTGCCAAGTG GCTTTTGCTG CTGATGATGC GATGATTACG
GCCCATGCTT TGCGATTACA AGAGGGTATT CGCAAGCATG GCAACCATTT TATCGCCGAC
CTCTACTATT TCCCTCGCGG CGTTGTCCCG AATCCACAAC CGCTGGTGAT CGATACCAAT
GCCATGGAGA TGGGCTACTA CCATATTCCA AGCTATATGG CGCCCAACCA AGGGGATTTG
GTATTCCAGG TGCCAATTCG TGCGTTTTGT TCAATCGAAA GCTGGGTCCA TATTTTCATG
ACAAACTCTC CGCTCGGGGT GTTTGCATGG GGTCGGAAGC TCGAGCAAGA AGTTGCCGCA
AGTTGGCGTT TGAAGTTGAA GATTGGCTTT CGTTCATTCA TCGAACGTAA GCACTTTCTT
TCATCATCTC CGGTGGTCAA GATTGGCAAG AACTGCTCAA TCGATCCTTC GGCGATTATT
CAAGGGCCAA CTGAGATCGG TAACAACGTG AATATTGGCG CTGGAGTGGT GATTACGAAT
AGCTTGATCG GTAATAACGT GACGATTATG CAAGGCTCTC AAGTGATGCT TAGCGTAGTC
AGTGATCGTT GTTATTTACC ATTCCGGGCT GCTCTGTTCA TGACTGTCTT GATGGAAAAT
TCGATGGTGG CGCAAAATAC CTGTTTGCAG TTATGCGTCG TTGGCCGTAA TACCTTTATC
GGGGCTGGCA ATACCTGTAC CGATTTCGAT CTGCTGGGCA AGCCAATCAA GACGCTCCAT
CGCGGGCGCT TGGAAGAAGT TGGTCTGCCA GTTATTGGCT CGGCAATTGG CCATAATTGT
AAAATTGGCT CAGGCTTTGT CATTTACCCA GCCCGTAATA TTGAATCAGG CACGGTCTTG
ATTTATGGCG ATGACCATTC GGTTATTCCT AAAAATGTTT CGAGTGGTAT TTATACGCGC
CCACCAGTTT TCTATCCCGA CCGCGATCCG CGCGTTCAAC GTGTTCCAGT TAACGATCGC
GTCGCAGATG AGTACCCAGC AGAACAATTC CGTGATTAA
 
Protein sequence
MKRIVLRDPT LIAPFGEPAR DLRILNKPLW LLHRDLLARH CQSVAEVDDW SEISPSSDEL 
LVHKDNLYFN RDFIETFIAE ARATGQPCQV AFAADDAMIT AHALRLQEGI RKHGNHFIAD
LYYFPRGVVP NPQPLVIDTN AMEMGYYHIP SYMAPNQGDL VFQVPIRAFC SIESWVHIFM
TNSPLGVFAW GRKLEQEVAA SWRLKLKIGF RSFIERKHFL SSSPVVKIGK NCSIDPSAII
QGPTEIGNNV NIGAGVVITN SLIGNNVTIM QGSQVMLSVV SDRCYLPFRA ALFMTVLMEN
SMVAQNTCLQ LCVVGRNTFI GAGNTCTDFD LLGKPIKTLH RGRLEEVGLP VIGSAIGHNC
KIGSGFVIYP ARNIESGTVL IYGDDHSVIP KNVSSGIYTR PPVFYPDRDP RVQRVPVNDR
VADEYPAEQF RD
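
The gene and protein sequences can be cross-checked against each other and against the GC content field with a small script. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted, which this sketch does not model). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1 and drop any trailing stop symbol."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAAACGGA TTGTTCTCCG CGATCCTACG"))  # MKRIVLRDPT
```

Applied to the full gene sequence, the output of `translate` can be compared with the listed protein sequence, and `gc_fraction` with the listed GC content.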