Gene Nmul_A1544 details

Gene Information

Locus tag: Nmul_A1544
Symbol:
ID: 3785617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1765715
End bp: 1766878
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 48%
IMG OID: 637811632
Product: acyltransferase 3
Protein accession: YP_412239
Protein GI: 82702673
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3594] Fucose 4-O-acetylase and related acetyltransferases
TIGRFAM ID:
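
The coordinate and length fields above are mutually consistent, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1765715, 1766878)
print(length)                     # 1164
print(protein_length_aa(length))  # 387
```

Both results match the Gene Length and Protein Length fields of this record.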


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTACC CCGCCCTGCC GGAACTATTT CCCACCGATG CACATTATTC CAGACGGCTA 
TTTGTTATGA CGACCCGCAA TAGCACAATC GACGTGGCAA AAGGAATCGG CATATTTTTT
GTCGTACTCG GACATAATTG GCTGTCAACC CATGAGAAAA ACGAACTCCA TATAGTTATT
TTCTCATTTC ACATGCCTTT GTTTTTCTTC CTGGCAGGAA TATTCTTGAG AGCACCGGAC
GGAATCCTGC GTTTCGCAAT AGGCCGGGCA GGATCCTTAC TGAAACCCTA TTTCGTTGTC
CTGACAGCCC TGGGTGTACT CAAGATGCTG AGGGCCGAAC TAGGTGGAGG CGGCGAAGCT
GGGATGAGCG GCATCAGTTA TTTTATAGGC CTGCTTTATG GGACCGGGGA TACGATCGAG
TGGATTGCCC TATGGTTTTT ACCGCATCTT TTTATTTCGT TGATCGCATC CCTCATCATT
TTGAGGACAA TTGAAGCCTG CACGGACAAC AAGGTATGGA TAGTGTCAGT TGCTCTTCTG
CTCTTAGGGG TGGGCATAAG TTCTATCGAT GCCTATCATC ACCCTACGGC AATAGCCGCC
AGCCTTATGG GACCAGGACG ATTCCTGGGA CTTCCCTGGG GCGCCGATCT TATTCCGATA
ACATCTTCCT TCATTATCTT CGGATATCTG CTCGCCGAGC CCGCGAAATC GATGAAATTC
AGCTTGCCCG GCTTATTTAT ATCTGCTGGG GTGTTTGTTG CTCTGCACTT TTATTTTGAT
GACACCATTG ATCTTAATGA AAGGGTATAC GATAGTGCGA TTGTATCGAC CATGGAGGCG
GCGACAGGAA TATATATAAC GTTCAGTATC GCTTCCTTAC TGCAAAATTT TTCATTCTTC
AGAAAACCGC TGGCATATCT GGGATCGGGG ACACTTTTTA TCCTGATCTT TCATGGCTTC
CTGCAAACCC GGGCGTTTGT CGCGCTGCGC CATATCAGTC CTTATATGTA TCTAAACAGC
ATTGTGAGTC TTGCATGGAG TATCGGGATG TCTTTGCTCC TGTGGGAGAT GGCGAAGCGC
CAGCGATGGT TGTCAAAGTT GCTGTTACCA CAAAAACCGC GAAAGGCAAT TGTTCACGAC
GAATTGGGCG GGAGTGCCGG CTAA
 
Protein sequence
MRYPALPELF PTDAHYSRRL FVMTTRNSTI DVAKGIGIFF VVLGHNWLST HEKNELHIVI 
FSFHMPLFFF LAGIFLRAPD GILRFAIGRA GSLLKPYFVV LTALGVLKML RAELGGGGEA
GMSGISYFIG LLYGTGDTIE WIALWFLPHL FISLIASLII LRTIEACTDN KVWIVSVALL
LLGVGISSID AYHHPTAIAA SLMGPGRFLG LPWGADLIPI TSSFIIFGYL LAEPAKSMKF
SLPGLFISAG VFVALHFYFD DTIDLNERVY DSAIVSTMEA ATGIYITFSI ASLLQNFSFF
RKPLAYLGSG TLFILIFHGF LQTRAFVALR HISPYMYLNS IVSLAWSIGM SLLLWEMAKR
QRWLSKLLLP QKPRKAIVHD ELGGSAG
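
To recompute the GC content field from the gene sequence, the 10-base blocks first have to be joined into one string. The following is a minimal Python sketch under that assumption; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used as sample input.
sample = "ATGCGTTACC CCGCCCTGCC"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))
```

Passing the complete gene sequence text instead of `sample` gives the value to compare against the reported GC content.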