Gene Nmul_A0432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0432 
Symbol 
ID3785900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp479234 
End bp480280 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content57% 
IMG OID637810508 
Productglycosyl transferase family protein 
Protein accessionYP_411132 
Protein GI82701566 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGCGA TGCCCCCCAT CGATGTGGTG ATTCCCGTCT ATAATGCACC AGCGTTGACA 
CGGCGCTGCA TCGATTCGGT CGTTGCCTGC CTGAGCCCAT CCATACGCTT CATTTTCATT
CAGGACGACG CGTCGGGAAT GGAAACCCAT GCCATGCTGG ATCAATTGCC GCGCGGGCGC
ATACGCGTGC ACCACGCGCG GGAGAACCAG GGTTTCGGTG CCTCGGTAAA CGAAGCAATC
AGTCGATCCG ATGCATGCTA CGTGCTGGTC CTGAACTCCG ACACAGTAGT GGGCGAAGAT
TTTTTACCGC TCTTATACGC GACACTCGTC GCCGATTCCC GGCTGGCAGT CATCATCCCC
GCAGGGAATG ATTTTGCCGG ATATGATTTG AACCGGTATG TGCGGCAGCC GGGCGGCTAT
GTTCAGACAC ACCGCCTTCG GGGTCACGCG TTTCTCATCC GCCGGGAGGT ATTCCGGGAT
GCGAACGGTT TCGATCTGGC CTTCGGCCGC GGCTACTATG AAGATGTCGA TCTCGGGCGC
CGCCTCAACA AACGTGGTTG GCGGGTGGGC GTGCATCCGG ATGCTCACAT ACAACATGAG
GGGGGCGGCT CGTTCGGGCG GGGCCGCTCC TTCAAGAAAC TGGTCAGGCG CAATCGTAAG
CTTTTTTTTT CGCGCCATCC CAGCGCAAAG CGTAATATTC TCCTTCTTTC CGGAGACTGC
CCCCTAGCGT ATTTTCCATC GAGCCTGCTA GAGGCGCTCG ACGAGGTATT TTGCGGAGGA
GGATACGTTT ACTGGCTTTC GCCCGAAGCG GCAGGCATGC TTCTCTGTTT GCAGATGAGG
AGCCTCTTGT CAGGCGTGGA CGCTGCCGTG CGGTTATTCT TGTGCAGTTG GCGTGAGGAC
AAGCGCATTT CGGAAATCTG GATATTGCCC GATGTTCCAC GCGTGCGGTA TGAGGCACTG
ACCTTATGGG CGCGTATTTG CGGCTTGCGG ATACTGACCT GGGAAAGGGT GCCGACCGAA
GAAAATTGTA CTCCTAGGCT ACTATGA
 
Protein sequence
MPAMPPIDVV IPVYNAPALT RRCIDSVVAC LSPSIRFIFI QDDASGMETH AMLDQLPRGR 
IRVHHARENQ GFGASVNEAI SRSDACYVLV LNSDTVVGED FLPLLYATLV ADSRLAVIIP
AGNDFAGYDL NRYVRQPGGY VQTHRLRGHA FLIRREVFRD ANGFDLAFGR GYYEDVDLGR
RLNKRGWRVG VHPDAHIQHE GGGSFGRGRS FKKLVRRNRK LFFSRHPSAK RNILLLSGDC
PLAYFPSSLL EALDEVFCGG GYVYWLSPEA AGMLLCLQMR SLLSGVDAAV RLFLCSWRED
KRISEIWILP DVPRVRYEAL TLWARICGLR ILTWERVPTE ENCTPRLL