Gene Nmul_A2400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2400 
Symbol 
ID3786181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2735853 
End bp2737211 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content49% 
IMG OID637812489 
Productglycosyl transferase, group 1 
Protein accessionYP_413081 
Protein GI82703515 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTAG CATTGTATGT TCACTGTTTC TTTCCGGGAC ATTATCATGG GACGGAAACT 
TATACGTTGG CTCTTGCGGA AAACCTGAAA AAGCTTGGCC ATGAACCCGT GGTAGTGAGC
GCCATATTCG AGGGAGAAAA AAAAGCCAAA AGCCTTATCA CACGTTACGA CTACAACGGC
ATCCCTGTCT ATTGCATAGA CAAGAATCAT ATCCCTCAAA TGAGTTTGAG GGATACGTAT
TATCAACCCG AGCTGCGGCA CATCCACGCA AATCTGCTTC ACGAATTGCA GCCTGATATC
GTTCATGTCA CTCATCTGCT TAACCATACG GCGATCCTGC TGGATGTGAT CAAGGATTTG
GAGATACCCG CTGTCGCAAC TTTTACCGAT TTCTTTGGTT TTTGCATGAA CGTGAAATTG
GAAGGGGCGA ATGGCGACTT GTGCAAAGGC CCTAACTCAG AACGAACGAA CTGCTTCACC
TGTTGCGCCA AGGCCGGCAT CAAACGGGCA TATCCCGCAA TGAGTGAACA ACGCTTCAAT
AAATTGGCTT CCCTGCTCCG GCTAGGCTGC ATTTCATTTA ATGCTGTACA TAGGCTGCCG
GTACTAAGGC GCAGTCAATT ATCCAGCCAG CTAGAGGTGA TAAAGGTGCG CCCTGAGCTC
TTGTCAGAAC GTTACAGTCT CTATCGGGCC GTCATTGCCC CCACGCGATT CCTGCAATCT
GCTTATGAGG CCAACGGGTT TACCTCGGTT CCTATCCACA AAATTCACTT TGGCGTTGAT
CTGGACCGGA AACCAAAGCC GGGGCGGTCG GGGTCAGCGC CTACCCGTTT TGGTTTTATT
GGACAGATTG CGCCGCACAA GGGAACAGCT TTGCTGGTGG AAGCCTTCTG CCGGTTACCG
GCAGGTCAAG GCGAACTACA TATTTATGGA TCAGAGAGCC AGCATCCTGC CTATTTCCAG
GCTCTGAAGC AGCATTGCGC CGGTTTCGCG GTCTACTTTC ATGGCACTTT TCCAACTGGC
CAAATAAGAC CTGTTCTGGA TGAAATGGAT TTTTTGGTCA TTCCTTCCAC GTGGTATGAA
AATAGCCCGC TCGTACTGCT CAACGCGCTT GCCAGCCATA CCCCAGTGAT CGTATCCGAC
GTCGAAGGCC TGACGGAGTT TTTGCAACCG GATGTAAACG GCTACAAGTT TGCTCGGGGC
GATGTGGATG ACCTGGAGCG AGTGATGCTC CAGGTCATCA CCAGCAAAGA AAATATGCAC
AGGCTCATCC ATTCCACCAA TTATCCAAAG ACCAGCATGA GCATGACAGA AGAGGTTCTG
GAAGTTTATT CTTCGATCCT AAAAGAGAAG ATTGCATGA
 
Protein sequence
MKVALYVHCF FPGHYHGTET YTLALAENLK KLGHEPVVVS AIFEGEKKAK SLITRYDYNG 
IPVYCIDKNH IPQMSLRDTY YQPELRHIHA NLLHELQPDI VHVTHLLNHT AILLDVIKDL
EIPAVATFTD FFGFCMNVKL EGANGDLCKG PNSERTNCFT CCAKAGIKRA YPAMSEQRFN
KLASLLRLGC ISFNAVHRLP VLRRSQLSSQ LEVIKVRPEL LSERYSLYRA VIAPTRFLQS
AYEANGFTSV PIHKIHFGVD LDRKPKPGRS GSAPTRFGFI GQIAPHKGTA LLVEAFCRLP
AGQGELHIYG SESQHPAYFQ ALKQHCAGFA VYFHGTFPTG QIRPVLDEMD FLVIPSTWYE
NSPLVLLNAL ASHTPVIVSD VEGLTEFLQP DVNGYKFARG DVDDLERVML QVITSKENMH
RLIHSTNYPK TSMSMTEEVL EVYSSILKEK IA