Gene Nmul_A0253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0253 
Symbol 
ID3785739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp273034 
End bp274110 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content49% 
IMG OID637810328 
Productglycosyl transferase, group 1 
Protein accessionYP_410953 
Protein GI82701387 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGGAA CGGACTCAAC TGGCAGGGGA GGTATAGCAT CTGTCGTTAC CTTACTGCAG 
GAAGAGGGCT TTCTCGATCA GCAAAATGTC AAATACATTA CTTCACACCG GGAGGGAACA
CACTTTAAAA AGTTAGCAAT CATGTTTTCC GCTACTGGCA AAGTGCTGTG GTATTGCATG
TTTGCCAAAC CAGCCATCGT TCACGTCCAC TCGGCCTCAG GTGCAAGTTT TATCAGAAAA
TCGATTTTTC TTGCTGTCGC CAGATTGTTC CGCTGCCAGA CAGTTTTTCA TTTGCACGGA
GGTCGGTTTC CTCATTTTGC TTCAGAGGAA TCGGGAGTAC TGCTGAAATG GTGGATTCGG
CGGACTCTTG AGAGGAGTTC CACAGTTATT GCATTATCAG AAAGCTGGGC AGCTTTTCTC
TCAACCTGCG CCCCTGCGGC CGCTATCCAG ATTGTTCCCA ATTCTGTCAG GCTTGCTAAG
ATATCTTCAA AACAGAGGGG GGAGGCCGGG CGGATACTGT TCTTGGGGCA CGTGGGAAAG
GGGAAAGGAA TATTCGAACT GTTAAAGGCT TTATCCCTAC TGAAGGATTC ACTGCCGTAT
ATCAGGTTAG TTGTTTGCGG AGATGGATGC CTTGATTCTG TGCAAAAGAT GGCAGATGAA
CTGGGCATTG CCTCCAATGT GGAGTTTCGT GGCTGGGTGG ATGCAAGTCA AAAAGCGGAA
GAACTCGCTC GTGCATCGGT TTTCGTGCTT CCCTCTCATG ATGAAGGTCT GCCAATGGCT
ATGCTTGAAG CAATGGCGGC TGAACGGGCG ATTATTGTAA CTCCCGTGGG GGGGATACCA
GAAGTGATCA GGGATAGAGA AAATGGCTTA CTCGTTCCGC CCCGAGATGC CGATGCTCTG
GCACAGGCTT TAAAGGAGGT ACTGGAAAAC CCTCTTCTCC GCCAGATGCT GGCAGAAAAT
GCGCTTAGAA CGATTGAAAG CCGGTTTAGC ACTCCTGTTA TCCTGGGTCA ACTTTCTCTA
CTTTATGAGC GGTTACGAGG AGGCAGCAGG GGGGAAGTAG TTGCATTCAT AAAATGA
 
Protein sequence
MLGTDSTGRG GIASVVTLLQ EEGFLDQQNV KYITSHREGT HFKKLAIMFS ATGKVLWYCM 
FAKPAIVHVH SASGASFIRK SIFLAVARLF RCQTVFHLHG GRFPHFASEE SGVLLKWWIR
RTLERSSTVI ALSESWAAFL STCAPAAAIQ IVPNSVRLAK ISSKQRGEAG RILFLGHVGK
GKGIFELLKA LSLLKDSLPY IRLVVCGDGC LDSVQKMADE LGIASNVEFR GWVDASQKAE
ELARASVFVL PSHDEGLPMA MLEAMAAERA IIVTPVGGIP EVIRDRENGL LVPPRDADAL
AQALKEVLEN PLLRQMLAEN ALRTIESRFS TPVILGQLSL LYERLRGGSR GEVVAFIK