Gene Nmul_A0271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0271 
Symbol 
ID3785190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp292552 
End bp293604 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content53% 
IMG OID637810347 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_410971 
Protein GI82701405 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCTT ACCCAAAATG GCTGCGGCAT TTGCAGGACC ATTCCTACTG CTGGCTGGTC 
ACGGGGGTTG CGGGCTTCAT CGGATCCAAT CTGCTGGAGG CCTTGCTCAA GCATAATCAG
AAAGTAGTCG GATTGGATAA TTTTTCCACC GGCTATTTGC GAAACCTCGA GCAGATACGG
GACTTGGTCG GGGAGAAGGC TTGGGGTAAT TTCAGCTTTA TAGAGGGTGA TATCTGTCAA
CTGGAAACTT GCACGAACGC ATGCCAAGGT GTGGATTTCG TTTTGCACCA GGCCGCGCTG
GGTTCCGTGC CGCGTTCCAT TCAAGATCCC ATCCGAACCA ACGAAGCCAA TATTTCCGGA
TTTCTCAACA TGCTGGTGGC GTCAAGGGAT GCGCAAGTGA GGCGCTTCAT CTACGCTGCC
TCCAGTTCCA CCTACGGCGA CCACCCGGAT TTGCCAAAAG TGGAAGCGGT AATTGGACGT
CCTCTTTCGC CCTATGCTGT CACCAAATAT GTGAACGAAC TCTACGCCGA AGTGTTCGCG
CGTTGCTATG GACTGGACTC CATTGGGCTG CGCTATTTCA ACGTATTCGG TCCAAGGCAG
GATCCCAATG GTGCTTATGC CGCAGTTATT CCCCAATGGG TTTCGGCACT GATCAGAAAC
CAGACGCTGT ATATCAATGG GGATGGGGAA ACCAGCCGGG ATTTCTGTTA TATCGACAAC
GTAGTGCAAG CCAATCTTCT CGCGGCTCTC AGTGATAACA CCGGAGCAGT GAACCAGATT
TACAATGTGG CAGTAAATGA ACGCACCAGT CTGAATCAAC TGTATGGCAT GATGCGCGAG
CTGTTACTGG AGAAGTTTCC GGAGCTGGAG AATCATCGGC CCACGTACGT CGATTTTCGC
AAGGGCGATG TGCGGCATTC ACAGGCGGAT ATTACGAAAG CCACCCAATT ACTTGGTTTT
GAACCCTCCC ACCGCATCGG GGAAGGACTG AGGCAGGCAA TGGGCTGGTA CATCGCGCAT
TTGGGGGCTA TGCAGGAGGC GGCTGGCGTC TAA
 
Protein sequence
MAAYPKWLRH LQDHSYCWLV TGVAGFIGSN LLEALLKHNQ KVVGLDNFST GYLRNLEQIR 
DLVGEKAWGN FSFIEGDICQ LETCTNACQG VDFVLHQAAL GSVPRSIQDP IRTNEANISG
FLNMLVASRD AQVRRFIYAA SSSTYGDHPD LPKVEAVIGR PLSPYAVTKY VNELYAEVFA
RCYGLDSIGL RYFNVFGPRQ DPNGAYAAVI PQWVSALIRN QTLYINGDGE TSRDFCYIDN
VVQANLLAAL SDNTGAVNQI YNVAVNERTS LNQLYGMMRE LLLEKFPELE NHRPTYVDFR
KGDVRHSQAD ITKATQLLGF EPSHRIGEGL RQAMGWYIAH LGAMQEAAGV