Gene Nmul_A0439 details

Gene Information

Locus tag: Nmul_A0439
Symbol:
ID: 3785907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 488291
End bp: 489646
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 53%
IMG OID: 637810515
Product: hypothetical protein
Protein accession: YP_411139
Protein GI: 82701573
COG category:
COG ID:
TIGRFAM ID:
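
The coordinate and length fields in this record are consistent with one another, which a short check can show. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 488291
end_bp = 489646

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon is three bases. Assumption: the last codon is a stop codon,
# so it does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1356 0 451
```

Under these assumptions the computed values match the reported Gene Length (1356 bp) and Protein Length (451 aa).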


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.286489
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGCC TACCGAAACG CAGAAGAGGG ATGATCTTTG CCTCTGCGGT CAGCCTGTTT 
ACAACTCTCG AGAATCCCGA CAGCGCTTAC GCCAATACCG CCTTGGAATG GTTCAACGAC
AATGGCATTA GACTCGGGGG CTGGATCAAC GGTGGGGCGA CATTCAATCC CAGCCAGCTC
ACCGGTTTCA ATGGGCCAGT CACATTTGCC GATCGCTCCA ACAGATTCCA GTTAAACCAG
TTCAATATTT ATGTGCAACG CCCGGTAGTA GCCGAGGGCA GCACCTGGGA TTTCGGGGGG
CGTATCGATT TCATGTTTGG AACGGATGCA ATTTTTACCC AGGCTTATGG CGTTCCCGCG
TTCGACGTGA ACACAGGCCA GCCTTTAAAC AGGAGCAATT GGGATCTTGA TGTGTGTTGC
GCTTCAACCC GATATTATGG CATTGCGTTT CCGCAGGTTT TTGCCGAAGC CTATGTTCCC
GTTGGGAACG GATTGAACGT CAAGGTAGGC CATTTTTACA CTCCAATCGG TTACGAGTCG
GTACCGGCGC CCGACAATTT CTTTTACACT CATGCCTATA CGATGCAGTA TGGAGAGCCG
TTCACGCATA CTGGTGTGCT GGGCAACTAT AAAATCACGC AAAACTGGAC GTTCATGGGG
GGCGTTACCA CAGGTAGTGC CACTGGCGGT TGGGACGGGG GATTCGACAA GCAGTTGGGT
AATTGGGGGG GGATTGCAGG CATTACCTGG ACCAGCGAGG ATATGGGAAC GTCGGCCAAT
ATCAGCGGAA CCTACAGCGC AACCTCGACA CGCAGCGATG AACCGTGGAT GTTGTACAGC
ATCGTGCTGA AGCATAAATT CACCGAAAAG ACCCATTTTG TGCTGCAACA CGACCACGGG
TTCGCGGGAA ATGTTTTACT GAATAATGTC TTTTATAGTA ACGTGATCAA GGATGCCGAA
TGGTACGGCA TCAACACTCA TCTGTATTAC GATCTCATGC CGGAATTGAC GATCGGAGTG
CGGGCCGAGT GGTTCCGCGA CCGGGACGGG TTCCGTGTAT TTTCACCGGG ACGGGTGGCT
GCCGCCACCG ACAACCGGGG ATTCAGTTAC GCGCTAGGCC GCAATCAGCT TGGCAACAGC
ACCAGCAGTC CGGCTGATTA TTATGCAGTC ACGGTAGGCA TGAACTGGAG GGCGGCGAAG
AGGTTGAAGC TCGACTGGAA GCCGTTGCAG CAGCTCAATA TTCGTCCAAA CGTTCGCTAC
GATGCCGCCG ACGGATTACA TGGCATCGAT TATCGGCCCT TCGGGGGGCA TAAAGATCAG
GTGGTTTTAT CCCTTGATTT TATGGTTCCG TTTTGA
 
Protein sequence
MKRLPKRRRG MIFASAVSLF TTLENPDSAY ANTALEWFND NGIRLGGWIN GGATFNPSQL 
TGFNGPVTFA DRSNRFQLNQ FNIYVQRPVV AEGSTWDFGG RIDFMFGTDA IFTQAYGVPA
FDVNTGQPLN RSNWDLDVCC ASTRYYGIAF PQVFAEAYVP VGNGLNVKVG HFYTPIGYES
VPAPDNFFYT HAYTMQYGEP FTHTGVLGNY KITQNWTFMG GVTTGSATGG WDGGFDKQLG
NWGGIAGITW TSEDMGTSAN ISGTYSATST RSDEPWMLYS IVLKHKFTEK THFVLQHDHG
FAGNVLLNNV FYSNVIKDAE WYGINTHLYY DLMPELTIGV RAEWFRDRDG FRVFSPGRVA
AATDNRGFSY ALGRNQLGNS TSSPADYYAV TVGMNWRAAK RLKLDWKPLQ QLNIRPNVRY
DAADGLHGID YRPFGGHKDQ VVLSLDFMVP F
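
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips the spacing, computes a GC fraction, and translates DNA to protein. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. Only the first and last lines of the gene sequence are used as input.

```python
# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# The 22 lines before the last one hold 1320 bases in total, a multiple
# of three, so the last line starts in frame.
first_line = clean_sequence(
    "ATGAAACGCC TACCGAAACG CAGAAGAGGG ATGATCTTTG CCTCTGCGGT CAGCCTGTTT"
)
last_line = clean_sequence("GTGGTTTTAT CCCTTGATTT TATGGTTCCG TTTTGA")

print(translate(first_line))  # MKRLPKRRRGMIFASAVSLF
print(translate(last_line))   # VVLSLDFMVPF
print(round(gc_content(first_line), 2))
```

The two translated fragments agree with the start and end of the protein sequence listed above. Running `gc_content` on the full cleaned gene sequence would be the way to compare against the GC content field of the record.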