Gene Nmul_A2521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2521 
Symbol 
ID3786646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2885245 
End bp2886834 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content53% 
IMG OID637812612 
Producthypothetical protein 
Protein accessionYP_413202 
Protein GI82703636 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.630268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCAG GAGTCCGCCA GGGCATCGCC CCGGACCCCT CACTGGCACT GGAGCATGAG 
GGGTTGAAAG CAGGCGCTCT GCTTGCGCTC GGGGTCATTG CGGGAATTCT CCTTGCTTAT
CATGAGACAA CCTGGTCCAT GATTTCCACC TGGGCCAGAT CGGATACGTT TGCTCATGGC
TTTCTGATCT TTCCGTTCAG CGGCTATCTG ATCTGGGGAG AGCGCAAGCA TCTCGCCACT
CTTTTTGCGC GTCCCAATCC CAAAGTTCTG TTCGTGCTTG CGATTCTCGG TTTCAGCTGG
CTTCTGGGAA CCTTGGCAAG TGTGCAGGTA TTTGCGCAAT TCATGGTGAT AGCGATGATT
TCCGCCGCCG TCTGGGCAAT CCTGGGAAAT CAGATAGCAT GGGCACTGGC CTTTCCATTA
GCCTACCTCC TGCTCGCCGT ACCTTTTGGC GATGCATTCA TACCGCCGCT GATAAACTTT
ACCGCGGACT TTACCGTGAC AGCACTTCAG CTGACCGGAA TTCCTGTCTA TCGCGAAGGC
AATTTTTTTA CCATTCCCAG CGGCAATTGG TCGGTGGTTG AAGCCTGCAG CGGGCTGCGC
TACCTGATTG CATCCTTTAC GCTCGGAACC CTTTATGCAT ACCTGACCTA TCGGAGCCTC
AGCCGTCGGC TGATATTTAT TGCCCTGTCT CTGATCGTGC CGATCCTTGC CAACGGTATC
CGCGCCTATC TGATTGTAAT GACCGGCCAC CTGAGCGATA TGCAGCTCGC GGTGGGCATC
GACCACCTGA TTTACGGATG GGTATTCTTT GGCTTTGTAA TGCTGGTGCT ATTCTGGATC
GGATCCTTCT GGCGCGAAGA TGAGAACAGG TTTGAGGCCG GCTTTACAGC GCAGAGCTCG
ACAGGAAATA ACCGCGCTCC TGGAGGTGAT TCCACCTTGA TAAGGACAAT TTTCACTGCT
GTCGCGGCTT TAAGCATAGC CGTCATCTGG CCGATATACG CGGAGTGGCT GGAAAGGAAG
ATTTCCTACG GTACGGAAGA AGTGGAAATC CATATCTTAG ATACATCCGG CAAGTGGCAA
GGTAGCCCGG AACCCCTTTC CGATTGGAAA CCCATATACA CGGGTTCGAC TGCTCAGTTT
CTGAGAAATT ACCGGAATAA CGGACAGTCG GTAGATCTTT ACATCAGTTA TTACCGCGAC
CAGAAGCAGG GTGCTGAATT GATCAATTCC GAAAATGTGC TTGTAGCGGG AAAGGGATCA
AAATGGCATG AGGCTGGCGA AGATATGCGC AGAATCCTCC TGGGTTCGCA GGAAGAAATC
GTCAAACAGA ACCGGTTGCA CTCCTCTTCG GCATCGCTGC TTGTGTGGCG ATGGTATTGG
ATAGGAGGTG AGGAGACCGC AAATCCGTAT TGGGCAAAGT TCCTGCTGGC CAGGAATAAG
CTCCTGGGAA GAGGCGACGA TGCGGCTGAG CTCATTGTGG CAACACGATA CGAGGACAGC
CTCGATGAAG CAGCTTCAGT GCTTCAGGAT TTTATTATGG ATAGGGCGCC TGCAATAACG
GGCGCACTTC GAAATGCAGC CAACCGTTAG
 
Protein sequence
MNAGVRQGIA PDPSLALEHE GLKAGALLAL GVIAGILLAY HETTWSMIST WARSDTFAHG 
FLIFPFSGYL IWGERKHLAT LFARPNPKVL FVLAILGFSW LLGTLASVQV FAQFMVIAMI
SAAVWAILGN QIAWALAFPL AYLLLAVPFG DAFIPPLINF TADFTVTALQ LTGIPVYREG
NFFTIPSGNW SVVEACSGLR YLIASFTLGT LYAYLTYRSL SRRLIFIALS LIVPILANGI
RAYLIVMTGH LSDMQLAVGI DHLIYGWVFF GFVMLVLFWI GSFWREDENR FEAGFTAQSS
TGNNRAPGGD STLIRTIFTA VAALSIAVIW PIYAEWLERK ISYGTEEVEI HILDTSGKWQ
GSPEPLSDWK PIYTGSTAQF LRNYRNNGQS VDLYISYYRD QKQGAELINS ENVLVAGKGS
KWHEAGEDMR RILLGSQEEI VKQNRLHSSS ASLLVWRWYW IGGEETANPY WAKFLLARNK
LLGRGDDAAE LIVATRYEDS LDEAASVLQD FIMDRAPAIT GALRNAANR