Gene Nmul_A2472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2472 
Symbol 
ID3784821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2826044 
End bp2827270 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content58% 
IMG OID637812563 
Productpatatin 
Protein accessionYP_413153 
Protein GI82703587 
COG category[R] General function prediction only 
COG ID[COG1752] Predicted esterase of the alpha-beta hydrolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.797753 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTCCCGT CATCCTCACG GGTACCGAAA GTGGGCCTCG TCCTGACTGG AGGAGGGGCC 
CGTGCCGCCT ATCAGGTGGG TGTTCTGCAG GCTATCGCGG CGATGCTGCC CAAGAGAACG
CGCACCCCCT TTCCCGTAAT TTGCGGTACA TCCGCCGGCG CATTCAATGC CGCGGTTCTC
GCCATCTCGG CCCGGAACTT TCAGGAGGGT GTGCGACGCC TGTCGGGGGT ATGGGAAAAC
GCGCACGTCA ACCAGGCCTA CCGGACAGAC CCTCTAGGCG TATACGCAAA TGCAATACGC
TGGCTCGCAT CTCTCCTGTT TGGAAGCGTG AAAAACCAGG GCGCGACCTC CCTGCTCGAC
AACTCGCCAC TTGCGCAACT GCTGGAAAAC AGCCTGCCGC TTCAAAGCAT TCAGAAGAGT
ATCGATACCG GCGCCTTGCA TGCTCTCGGC ATTACCGCCT GGGGCTATAC CAGCGGACAA
TCGGTGACGT TCTATCAGGG TGCGGACAGC ATACGGTCGT GGAAGCGGGA ACGCCGCATC
GGTGTCGCCG TTCCTATCGA AATTGAGCAT CTGCTGGCCT CTTCCGCCAT TCCGCTTCTT
TTTCCAGCCG TGCGGCTGAA CCGCGAGTAC TTCGGAGACG GTTCAATGCG CCAGCTTGCG
CCGTTGAGTC CCGCGCTGCA TCTCGGGGCA GACCGTGTGC TGGTGATTGG CGTGCGCAAG
ATAGAGGAAA CACAGCCCGA GCGTGTCAAG GTAGACACCT ATCCCACGCT CGCGCAGATC
GGCGGTCATA TCATGAGCAG TATTTTTCTC GATAACCTTT ATGTCGACTT GGAACGGTTG
CAGCGCATCA ATCGGACCCT ACGCATGATT CCCGAAGAAA AAATGAGAAA TCACGACATG
CCGCTGCGCC AGATTCAACA TATGGTCATT TCCCCCAGCG TTGAATTCAC TGAAATCGCG
CAGCAGCACG CTGCAACCCT GCCGCATACT ATCCGGCTTT TTTACCGGGC CATCGGGGCA
ATGAGACGCG ACGGCTCGTC TCTCCTGAGC TATGTTCTGT TTGAAGAACC CTTCTGCCGC
GCACTCATCG ATCTCGGCTA TCAGGATACG CTGCCGCGCA AAGCCGAACT CTTGCGGTTT
CTCAATGCAG CGCCAATCAA TGGGCCGACG CAAGCAGATT TATCCGACGC TGGTATAATC
CGCAATCCAG TTCCGGGGAT AGGCTGA
 
Protein sequence
MLPSSSRVPK VGLVLTGGGA RAAYQVGVLQ AIAAMLPKRT RTPFPVICGT SAGAFNAAVL 
AISARNFQEG VRRLSGVWEN AHVNQAYRTD PLGVYANAIR WLASLLFGSV KNQGATSLLD
NSPLAQLLEN SLPLQSIQKS IDTGALHALG ITAWGYTSGQ SVTFYQGADS IRSWKRERRI
GVAVPIEIEH LLASSAIPLL FPAVRLNREY FGDGSMRQLA PLSPALHLGA DRVLVIGVRK
IEETQPERVK VDTYPTLAQI GGHIMSSIFL DNLYVDLERL QRINRTLRMI PEEKMRNHDM
PLRQIQHMVI SPSVEFTEIA QQHAATLPHT IRLFYRAIGA MRRDGSSLLS YVLFEEPFCR
ALIDLGYQDT LPRKAELLRF LNAAPINGPT QADLSDAGII RNPVPGIG