Gene Nmul_A0142 details


Gene Information

Locus tag: Nmul_A0142
Symbol:
ID: 3784114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 149141
End bp: 150487
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 61%
IMG OID: 637810213
Product: hypothetical protein
Protein accession: YP_410843
Protein GI: 82701277
COG category:
COG ID:
TIGRFAM ID:
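
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes the start and end coordinates are 1-based and inclusive, which the record itself does not state.

```python
# Values copied from the Gene Information fields above.
start_bp = 149141
end_bp = 150487
gene_length_bp = 1347
protein_length_aa = 448

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the final codon is the stop codon,
# so the protein is one residue shorter than the codon count.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("protein length consistent:", remainder == 0 and expected_protein_aa == protein_length_aa)
```

Under that assumption the span is 1347 bp, which is 449 codons: 448 residues plus the stop codon.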


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATTCGAG GCTATCCCGC TTCGGCAGGC GTCGTTCCCG GCGATTCGCT GGTTCTGCAC 
ATTGCGACCG ACGCGCCGCG CTTTCGCGTG GTTTTCTATC GTTGGGGAGA GGGTCTGCTG
CGGGTGTCCG AAACGGATTG GCTGGCAGGA AAATATGCAC CTCCACGAAG CGCGGCGGAA
GACTGGCAAT GGCCGTCATA CGCGTTTCCT GTTCCGCACG ACTGGCTGTC CGGAGTCTAT
ATTGCTCACC TGGAGGAACC GGGGGGCAAT GCGGTGTCCC TGGCAATGGA AAGCGCCGCC
GTCCTGTTCG TCGTCCGCGG CAGTGGACGC AGCAAGCTGC TCTACAAGAT TCCGCTCGCC
ACTTACCATG CCTATAACTG CACCGGCGGT GGCTGCTTCT ACGTCAATCC TCCGCGTTCA
GAGGACCCGC CAGGGGCCAG GCTCTCGCTG CTTCGTCCGG GTGGCGGCAT CGGCGGCGAG
ACCTGGGGAG CGCTCGACTA CTACGATTTG AGTTCGCCGC GCCAGACTTT CGCCCATTGG
GATGCGCGCT TCATTCGCTG GCTGCTGCGC AATGGATACC AGCCTGAATT CTGCACCGAT
CTCGATATCC ACTCCGATCC GGATCTGTGC GGTCGCTACC GGCTTATGCT GAGCGTGGGA
CACGACGAAT ACTGGAGCGA GACGATACGC GACCGGACGG AAGATTTTGT GTCAAAAGGA
GGCAACGTTG CCTTCTTCGG CGCCAACCTC TGCTGGTGGC GAATTCATAT CCTGCATGGC
GGAGCGGCGA TGCTATGCCA TCAAGGCGGT CCTCATGGAG CGCTTGATCA CTGGTGGCCA
TCCACAGGTG TGGCTCGTCC CGAGGATTCG CTGACAGGGG TGAGTTATCG GAACGGGGGC
GGCTGGTGGG ATGGGCCCCG GAACACGGGA GGGTATATCG TGCAGGATCC GGAGCACTGG
GTGTTTGCCG GCACAGGGTT GGGTCGCGGC GATGCGTTTG GTGACAAAAC CTCGCCACCC
CTGGTCGGAT ACGAGTGCGA TGGCGCTCCG CTGGATGACT TCGACAAGGC TTCGGGACTT
GCAGTCCTTT CCTCGAACGC CGGAAATACC GGTACTCCAG AGGGGTTTCG TGTTCTGGCG
GCAAGCGTGC TGGATGGCAA TTGGCAGGAA CTCCCCCCTC GGGAGGCTTA TCCCGCCCGT
GAAGGTATTC ACGCGGCAAC AATGGGAATT TTTTCGCGGA ACGGCGCGGT GTTCACCGCG
GGAACTACCG ACTGGGCTCA GGTACTCGAA AATCCGCTGG TCGATGCCAT CACACGTAAT
GTCATCGACC AGCTGCTTCG CGAGTAA
 
Protein sequence
MIRGYPASAG VVPGDSLVLH IATDAPRFRV VFYRWGEGLL RVSETDWLAG KYAPPRSAAE 
DWQWPSYAFP VPHDWLSGVY IAHLEEPGGN AVSLAMESAA VLFVVRGSGR SKLLYKIPLA
TYHAYNCTGG GCFYVNPPRS EDPPGARLSL LRPGGGIGGE TWGALDYYDL SSPRQTFAHW
DARFIRWLLR NGYQPEFCTD LDIHSDPDLC GRYRLMLSVG HDEYWSETIR DRTEDFVSKG
GNVAFFGANL CWWRIHILHG GAAMLCHQGG PHGALDHWWP STGVARPEDS LTGVSYRNGG
GWWDGPRNTG GYIVQDPEHW VFAGTGLGRG DAFGDKTSPP LVGYECDGAP LDDFDKASGL
AVLSSNAGNT GTPEGFRVLA ASVLDGNWQE LPPREAYPAR EGIHAATMGI FSRNGAVFTA
GTTDWAQVLE NPLVDAITRN VIDQLLRE
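
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is not the tool that produced the record. The function name `translate_cds` and its `starts_at_initiator` parameter are hypothetical, and the code assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, with an initiator codon such as GTG read as methionine.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate a DNA fragment in frame 0 (hypothetical helper).

    Whitespace (the 10-base grouping used in the record) is ignored.
    If starts_at_initiator is True, the first codon is assumed to be the
    annotated start codon and is reported as M.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues and starts_at_initiator:
        residues[0] = "M"
    # Trailing stop codons are not part of the protein sequence.
    return "".join(residues).rstrip("*")


# First 30 bases and last 27 bases of the gene sequence above.
first_30 = "GTGATTCGAG GCTATCCCGC TTCGGCAGGC"
last_27 = "GTCATCGACC AGCTGCTTCG CGAGTAA"

n_terminus = translate_cds(first_30)
c_terminus = translate_cds(last_27, starts_at_initiator=False)
print(n_terminus)  # MIRGYPASAG
print(c_terminus)  # VIDQLLRE
```

The two fragments reproduce the first ten and last eight residues of the protein sequence listed above. Because every full sequence line holds 60 bases, each line begins in frame, so individual lines can be translated the same way with `starts_at_initiator=False`.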