Gene Nmul_A1766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1766 
Symbol 
ID3783966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2018737 
End bp2019891 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content55% 
IMG OID637811852 
Producthypothetical protein 
Protein accessionYP_412455 
Protein GI82702889 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00135853 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCTCG GAAATCTCCC AAAAGAAAAT CCGGGTGTAG TCTCCCCATC CAACTTCATC 
GGATTCGGTC TCCGCTCCAG CCTTCAAATG AAGCATAATA TTCTCGCTCG CCACCCCAGC
GATTTGAATA TCGGCTGGGC GCAACGGGTA GTCAATTGTC ATTGTTCCCA TGTAACGGTA
TCCAGGGTAG ATCTGGTTTC CGTCGATATA GGAACGACGA CACGGGTTCG GATTGCGGTT
GAACATGATG GTCCGGAGAC GATCTCGCGC AAGTGGTTTG TAAAATTACC TTCGCTGGCC
TGGCGGCCAA GGCTGATTAC TGGGTTGCCA GGATTACTTC ATACTGAAAC CCGCTTCTAC
AATGAAACAG CGCAAGCGGT GCCCATCGCC GTACCCGGTT TTCTCGCGGG TCAGAGTAAA
CCCGGCAAGG GTGCGACGCT GGTTTTGAAT GATGTGACTG AATCCGGGGC TGCTGCCGGC
AACCCTGGGG ATGCCCTGAC GGCGGATCGC GCCGCACTTG TCATCAAACA ACTGGCCCGG
CTGCATGCCC GCTTCTGGAA CAAATTCGAT CTTATGCAGA AATATGCCTG GCTGGCGGGC
ATACGCCAAC TCGAAGATCA CCTGGGGACT GCGCTTGCCG TTCCGCTGAT GAAGCGGGGG
CTCCGGCAGG CGGAAAAACT CATACCCTTC CCGCTGCATG CACCCGCTAT AAATTATGCT
CGCCAGCGTC GGCGCGCCAT GCGCTTTCTT TCAGGGCGAC CGCAAACACT CGTTCATCAT
GATTGTCATC CCGGCAACCT GTTCTGGAGC CAAACTCAAC CGGGTCTTCT CGACTGGCAA
TTGGTGCGTT TCGGCGAAGG GATTGGTGAT GTCGCTTATT TTCTTGCTAC CGCCCTAACG
CCCGAGGTAC GGCGAAATCA TGAGGCAAAT CTGCTGGCTA TCTATGCCCA AGAGCTCACG
AACTGTGGTA TCGAAAACAT TGACGGCGAG ATATTGAAGC AGAGATACCG TGCTCACCTC
GTTTATCCAT TCGAAGCAAT GGTTGTGTCG CTCGCTGTCG GCGGAATGAT AAAGCCGGAA
ACCAACCGGG AACTGATTCG CCGCGTCGCC ACCGCTGTGA ACGACCTCGA TGCTTTTGCG
GCAATCCCGC TATAA
 
Protein sequence
MRLGNLPKEN PGVVSPSNFI GFGLRSSLQM KHNILARHPS DLNIGWAQRV VNCHCSHVTV 
SRVDLVSVDI GTTTRVRIAV EHDGPETISR KWFVKLPSLA WRPRLITGLP GLLHTETRFY
NETAQAVPIA VPGFLAGQSK PGKGATLVLN DVTESGAAAG NPGDALTADR AALVIKQLAR
LHARFWNKFD LMQKYAWLAG IRQLEDHLGT ALAVPLMKRG LRQAEKLIPF PLHAPAINYA
RQRRRAMRFL SGRPQTLVHH DCHPGNLFWS QTQPGLLDWQ LVRFGEGIGD VAYFLATALT
PEVRRNHEAN LLAIYAQELT NCGIENIDGE ILKQRYRAHL VYPFEAMVVS LAVGGMIKPE
TNRELIRRVA TAVNDLDAFA AIPL