Gene Nmul_A0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0903 
Symbol 
ID3784950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1027803 
End bp1029230 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content60% 
IMG OID637810985 
Producthypothetical protein 
Protein accessionYP_411598 
Protein GI82702032 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.354068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTGG AGCGCCTACG TCAACTGCAA CCCTGCCTGT GGACATTGCC GCGCGCACCG 
GGCGAAGAAC GCGCGCAGGT CCTCCTGTAT GGCAGCGCGC CCTTGCTCGT CAGCATGGAC
GACAAGGTAC TCGAGCAGAT CGCCAACGTT GCTTCCCTGC CGGGGCTGGT TGGAGCGGCA
ATGACCATGC CGGATGCGCA TTGGGGATAT GGTTTTCCCA TAGGGGGTGT TGCTGCTTTC
GATGCTGAGC AGGGGGGGGT GATTTCCGCA GGCGGGGTCG GCTTCGATAT TTCGTGCGGC
ATACGCTGTC TGCGCAGCAA TCTGAATCTG GAGGATGCGG TAGAGCATTT TCCCCAGCTC
GGAAAAGCCT TGTTCCGCGC TATCCCTGCT GGAGTGGGCG AGGAGGGCGA GATCAAGCTG
AACCCGGAGC AGCTCGACCA GGTGATGCAC GGCGGTGCAC ACTGGGCGGT ACAGCAAGGC
TACGGCACTC CAGCAGATCT GGATTATGTT GAAGAACAGG GGCGGGTAGC AGGAGCGATC
CCGGAAAACG TATCCGAACT TGCCAAAAAG CGCCAGCGCG GCGAGATGGG CACGCTCGGC
TCAGGCAATC ATTACCTGGA AGTACAGGTG GTGGACCGCA TCTTCGATCC TGGCGTCGCG
CTGGCGCTTG GCCTGCATGA AGGACAAATC CTGATTTCGA TACATTGCGG TTCGCGTGGG
CTGGGGCACC AGATCGGTAC CGATTATCTG GTGCTATTGG CAAAAGCAGC CAGCCGCTCG
GGCATTCATT TACCCGATCG TGAACTCGCT TGCGCGCCTG TCAAATCCCC CGAAGGCCAG
CAATATATCG GCGCGATGAA TGCTGCGATC AATTGCGCGC TTGCCAACCG GCAAATCCTG
ACGCATCTTA CGCGCTCCGT ATTTACGGAA ATTTATCCCC AGGCCGAGCT TGAAACCTTG
TTTGATGTTT CGCACAATAC CTGCAAGGCC GAAACCCATC AGATCGACGG TGAATCGAGG
TTGCTGTATG TGCACCGCAA GGGCGCTACA CGCGCATTCG GCCCGGGCCA CCCCATGCTG
CCGGAACGCT ACCGCCAGGT AGGACAACCC GTCGTTATCG GTGGAAGCAT GGGAACAGGC
TCCTACATTC TCGTTGGTGA CAGCGAAAAT CCCGCCTTCG CTTCTTCAAG CCACGGTGCA
GGCCGGGCCA TGAGCCGGCA CCAGGCACTC GCGCGATGGA AAGGACGTGC GCTGGTGGAC
GAGTTGGCGC AACAAGGTAT TCTGATCCAT ACCCGCTCCA TGCGAGGTGT GGCGGAAGAA
GCCCCGGGCG CCTATAAGGA TGTCGATCTG GTGGCGGAGG CCACGGAAGA AGCCGGGCTC
GCCCGGCGCG TCGCGTTTCT CCGACCCAAA GTCTGCGTTA AGGGTTAA
 
Protein sequence
MNLERLRQLQ PCLWTLPRAP GEERAQVLLY GSAPLLVSMD DKVLEQIANV ASLPGLVGAA 
MTMPDAHWGY GFPIGGVAAF DAEQGGVISA GGVGFDISCG IRCLRSNLNL EDAVEHFPQL
GKALFRAIPA GVGEEGEIKL NPEQLDQVMH GGAHWAVQQG YGTPADLDYV EEQGRVAGAI
PENVSELAKK RQRGEMGTLG SGNHYLEVQV VDRIFDPGVA LALGLHEGQI LISIHCGSRG
LGHQIGTDYL VLLAKAASRS GIHLPDRELA CAPVKSPEGQ QYIGAMNAAI NCALANRQIL
THLTRSVFTE IYPQAELETL FDVSHNTCKA ETHQIDGESR LLYVHRKGAT RAFGPGHPML
PERYRQVGQP VVIGGSMGTG SYILVGDSEN PAFASSSHGA GRAMSRHQAL ARWKGRALVD
ELAQQGILIH TRSMRGVAEE APGAYKDVDL VAEATEEAGL ARRVAFLRPK VCVKG