Gene Nmul_A1922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1922 
Symbol 
ID3784160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2210900 
End bp2212048 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content54% 
IMG OID637812008 
Productmajor facilitator transporter 
Protein accessionYP_412609 
Protein GI82703043 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATCCA TCCCCTATTG GCGTCTTTCC GGCTTCTATT TCTTCCATTT TGCCTTTATT 
GGCGCCTTTG CTCCTTACTG GACCCTTTAC CTCAAATCCC TTTCTTTCGC TTCCTTTCAG
ATCGGGGTGC TCATGTCCCT GCTGCATGTC ACTCGCATCT TTGCCCCGGC TGCATGGGGC
TGGCTTGCAG ATCACGTCGG CAAGCGAATG TTCATTGTAC GCTCGGCTGC AATTGCAGGC
TTGGTCAGCT ACTGTGGCGT TTTTCTCGGC GAGTCCTATA GCTGGCTGTT TGTGGTCATG
GCGCTGATGA GTTTTTTCTG GAGCGCTTCC CTGCCGTTGA TCGAGGCAAC CACACTTTCA
TACCTGGGAG AAAACATCAC AAAATACGGA CTCATCCGGG TGTGGGGTTC AGTAGGATTC
ATTCTGGCGG TAACCGGGGT TGGTTATCTG CTGGATGCGA CCAGCATCAG CTCGCTGCTA
TGGGCTGTCC TTGGCTTCAA GCTCGGTATT GTCTTTTTTT CACGACTGAT TCCTGAAGTC
GGGACAGCAA CGCATCCTGC CACCGAGCAT TCCATTCCAC AAATATTCCG GCGGCCAGAA
GTACTGGCCT TTTTTGCAGC GTGCCTGTTG ATGGTGTTTG CGCACGGCCC CTACTACACC
TTTTATTCGA TCTATCTTGT CGAGTACGGA TACAGCAAAA GTCTCGTAGG CTGGCTGTGG
GCCACAGGGG TTATCTGTGA GATCGGCATA TTTTTCCTGA TGCCGCAGTT GATGCGCCGA
TTCCGCATGA AACAGATCAT GGTGTTCAGT TTCAGCTGTG CCGTAGCACG CTTCCTGATG
ATAGGCTGGG GCGTGGAATG GCCGTTTGTC ATATTTTCTG CACAGGTGCT GCATGCCGCA
ACCTACGGGG CGCATCACGC CACCGCCATG ATGGTGGTGC ATCGGCTCTT CGGTGGGCGC
CACCAGGCGA AGGGGCAAGC CCTCTACACC AGTCTCACAT TCGGGCTCGG CGGCACTATT
GGGGGCATAT TCAGCGGTTA TTCGTGGGAT TGGCTGGGGG CAGGACTCAC TTTTACGATC
AGTGCGATGG CCGTGTCGCT GGGCTTGGGG CTGGTAGTCT GGAAGATGGA CATCGACGGG
TCCGCGTGA
 
Protein sequence
MQSIPYWRLS GFYFFHFAFI GAFAPYWTLY LKSLSFASFQ IGVLMSLLHV TRIFAPAAWG 
WLADHVGKRM FIVRSAAIAG LVSYCGVFLG ESYSWLFVVM ALMSFFWSAS LPLIEATTLS
YLGENITKYG LIRVWGSVGF ILAVTGVGYL LDATSISSLL WAVLGFKLGI VFFSRLIPEV
GTATHPATEH SIPQIFRRPE VLAFFAACLL MVFAHGPYYT FYSIYLVEYG YSKSLVGWLW
ATGVICEIGI FFLMPQLMRR FRMKQIMVFS FSCAVARFLM IGWGVEWPFV IFSAQVLHAA
TYGAHHATAM MVVHRLFGGR HQAKGQALYT SLTFGLGGTI GGIFSGYSWD WLGAGLTFTI
SAMAVSLGLG LVVWKMDIDG SA