Gene Nmul_A2103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2103 
Symbol 
ID3784674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2393683 
End bp2394951 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content56% 
IMG OID637812191 
Productmajor facilitator transporter 
Protein accessionYP_412788 
Protein GI82703222 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.941752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAGG TAAAGAAGGT TAAGTCGGTG GAAACAGGCC GGATCGGCAG AACACTTACC 
CGCCTGCTCA TGCCGGTAGG AGTAGCCGAG GTTGCTTCCC CCTTACTGAT CGCTCGCGCA
CTGCGTGGCT TTGCAGACGG TTATGTCGTT GTGCTTTTGC CTGCGTACCT GCTGGCGCTG
GGGTTTGACC AGCTATATGT AGGCTATTTA AGTACGGCTA CCCTTGCGGG ATCCGCCCTG
GCGACATTGG CGATCGGAGC AATGGGCTAC CGCTGGTCAA GCCGTCGGCT GTTGCTGTTC
GCTGCCCTGT TAATGACAGC TACCGGCCTT GCTTTCGCGA GCGTTTCTTC GTTTTTGCCG
CTCCTCATTG TCGCATTTGT TGGAACCCTT AATCCGAGTT CGGGCGATGT GAGCATGTTT
TTACCCCTGG AGCATGCTCG TCTGGCACAG GGAGCAACTG GTAACCTACG CACCACTCTG
TTTGCCCGTT ACACGCTTAC TGGTTCGTTA AGCGCGGCTG TAGGTGCCTT GGCCGCTGGC
ATCCCCTCCT GGCTGATGCA ATGGGCAGGG CTACCCTTGA TTGATGGTTT GCGCACAATG
TTTGTGCTGT ACGGCCTGCT CGGCGGATTG GTTTGGCTGC TCTATCGGGC TCTGCCAGCG
GGCGACATCT CTACCAAGCA GTTACCAGCG CCGCTAGGTC CGTCACGCAG TATCGTTTTC
AAGCTTGCGA TGTTGTTCAG CCTTGATGCA TTTGCCGGGG GATTGGTGGT GAATTCGCTA
CTCACACTCT GGCTTTTTGA GCGATTTGGA CTCTCACCGG GAGAGGCAGG AACCTTTTTC
TTCTGCGCAG GGCTTCTATC GGCCGGATCG CAGCTCGCTG CACCTGTAAT TGCCCGCAAG
ATCGGACTGC TTAACACGAT GGTCTTCACT CACATTCCTG CAAACGTGTG TCTCATTTTT
GCCGCAATTG CTCCAAACGT AGAAATCGCG CTGACGCTAT TGTTCGTGCG AAGTGCATTA
TCGCAGATGG ATGTGCCCAC ACGAACTGCC TATGTGATGG CTGCAGTTAC GCCACCCGAG
CGTGCTGCCG CGGCAAGCTT CACTACCGTA CCTCGCAGCC TTGCCTCAGT CCTGAGTCCA
AGCTTGGCCG GCGCCTTGCT GGCCAGCGGG CTTCTTAGCG CTCCCTTAAT TGCTTGTGGA
GCTTTAAAGA TTGCATATGA CTTGGCGTTG CTGGTGTCCT TCAGCCGCGT GAATGTTCCG
CTGGATTAG
 
Protein sequence
MNQVKKVKSV ETGRIGRTLT RLLMPVGVAE VASPLLIARA LRGFADGYVV VLLPAYLLAL 
GFDQLYVGYL STATLAGSAL ATLAIGAMGY RWSSRRLLLF AALLMTATGL AFASVSSFLP
LLIVAFVGTL NPSSGDVSMF LPLEHARLAQ GATGNLRTTL FARYTLTGSL SAAVGALAAG
IPSWLMQWAG LPLIDGLRTM FVLYGLLGGL VWLLYRALPA GDISTKQLPA PLGPSRSIVF
KLAMLFSLDA FAGGLVVNSL LTLWLFERFG LSPGEAGTFF FCAGLLSAGS QLAAPVIARK
IGLLNTMVFT HIPANVCLIF AAIAPNVEIA LTLLFVRSAL SQMDVPTRTA YVMAAVTPPE
RAAAASFTTV PRSLASVLSP SLAGALLASG LLSAPLIACG ALKIAYDLAL LVSFSRVNVP
LD