Gene Nmul_A1509 details


Gene Information

Locus tag: Nmul_A1509
Symbol:
ID: 3786095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1724956
End bp: 1726155
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 55%
IMG OID: 637811597
Product: hypothetical protein
Protein accession: YP_412204
Protein GI: 82702638
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
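
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1724956, 1726155)
protein = expected_protein_length_aa(length)
print(length, protein)  # prints "1200 399"
```

Both results agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields listed in the record.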


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCTG TGCCGGAAAA GCTGCTTCCT GAAAGACTGG ATGGGGAATC AGAAGAAGGT 
CCTCCCGGCA CTTCCGCTCA GGCGATAGAT GAGGAGGCCG CGGCACCTCG CGCTCTACCT
CGTTTTGGGA CAGCTATCGC ACTCTGGATA ATCGCGCTGG TCGTGCTGTT GGGAGGCCTG
CATTTCGCGC AGACCTTTTT TGTTCCCTTG TTGTTCGGCG TTCTCGTGAG TAACGCGTTA
AGTCCTGTAG TTGATTGGCT CGAGCGCTGC CGCGTCCCTC GCATACTCGG GGCTGCGCTT
GTGCTTGTTG TCCTGCTTGG CGGCGTTTCA TGGGTAACCT TATCCTTGAG CGGTGATGCA
AGTCTTATAG TTGAAAAACT TCCTGAAGTT GCGCACAAAT TGCGGCATAG TCTGAGAACG
CTGCGATCCG AGGGTCCAAG CGTATTGCAG CAGGTCGAGA AAGCGGCGAA AGAGCTTGAG
AAGGCGGCGG TAGATGCAGG GTTGAAATCG CCCGCGGCGG CAGTGGTTAT TACAAGCCAC
GCGGAAGATG GCGCATGGGT CAAGGATTTT CTACTCAAGC AATCCGCGTT GCTGGTCTCG
TTTGCCGCGC AAATGCCGGT CGTGTTGCTG CTGACCTATT TCCTGCTGGC AGCAGGGACA
CATTTTCGCC GCAAGCTCAT AAAACTGGTC GGGCCATCTC TGACACGCAA AAAGGATGCG
GTTCGAATAC TGGAGGAAGT ACATTTGCAG GTCCAGCGCT ACCTGCTTGT CTTGATCATA
TCGAATACTT TGATCGCCGT ACTCACCTGG TGGGCATTTG AATTGTATGG ATTGGAACAC
GCCGGAGTGT GGGGGGTCGC TGCCGGCATA TTACGTTTTG TTCCTTATCT CGGGACGATG
ACTATCCTGC TGGCAAGCGG TATAGCAGGC TTGCTGCAAT TTGGTTCTCT TCCGCTTGCG
CTCGCGATAG CTGCGACAGC AGTTTTGATT TCCGGCTCTA TTGGAATGTT GTTCGGCACT
TGGTTGCAGG GAAGATTCGC GCGAGTGAAT GAGGCGGTGC TGTTCATCGT GCTGTTATTT
TTTGGCTGGC TGTGGGGCGT GGCCGGCTTG CTTCTGGGGG CGCCGCTATT GGCCGTCGCA
AAAGTGGTTT GCGATCGGAT CGAATCGCTC AAGCCCGTGG GTGAAATGCT GGGGCGGTAG
 
Protein sequence
MSAVPEKLLP ERLDGESEEG PPGTSAQAID EEAAAPRALP RFGTAIALWI IALVVLLGGL 
HFAQTFFVPL LFGVLVSNAL SPVVDWLERC RVPRILGAAL VLVVLLGGVS WVTLSLSGDA
SLIVEKLPEV AHKLRHSLRT LRSEGPSVLQ QVEKAAKELE KAAVDAGLKS PAAAVVITSH
AEDGAWVKDF LLKQSALLVS FAAQMPVVLL LTYFLLAAGT HFRRKLIKLV GPSLTRKKDA
VRILEEVHLQ VQRYLLVLII SNTLIAVLTW WAFELYGLEH AGVWGVAAGI LRFVPYLGTM
TILLASGIAG LLQFGSLPLA LAIAATAVLI SGSIGMLFGT WLQGRFARVN EAVLFIVLLF
FGWLWGVAGL LLGAPLLAVA KVVCDRIESL KPVGEMLGR