Gene Nmul_A2333 details

Gene Information

Locus tag: Nmul_A2333
Symbol:
ID: 3785323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2656305
End bp: 2657282
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 54%
IMG OID: 637812421
Product: binding-protein dependent transport system inner membrane protein
Protein accession: YP_413016
Protein GI: 82703450
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
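
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a single terminal stop codon; both are conventions assumed here, not stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2656305
END_BP = 2657282
GENE_LENGTH_BP = 978
PROTEIN_LENGTH_AA = 325


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both computed values agree with the reported fields (978 bp, 325 aa).
print(computed_gene_length, computed_protein_length)
```

Under those assumptions the reported gene length and protein length are consistent with the start and end positions.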


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAATT ACATCATCCG GCGCATCCTG TATGCGATTC CCATCCTGGT TGGCGTAAAC 
CTGATTACTT TCACCTTGTT TTTCGTCGTA AATACGCCCG ATGACATGGC GCGCATGCAG
CTTGGCATCA AGCGTGTTAC ACCGGACGCC ATCGCCAAGT GGAAACAGGA ACGCGGCTAT
GATCAGCCGC TGATGTTCAA CAACGCCCGG GAGGGCCTGG AGAAAATCAC CCGTACCATA
TTTTTCGAGA AATCCGTCGC CATGTTCGCG TTCCGCTTCG GCCGCGCGGA TGACGGGCGC
GATATCGCCT ACGAAATCAG GACGCGCATG CTCCCCAGTC TCGCTATTGC ACTTCCGGTA
TTCGTGCTGG GACTCATCGT TTATATCGCC TTCGCATTGA CGATGGTATT TTTTCGGGCC
ACCTACGTGG ATTTTTGGGG TGTGGTACTA TGCGTCGCGC TAATGTCCAT ATCAAGCCTG
TTCTATATTA TCGGAGGGCA ATTTCTGATC AGCAAATTGT GGCATCTCGT GCCCATATCG
GGTTACAGCC AGGGGCTGGA TTCGGCCAAG TTCCTGGTGC TGCCGGTGAT TATCGGCGTC
ATCAGCGCCG CCGGCGCGAA TACGCGCTGG TACCGGACAC TTTTTCTGGA GGAAATGGGC
AAGGACTACG TGCGTACTGC CCGTGCCAAG GGTCTGCCGG AAAGCGCCGT GCTGTTCCGG
CATGTGCTGC AGAACGCCCT GATTCCAATT CTTACCGGTG CCGTTGTGGT AATTCCCCTG
CTGTTTCTGG GAAGCCTGAT CGTGGAATCA TTTTTCGGCA TTCCGGGGTT GGGCAGCTAT
ACCATCGATG CGATCAACTC TCAGGATTTT GCCGTGGTGC GCGTGATGGT TTTCCTGGGA
TCGTTGCTCT ATATCGTCGG ACTGATATTG ACCGACATTT CCTATACGCT TGTAGATCCG
AGGATAAGGC TGGAATGA
 
Protein sequence
MLNYIIRRIL YAIPILVGVN LITFTLFFVV NTPDDMARMQ LGIKRVTPDA IAKWKQERGY 
DQPLMFNNAR EGLEKITRTI FFEKSVAMFA FRFGRADDGR DIAYEIRTRM LPSLAIALPV
FVLGLIVYIA FALTMVFFRA TYVDFWGVVL CVALMSISSL FYIIGGQFLI SKLWHLVPIS
GYSQGLDSAK FLVLPVIIGV ISAAGANTRW YRTLFLEEMG KDYVRTARAK GLPESAVLFR
HVLQNALIPI LTGAVVVIPL LFLGSLIVES FFGIPGLGSY TIDAINSQDF AVVRVMVFLG
SLLYIVGLIL TDISYTLVDP RIRLE
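
The gene and protein sequences can be cross-checked with a short translation routine. The sketch below is a minimal illustration, not the database's own tooling: it builds the codon table used by translation table 11 (whose codon-to-residue assignments match the standard code), strips the spacing used in the 10-base blocks, and translates up to the first stop codon. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Residues for all 64 codons in TCAG order (NCBI layout for translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above (both start on a codon boundary).
first_line = "ATGCTTAATT ACATCATCCG GCGCATCCTG TATGCGATTC CCATCCTGGT TGGCGTAAAC"
last_line = "AGGATAAGGC TGGAATGA"

print(translate(first_line))  # MLNYIIRRILYAIPILVGVN
print(translate(last_line))   # RIRLE
```

The two translated fragments match the first 20 and last 5 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the reported GC content.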