Gene Nmul_A0433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0433 
Symbol 
ID3785901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp480398 
End bp481468 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content52% 
IMG OID637810509 
Productmagnesium and cobalt transport protein CorA 
Protein accessionYP_411133 
Protein GI82701567 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0598] Mg2+ and Co2+ transporters 
TIGRFAM ID[TIGR00383] magnesium Mg(2+) and cobalt Co(2+) transport protein (corA) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATATCCC CTTCCAAAAA GAGATCCAAG AAAGCAGGCC TGCCGCCGGG CATGCCGGTG 
CATATCGGGG CAAAGAAAAA CGCGTCCCCA CGCATAACCT TGCTGGATTA CGATCCGGAA
GGCGTGCGTG AAGCCGAGGT GGCTCCCGCA GACCTTGCCG AAAAGATCAA ACACGCTTCC
GGGGTCAAGT GGGTTAACCT GCAGGGCCTG GGCGACATCC ACATGATCGA GCAGTTCGGC
GCATGTTTTA ACCTTCACCC TCTGGTACTG GAAGACATAT TCAACACAGA GCAACGCTCC
AAGGTGGAGG ATTACGGAGA TTACCTGTAC GTGGTGCTCA AAACCTTCGG GTATGAAACC
AGGGGTAAGG AAGAGAGAAT ATATTCCGAA CAGATCAGTC TTGTACTCGG CAAGGATTTC
GTGCTGTCGT TTCTGGAAGC GAATGGTGTT CAATTCGAGT CGGTCCGCGA CCGCCTGCGG
GCCGGCAAAG GCCAAAGCGC AAGACTTGGT GCCGATTTCC TGATGTACAA CCTGATTGAT
TCAGTGGTCG ACACCTACTT CAGCATTCTT GAACGCCTCG ACGAAAAAAC CGAAGCGCTG
GAAACGGAAC TGGTCGATCG TCCGCAGCCG AGCACCTTGC AATCCATTCA AAGACTCAAA
CGTGAAGGCG TTTTTTTACG CAGGGCGCTG TGGCCGCTCC GCGAGGTAAT CAGTTCTTTG
CAACGTGGAG ATTCACCCCT GTTTAGCCGC AATACTCTGC TCTACCTGCG GGATGTGTAC
GACCATACTG TCCACATTAT CGAATCGATC GAATCGCTGC GTGACGTCAC GGCGGGCATG
CTCGATATTT ATTTATCGAG CGTAAGTTTC CGCATCAGTA CTGTCATGAA AGTTCTTACC
GTCATCACCA CCATATTCAT GCCTTTGACG CTGATAACGG GCATTTACGG AATGAACTTC
ACGTACATGC CGGGGCTTGA ATGGCACATG GGATTTTTTA TTGTACTGAC CGCAATGGCA
GTCATCAGTA TTGCGATGCT GCTACTGTTC CGCTGGAAAA AATGGTTGTA G
 
Protein sequence
MISPSKKRSK KAGLPPGMPV HIGAKKNASP RITLLDYDPE GVREAEVAPA DLAEKIKHAS 
GVKWVNLQGL GDIHMIEQFG ACFNLHPLVL EDIFNTEQRS KVEDYGDYLY VVLKTFGYET
RGKEERIYSE QISLVLGKDF VLSFLEANGV QFESVRDRLR AGKGQSARLG ADFLMYNLID
SVVDTYFSIL ERLDEKTEAL ETELVDRPQP STLQSIQRLK REGVFLRRAL WPLREVISSL
QRGDSPLFSR NTLLYLRDVY DHTVHIIESI ESLRDVTAGM LDIYLSSVSF RISTVMKVLT
VITTIFMPLT LITGIYGMNF TYMPGLEWHM GFFIVLTAMA VISIAMLLLF RWKKWL