Gene Nmul_A0633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0633 
Symbol 
ID3785406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp720720 
End bp721988 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content56% 
IMG OID637810715 
ProductSodium:dicarboxylate symporter 
Protein accessionYP_411332 
Protein GI82701766 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TCTCGCTCAA TACGCAAATA TTATGGGGTG TATTCGCCGG CCTCTTCCTG 
GGCCTGGGTT TATCCCTGCT CGATGAAGAA TCGGTGATTT TCCGAGCCGG ATTATATGGC
GCCGAGCTGC TTGGCACGCT GTTTATCGAC CTGTTGAAAA TGGTGCTCAT TCCGCTCGTG
TTCACATCGA TCGCAGTAGG CGTGGCCAAC TTGCGGGCGC ACCAGCAGAT GCATAAGGTA
TGGAAAGCAA CGCTTGGTTT CTTCCTGTTT TCGATGGCGC TGGCTATTCT GCTGGGTTTG
ACCGCAGCCA ACATTGTCCG TCCCGGAGAA GGGCTGCAGC TCGCCATGTT CCAGGATGAC
ATGCAGAACT TCCAGGCCGG GCAGATGCCT TTAACGGAGT TTGTTGCGCA ACTGCTTCAT
TCGCTGTTTC AGAACCCCAT GACCGCCCTG GCTCAGGGGA ATGTGCTCGC AGTGGTCGTT
TTCGCGCTCC TGCTGGGCAT TGCAATGGTG GTGGGCGGGG AGCGCTACGC CAACATCCTC
ATACTGCTGC AGGAGCTGCT GGAGCTGATG CTGATGCTGG TTGGCTGGAT CATGCGCCTT
GCTCCGCTCG GCATCATGGG ACTGCTGGTA AAACTCGCTG CCACACAGGA CGTGACTTTG
CTTGCCACAT TGGTCGAGTT CATCGCGGTG GTGATTGGAG CCACTCTGCT GCACGGGATG
GTAGTGCTCC CGCTGATTCT TTATTTGGTC ACGGGAATGA CGCCGTTCAA ATTCTGGCGC
GGCGCCCGCG AAGCACTGCT AACAGCTTTT GCGACCAGCT CCAGCTCAGC CACCTTACCC
GTCACTTTAC GCTGCGTGGA ACAGCACCTG CACGTCAAAC GCGACATTGC CGGATTTGTC
ATCCCGTTGG GTGCAACACT GAACATGGAT GGCACTGCTT TGTACGAAGC CGTGGCAGCA
TTGTTCGTGG CCAACCTCAT CGGGATAGAA CTTAATCTCG CACAGCAGAT GATCGTGTTT
TTGACTGCGA TGCTGGCTGC CATGGGTGCT CCGGGCATAC CCAGCGCGGG AATGGTCACC
ATGGTAGTCG TGCTGCAATC GGTCGGCTTG CCGGCGGAGG CTATCGCCAT TCTGCTGCCG
GTCGACCGCT TACTGGATAC ATTCCGCACC GCTGTGAATG TCGAGGGGGA CATGGTGGGC
AGCCTCGTCG TGCAGAAATG GGTGAGGAAG GAGTCCATAC GAGGTTCCAG AAGCGATTCC
GAAGGGTAA
 
Protein sequence
MKKISLNTQI LWGVFAGLFL GLGLSLLDEE SVIFRAGLYG AELLGTLFID LLKMVLIPLV 
FTSIAVGVAN LRAHQQMHKV WKATLGFFLF SMALAILLGL TAANIVRPGE GLQLAMFQDD
MQNFQAGQMP LTEFVAQLLH SLFQNPMTAL AQGNVLAVVV FALLLGIAMV VGGERYANIL
ILLQELLELM LMLVGWIMRL APLGIMGLLV KLAATQDVTL LATLVEFIAV VIGATLLHGM
VVLPLILYLV TGMTPFKFWR GAREALLTAF ATSSSSATLP VTLRCVEQHL HVKRDIAGFV
IPLGATLNMD GTALYEAVAA LFVANLIGIE LNLAQQMIVF LTAMLAAMGA PGIPSAGMVT
MVVVLQSVGL PAEAIAILLP VDRLLDTFRT AVNVEGDMVG SLVVQKWVRK ESIRGSRSDS
EG