Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0633 |
Symbol | |
ID | 3785406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 720720 |
End bp | 721988 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810715 |
Product | Sodium:dicarboxylate symporter |
Protein accession | YP_411332 |
Protein GI | 82701766 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TCTCGCTCAA TACGCAAATA TTATGGGGTG TATTCGCCGG CCTCTTCCTG GGCCTGGGTT TATCCCTGCT CGATGAAGAA TCGGTGATTT TCCGAGCCGG ATTATATGGC GCCGAGCTGC TTGGCACGCT GTTTATCGAC CTGTTGAAAA TGGTGCTCAT TCCGCTCGTG TTCACATCGA TCGCAGTAGG CGTGGCCAAC TTGCGGGCGC ACCAGCAGAT GCATAAGGTA TGGAAAGCAA CGCTTGGTTT CTTCCTGTTT TCGATGGCGC TGGCTATTCT GCTGGGTTTG ACCGCAGCCA ACATTGTCCG TCCCGGAGAA GGGCTGCAGC TCGCCATGTT CCAGGATGAC ATGCAGAACT TCCAGGCCGG GCAGATGCCT TTAACGGAGT TTGTTGCGCA ACTGCTTCAT TCGCTGTTTC AGAACCCCAT GACCGCCCTG GCTCAGGGGA ATGTGCTCGC AGTGGTCGTT TTCGCGCTCC TGCTGGGCAT TGCAATGGTG GTGGGCGGGG AGCGCTACGC CAACATCCTC ATACTGCTGC AGGAGCTGCT GGAGCTGATG CTGATGCTGG TTGGCTGGAT CATGCGCCTT GCTCCGCTCG GCATCATGGG ACTGCTGGTA AAACTCGCTG CCACACAGGA CGTGACTTTG CTTGCCACAT TGGTCGAGTT CATCGCGGTG GTGATTGGAG CCACTCTGCT GCACGGGATG GTAGTGCTCC CGCTGATTCT TTATTTGGTC ACGGGAATGA CGCCGTTCAA ATTCTGGCGC GGCGCCCGCG AAGCACTGCT AACAGCTTTT GCGACCAGCT CCAGCTCAGC CACCTTACCC GTCACTTTAC GCTGCGTGGA ACAGCACCTG CACGTCAAAC GCGACATTGC CGGATTTGTC ATCCCGTTGG GTGCAACACT GAACATGGAT GGCACTGCTT TGTACGAAGC CGTGGCAGCA TTGTTCGTGG CCAACCTCAT CGGGATAGAA CTTAATCTCG CACAGCAGAT GATCGTGTTT TTGACTGCGA TGCTGGCTGC CATGGGTGCT CCGGGCATAC CCAGCGCGGG AATGGTCACC ATGGTAGTCG TGCTGCAATC GGTCGGCTTG CCGGCGGAGG CTATCGCCAT TCTGCTGCCG GTCGACCGCT TACTGGATAC ATTCCGCACC GCTGTGAATG TCGAGGGGGA CATGGTGGGC AGCCTCGTCG TGCAGAAATG GGTGAGGAAG GAGTCCATAC GAGGTTCCAG AAGCGATTCC GAAGGGTAA
|
Protein sequence | MKKISLNTQI LWGVFAGLFL GLGLSLLDEE SVIFRAGLYG AELLGTLFID LLKMVLIPLV FTSIAVGVAN LRAHQQMHKV WKATLGFFLF SMALAILLGL TAANIVRPGE GLQLAMFQDD MQNFQAGQMP LTEFVAQLLH SLFQNPMTAL AQGNVLAVVV FALLLGIAMV VGGERYANIL ILLQELLELM LMLVGWIMRL APLGIMGLLV KLAATQDVTL LATLVEFIAV VIGATLLHGM VVLPLILYLV TGMTPFKFWR GAREALLTAF ATSSSSATLP VTLRCVEQHL HVKRDIAGFV IPLGATLNMD GTALYEAVAA LFVANLIGIE LNLAQQMIVF LTAMLAAMGA PGIPSAGMVT MVVVLQSVGL PAEAIAILLP VDRLLDTFRT AVNVEGDMVG SLVVQKWVRK ESIRGSRSDS EG
|
| |