Gene Information |
Locus tag | Nmul_A1807 |
Symbol | |
ID | 3786358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2066832 |
End bp | 2068493 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637811893 |
Product | hypothetical protein |
Protein accession | YP_412496 |
Protein GI | 82702930 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
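
The length fields in this table can be cross-checked against the coordinates. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2066832
end_bp = 2068493

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1662 bp
print(protein_length, "aa")  # 553 aa
```

Under these assumptions the computed values agree with the Gene Length (1662 bp) and Protein Length (553 aa) rows.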
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCAAAC AAGCCCGCAA TAAAGAATCG AACCGGGGAC GCTCCGGCAC TTGGGAACCG CTCAAGATAC TGCCATTCAG GGCATTCTGG TTCGCCGCGC TTGGCTCCAA CATCGGAACG TGGATCAATG GCGTATCCTC CGCCTGGGTA ATGACCGACC TGTCTCCCTC ACCGGTGATG GTGTCCCTGG TGCAGGCGGC CACTTCGTTG CCCATGGTAC TGTTCGCGCT GGCTGCAGGT GCGCTGACCG ACATTGTAGA CCGACGGCGT TATCTTCTCT TCACGCAGAT ATGGATGGCC GCGGCTGCGG CAATGCTCAC CGTACTTGCT GCTATCGATC AGATCGATAT CTGGAACCTC CTGATTCTGA CCTTTGCTTT GGGCATTGGC GCATCACTGG CAACTCCCGC GCTGAATATC ACTGCCCCCG AGCTTGTTCC CAGGAGTATG TTGCCGGAAG CGGTCGCATT GAGTTCATTG TCGATGAACC TGTCGCGTTC CCTCGGCCCA GCCATTGCCG GCGTATTGCT GGCCCAGATC GGCCCATGGG CCGCTTATGG CCTGAATGCG CTCTCGTTTA TAGGCATGAT TGTCGTCCTC TGGAGATGGA AGCGCGAACC GGAAGAGCGG TCGCTGCCAC CCGAACGCTT CTTCCAGGCA TTGCGCGCCG GGGTACGTTA TGCTCATGTA GCATCGCCTT TCCGAGCGGT GCTGATTCGT ACCACGGCTT TTATTCTCTT CGCAGCTTCG GGATGGGCGC TTCTTCCGCT GATTGCACGG GTCGAACTGG GCGGGGGACC CGGAACTTAT GGACTCTTGC TCTCCTTCGT CGGTATTGGG GCGGTGTGCG GTATCCTTGT TCTGCCCCGG CTGCATGAAC TCGCCTCCCG CGATCGCCTG GTGCTGGCGG CAAGCCTGAT TTATGGGGCA ACGATCATGG CACTGGCAAT CCTGCAGAGC GAAGAGATGC TTTACGCCAT CATGACGCTT TCCGGCGCGG CCTGGGTTAG CGTGCTCTGG TCACTGCAGG TTACCGCACA AACCTCTGTT CCTGCCTGGG TTCGGGGACG CGCACTGTCT CTCTATATCA TGGTCTTCTC CGCAGGCCTG GCATTGGGAA GCCTGTTCTG GGGATGGGTT GCTGCCAGCA CTACCGTTCC CACTGCCCTC CTGCTATCCT CGGCCGGGAC GATGGTGGCG GCGCTGGCTG TTCGCAATTT CAGTCTGGGC TCCCGGGAGG CTCCGGATCT TGCTCCTTCA TACCATTGGC GGCCGCATCC TCCAGCAATG GAAGAACCTG ACTTGCGCCG GGGGCCCGTG CTAGTCACTG TCGAATATGA GATTGGGCTG GATCAGCGGC GAGCCTTCCT GGAAGCAATC CGCTCACTGG GAGCATCGCG GCGGCGCGAT GGGGCGTTTG CCTGGGGGGT CTTTGAGGAC CTCGAGAAGC CGGGGCGTTA TATCGAATTC TTCCAGCAGG CCTCGTGGCT GGATCATCTG CGCCAGCATG CACGCGTTAC CCGCGAGGAC CAGCGAGTGC AGGAAAACGT CAACCGCTTT CATACGGGCA GCGAAGCCCC ACGCGTTTCA CACTTCATCG GTGGCACACC GACAGCGTCA ACCGACAGCC CGGCGGCAAC GGGAGGCATG ACTGAAGCAT AA
Protein sequence | MTKQARNKES NRGRSGTWEP LKILPFRAFW FAALGSNIGT WINGVSSAWV MTDLSPSPVM VSLVQAATSL PMVLFALAAG ALTDIVDRRR YLLFTQIWMA AAAAMLTVLA AIDQIDIWNL LILTFALGIG ASLATPALNI TAPELVPRSM LPEAVALSSL SMNLSRSLGP AIAGVLLAQI GPWAAYGLNA LSFIGMIVVL WRWKREPEER SLPPERFFQA LRAGVRYAHV ASPFRAVLIR TTAFILFAAS GWALLPLIAR VELGGGPGTY GLLLSFVGIG AVCGILVLPR LHELASRDRL VLAASLIYGA TIMALAILQS EEMLYAIMTL SGAAWVSVLW SLQVTAQTSV PAWVRGRALS LYIMVFSAGL ALGSLFWGWV AASTTVPTAL LLSSAGTMVA ALAVRNFSLG SREAPDLAPS YHWRPHPPAM EEPDLRRGPV LVTVEYEIGL DQRRAFLEAI RSLGASRRRD GAFAWGVFED LEKPGRYIEF FQQASWLDHL RQHARVTRED QRVQENVNRF HTGSEAPRVS HFIGGTPTAS TDSPAATGGM TEA
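
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is an illustrative sketch, not the method the database uses: it assumes translation table 11 shares the standard amino acid assignments and differs only in its permitted start codons (so the initial GTG is read as M). The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in NCBI TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring spaces, up to the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above (30 bases, 10 codons).
print(translate("GTGACCAAAC AAGCCCGCAA TAAAGAATCG"))  # MTKQARNKES
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to the same helper would be the natural way to check the remainder.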