Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0243 |
Symbol | |
ID | 3785729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 260176 |
End bp | 261567 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637810318 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_410943 |
Protein GI | 82701377 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03017] chain length determinant protein EpsF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTTC AACAATTTTT ACTCATCCTT CGTGCCCGGT ATAAGGTAGT CCTCTATATT TTATTGTTTA CGGTAATCGC TACCCTTGTA GTCAGTTTGC TATTGCCTAA ACAATATACC GCTGGTACAG CTGTAGTGGT TGACATTAAA TCGCCCGATC CGGTAGCGGG GATGGTATTG CCCGGTTTGA CGTCACCTAC TTACATGGCT ACCCAGGTCG ACGTGATCAA TAGCGACCGT GTAGCAGAGC GTGTCGTAAA AATGCTGCGC CTGGATGAAA GCCCTGTCGT AAAGGAGCAG TGGGAGGAGG ACGGAAGCAA GGGTGAGCTT ACCACCTGGC TGGCCAACTT GCTGAAAAGA AAGCTTGATG TCAAACCTTC CCGGGAAAGC AACGTAATCT ACATTGAGTA TACCGGGAAT GATCCCAATT TTGCCGCCGC TGTTGCAAAC GCTTTCGCTC AAGCTTATAT CGACGTGAAT CTGGATTTGA AGGTCGCCCC TGCCCGCCAG TATGCCCACT GGTTCAAAGG ACAAACTGCC GCTGCCAGGG ATGAGCTAGA GCGAGCCAAA GCTGCACTTT CCGCGTATCA ACAGGAAACC GGTCTCGTCG CTACCGAGGA ACGTCTGGAT TACGAAGTAG CCAAGCTCAA TGAACTTTCA AGTCAACTTA CTCTTATCCA AGCCCAGACA TCGGACAGTA GCAGTAAACG GAAAGCCGCT GAAGATCCGG ACACTTTAGC TGAGGTCATA CAGAGCCCGC TAATAAATAG CCTCAAATCG GATATTGCGC GCCTCGACGC AAAGCTTCAG GAAAGCAGCG CCTATCTGGG ACCAAATCAT CCACAAACCA AACGCACTCA ATCTGAACTC GCGTCTCTCA GAGGCAAGCT ATCCGCTGAA ACACGCAAGA TTCATAGCAG TATCGGCACT TCATATGAAG TGGGGAAGCG AAAGGAGCAA GAGTTGCTCG AGGCGATGGA AAGGCAGAAG GGGCGCGTGT TGGAGCTTAA CAGACAGCGT GATCAGATGA GTGTTCTTCA AGGGGATGTG GAAGCTGCGC AACGCAATTT CGAAGGCATA AGCCAGCGTA GTGCCCTCAC CCGTCTGGAA AGCCTTTCCG TTCAGACTAA CATAACCCCG TTGAACCCTG CTTCAGTCCC GAGTGAGCCG TCAAGCCCTA AATTGCTGCT CAATACATTG ATCTCCATCT TTCTAGGTAC GCTGTTGGGT GTGAGCGCCG CTCTGGTTAT GGAATTGATG AATCGACGCG TGCGTTCTGT AGAGGACATC GTGGAAGCCA TAGAGATTCC GGTGCTGGCG GTAATGTCCG GCCCGTCTCG CAACAAGCTT TCGAGTCGTC TTCCCAAATT ACCCAACCCG GAAACCAGTT AA
|
Protein sequence | MTLQQFLLIL RARYKVVLYI LLFTVIATLV VSLLLPKQYT AGTAVVVDIK SPDPVAGMVL PGLTSPTYMA TQVDVINSDR VAERVVKMLR LDESPVVKEQ WEEDGSKGEL TTWLANLLKR KLDVKPSRES NVIYIEYTGN DPNFAAAVAN AFAQAYIDVN LDLKVAPARQ YAHWFKGQTA AARDELERAK AALSAYQQET GLVATEERLD YEVAKLNELS SQLTLIQAQT SDSSSKRKAA EDPDTLAEVI QSPLINSLKS DIARLDAKLQ ESSAYLGPNH PQTKRTQSEL ASLRGKLSAE TRKIHSSIGT SYEVGKRKEQ ELLEAMERQK GRVLELNRQR DQMSVLQGDV EAAQRNFEGI SQRSALTRLE SLSVQTNITP LNPASVPSEP SSPKLLLNTL ISIFLGTLLG VSAALVMELM NRRVRSVEDI VEAIEIPVLA VMSGPSRNKL SSRLPKLPNP ETS
|
| |