Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0278 |
Symbol | |
ID | 3785524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 300168 |
End bp | 301526 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637810354 |
Product | glycosyl transferase family protein |
Protein accession | YP_410978 |
Protein GI | 82701412 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0115898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACACAC TTCCAGATCC AGAAAGTAAC ATTGAGGGCA TCCATCGCTT TCGACGCCGC CACAGAATTG CCGCTCCCTT GTTTCAGCTC CTCGATTCGC ATGTTTTCCT GCCGGCGTGT TTCATACTTT TCGTGGCATT GCGCGTAGCC CTGATTTTCT TTGTGCCCGT TGAGATGACT TCCGATGCAA GCTGGTACTT CAACAGAGCG GTTGGCATAG CGAGCGGCGG CGGCTATTCG GAAGCGGGGT ATCCGACAGC CTATTGGCCG GTTGGCTATC CCGGCTTTCT TGGTATCCTT TTCTATCTTT TCGGTCGGGA TCAATTGGTA GGGCAGATAG CTAATTTGGT TATGGCAGCC CTCAGTTTTT TTCTGCAACT GGAGCTGACA CGAAGAATTT TCCGGAGCGA AGCCGCTGCC CGCTTGGGCG TCCTCCTGCT TACCCTATAT CCGAATCATG TAGCTTATAC GTCTTTTATC TTAACCGAGG TTTACTTTAC CTTTTTACTG CTTCTTGGCG TATACCTCTA CATCACAAGG AGCCGATGGT TATGGATATG GGTTTGCGGC ATTGTCTTCG GGCTGGCCGC GCTGACCAAA CCGCAAGCAG TGTTCTTGCC TGGTCTGCTG GTGCTTTTTC ACGTATTCAG CGCCGAGAGA AAGGATAGGC TGAGGCAGCA CCTCATCAAG GGATTTGCAA TCTATCTTGC TATGGCTATG GTGCTGGTCC CCTGGGCGGT GCGGAACACG ATGATTTTCG GGGAGCTCGT CCTGATTTCC ACTAACGGAG GGGCAACGCT CCTGACCGGA AATCATCCCA CAGCCAGCGG AGGCTATGAG GAAAACGATC CGCTGGTGGC CCAGCGCAAT TTCTCGGTTC AGGATCAGGT GGAATCTGAC CGGCGCGCAA AGAAGCTGGC AACCGATTGG ATAAGGGAAA ACCCCGTGCG GTTTGTAGAG CTTATCCCTC TGAAGATATG GCATCTGTGG TCCAGAAATG GCGAAGCGGA GTGGGCCTAT CAGGCGGGGT ACCGGTACTA TGAGCAGTAT AGCGGCGCGT TCCGAACCAT GAGGTGGATA AATCAGATCT TTTATGCTCT GCTGCTTGTG GGCTCGTTCG CGGCAGCTTT CCTGTTGATA AGGCATCCCG ATAAAGTCGC TTGGCCGTGG GTGCTCGTGG GATACTGCCT GATGATTTAC CTGACCTTGA TATCGGTGGT CTTTTCCGGG CAACCCAGGT TTCATTTTCC GGCCATGCCA TGGGCCATCA TGTATGCAGC GTGGGCAGCC GTGATGATCA CGGTGAATCA GACCCGGGAA CGGTATGACG CCTACGTCTC CAGAACATCG GATTTCTAG
|
Protein sequence | MNTLPDPESN IEGIHRFRRR HRIAAPLFQL LDSHVFLPAC FILFVALRVA LIFFVPVEMT SDASWYFNRA VGIASGGGYS EAGYPTAYWP VGYPGFLGIL FYLFGRDQLV GQIANLVMAA LSFFLQLELT RRIFRSEAAA RLGVLLLTLY PNHVAYTSFI LTEVYFTFLL LLGVYLYITR SRWLWIWVCG IVFGLAALTK PQAVFLPGLL VLFHVFSAER KDRLRQHLIK GFAIYLAMAM VLVPWAVRNT MIFGELVLIS TNGGATLLTG NHPTASGGYE ENDPLVAQRN FSVQDQVESD RRAKKLATDW IRENPVRFVE LIPLKIWHLW SRNGEAEWAY QAGYRYYEQY SGAFRTMRWI NQIFYALLLV GSFAAAFLLI RHPDKVAWPW VLVGYCLMIY LTLISVVFSG QPRFHFPAMP WAIMYAAWAA VMITVNQTRE RYDAYVSRTS DF
|
| |