Gene Information |
Locus tag | Nmul_A0421 |
Symbol | |
ID | 3784171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 467209 |
End bp | 469119 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810497 |
Product | glycosyl transferase family protein |
Protein accession | YP_411121 |
Protein GI | 82701555 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
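The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption; the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 467209
end_bp = 469119

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1911, matching "Gene Length | 1911 bp"
print(protein_length)  # 636, matching "Protein Length | 636 aa"
```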
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGAGC TGGCTGGAGT TGGGCGACGT CCCCTATTTC CGGATCACAG TCGTTTTGCA GTCCCAGGGT TATTGGCCGC GTTCCTCGAC GGACTGCTGT TCGAGTCGCT GATGAGCTAT GGGGCAAACC TCGCCCTGTC GCATATCGTG GGCTTCCTCG CGGCTGCGAC CTTGCACTAC TGGTTTATTT CGAGACCGCG GCTTTTATAC GACCTCGCGG GCTATTCCAG ATGGACTGAG CTCGGCCGCT TCCTGGTTAT CTGTGTGCTG GCGCTTTCCG TGCGCGGCGG CCTATTAGCG GTGCTAATCC AGGCCTGGGA TATACCGCCG GCTGCGGCCA TACTCCCTGC AATCGCGGCC ACAGCCGTAA TGTTGTATCT GGGATCTGTT TTCTACGTGA TCCCCAGCGG GAACGCCCTA CCCTCGCCTG AGGCAGACTG GCGCATCGCC TCTTTCGGGA TCATCGCATT TGTCATCCTG TTGCGCCTCA TCTACATTGG CGTGGCGCAA CTGATTCCTG ATGAAGCATA TTACTGGAAT TACGCCCAGC ACATGGATCT GAGTTTCTAC GATCACCCAC CTATGGTTGC CTGGTTGATA TGGCTTGGCA CATCCATTTT TGGAAACAGC GAATTTGGCG TCCGGGTCGG AGCATTCCTG TGCGGCTTGA TAACAATGGG GTATTTATTT GCGCTCACAC GCAATCTATA CGATAGCGCA ACGGGGATTC GCGCTGTGCT GCTGCTTGCG ATTCTTCCCT TTTTCTTTGC TACTGGCGCA TTGATGACGA CAGATGCGCC GCTCGTGGCC GCGTGGGCTG CAACTCTGTA CTATCTCGAG CAGGCCCTGA TCGCCCAGCG GGAGCGGGCT TGGCTCGGGG TGGGCATTGC CTTCGGTCTG GGCATTCTCT CCAAGTATAC CCTGGGTTTG GTGGGTATTG CCGCACTGGT TTTTGTGATT ATCGACCCGG TTGCACGCCG CTGGTTGCGC CATCCTTATC CGTACCTCGC TGTACTGCTT GCACTGCTCT TGTTCTCCCC CGTAATTATC TGGAACATGG AGCATGGCTG GATGTCGATA TTGTTCCAGT CCGGCCGCGT CAGGGGCGTA GGAGATGACG AGTTCGGATT ATTCGAATTG ATTCTTCATT TTGCCGTACT GCTTACGCCG ACCGGCTTGC TCGCTGCCGC GCTAGTATTG CTGCCCGGCA CCCGGCAGAA TAATAGCGTC TCCATTTGCA GGCGCCGCCT GTTCATTATG GTATTCACCG GCGTGCCGCT GGCGATCTTC CTGATTCTCA GCATTTTCGA CACCCAGCGG TTCCACTGGA CCGGGCCGCT ATGGCTTATG GCACTGCCTG CCATGGCTTC GATGATGGGG AAAATGCGAA ATTTCTCTGG TAACCCGACG ATTGCGGATC GGGTAATGGC AGCTTGGAGG CCCACTATTC TGGCATGCCT GGTTTGTTAT GCGATTGCAC TGCACTACCT CGTTCTCGGC TTCCCAGGCA TTCCCTACCA GGGCATATAT AAAGGTTTCC CCGAGCATTA TTTCTGGCGC CAGGCAACGA TGGGCATCGA GCAGATTGTA GAGGATACGC GGCGGCAAAC CGGCCAGGAG CCCATCGTCG TGGGCATGAG CAAGTGGTCC GTGGCCAGCA TCGTCACTTT CTATAATCGG GGCAAACCCA TGGAAATACG TTCGCGCAAC ATATTCGGCG ACAGCGGCGC CATGTACGAT CTCTGGTATC CGTCGGAATC CCCTACGACG CGACCCGTCA TTCTCGTCGG CATGTGGCAG 
GATAATCTCG AATGCGTCCG AGACAGCATC GAGATCGGGA AGATGCTCGC CAATCCTGAT CCGGTTCGGC AACTAGGTTC AGGACCTATT AATTTCATGA TTCACGAATA G
|
Protein sequence | MLELAGVGRR PLFPDHSRFA VPGLLAAFLD GLLFESLMSY GANLALSHIV GFLAAATLHY WFISRPRLLY DLAGYSRWTE LGRFLVICVL ALSVRGGLLA VLIQAWDIPP AAAILPAIAA TAVMLYLGSV FYVIPSGNAL PSPEADWRIA SFGIIAFVIL LRLIYIGVAQ LIPDEAYYWN YAQHMDLSFY DHPPMVAWLI WLGTSIFGNS EFGVRVGAFL CGLITMGYLF ALTRNLYDSA TGIRAVLLLA ILPFFFATGA LMTTDAPLVA AWAATLYYLE QALIAQRERA WLGVGIAFGL GILSKYTLGL VGIAALVFVI IDPVARRWLR HPYPYLAVLL ALLLFSPVII WNMEHGWMSI LFQSGRVRGV GDDEFGLFEL ILHFAVLLTP TGLLAAALVL LPGTRQNNSV SICRRRLFIM VFTGVPLAIF LILSIFDTQR FHWTGPLWLM ALPAMASMMG KMRNFSGNPT IADRVMAAWR PTILACLVCY AIALHYLVLG FPGIPYQGIY KGFPEHYFWR QATMGIEQIV EDTRRQTGQE PIVVGMSKWS VASIVTFYNR GKPMEIRSRN IFGDSGAMYD LWYPSESPTT RPVILVGMWQ DNLECVRDSI EIGKMLANPD PVRQLGSGPI NFMIHE
|
| |
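The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The following is a minimal sketch of that translation in plain Python, not the tool used to produce this record. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the sequence is given 5' to 3' on the coding strand, as it appears here (it begins with ATG and ends with TAG).

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in TCAG order with the third base varying fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored so the space-separated 10-base blocks used in this
    record can be pasted in directly.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCTTGAGC TGGCTGGAGT TGGGCGACGT"))  # MLELAGVGRR
```

The printed peptide matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence would be expected to reproduce all 636 residues, provided the blocks are pasted without alteration.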