Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0414 |
Symbol | |
ID | 3784164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 458663 |
End bp | 460564 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810490 |
Product | glycosyl transferase family protein |
Protein accession | YP_411114 |
Protein GI | 82701548 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCAGA CATACTCTCC CTCAGTCAGC GTAGTAGTTC CCACTTATGA GCAAGCACGA TTCATTGGCC GCGCGTTGGA CAGCCTGCAG GCGCAGGTTT TGACGGATTG GGAAGCAGTC GTCATCGACG ATGGTTCACG GGATGCTACA GCCGAAGTCG TATCGGCATA CCTGGGCGAC ACCCGCATTC ATTACTATCG CTTTCCCGAA AATCGGGGGT TGGGTCGCAC CCTGAATGAA GGCATTGCCA AAGCAAAAGC GCCGCTCATC GCTTACTTGC CAACCGATGA TGTTTATTAC CGCGATCACC TGGGCAGCCT GAAAACATGC CTCGAGACGC AGAAAGGCGC TGTGTTGGCT TGTTCAGGCG TTCGCCACCA TTACAACCGC GAGGCCATTG GGCAAATTCC GGAATTTCCT TTGCAACTGG TGCAGTGCAT GCATCGGAAA ATGCCGGTGC GGTGGGTGGA GCGCGCAGAA TTGGAGTCGG ATGATCTTGA GCGCCTCTAC TGGAGCCTGC TGCGGCCCCT CGGCGCTTTT GCCGAAAGCG GAACTCTTAC CTGCGAGTGG GTCTCCCACC CGATGCAACG CCACAAGATC ATGCAGGAAC CGGAAGGCGG CATCAACACG TTTCGCTCCC ATTATCGGGT CAAAGAACCG CTGCGCTTTC ATACAACCGT AGGCCATCGG ATCGATGAAG CGGACCATTA CCGAAAAATG CGCGAACGGC CGGATACTCC TCGTGCCGAC AACGGGTTGA AAATACTTCT GGTAGGAGAA CTGGCCTATA ACGCCGAGCG CGTGCTTGCC CTGGAAGAGC GGGGACACAA GCTGTATGGC CTGTGGATGC AGAACCCGTA CTGGTACAAC ACGGTAGGGC CGATGCCTTT TGGGCATGTG GAGGACCTGC CGCGCGACAA CTGGCGCGAA GCAGTGAAGC AGGTGCAGCC TGATGTTATC TATGCATTGC TCAACTGGCA GGCAGTGCCG TTTGCACATG AGGTATTGAT GGCCACGCAC GGCATTCCTT TCGTCTGGCA TTTCAAGGAA GGTCCTTTCA TCTGTCTTGA AAAGGGAACC TGGCCGCAAC TGATCGACCT GCACCGGTAT TCGAACGGCC AAATCTTCTC CAGCCCCGAA ATGCGGGATT GGTTTGATAC CATCATTCCC GGTTTGTCGC TGGATAAACC CACCCACGTG CTGGATGGAG ACCTGCCGAA GCGTGACTGG TTTGATCAGC CTCGTGCGCC GCTGCGTTCT GAAAGCGAAG GGGATATTCA TACCGTATCA CCCGGACGAC CCATCGGGCT GCATCCGCAT CATGTTGCGG AGTTGGCCCA CCATGGCATT CATCTGCATT TTTATGGGGA GATAACCCAT GGACAATGGC TGCAATGGAT TGAAAAAACC CAAGCAATCG CGCCAGACTA CCTGCATTTA CACCCGAACG TGGATCAAAG TCACTGGACT GCCGAGTTCT CCCAATATGA CGCGGGCTGG CTGCATCTCT TCGAAAGCAG CAACAGAGGG GAAATCAGGC GGGCAAACTG GGATGACCTG AACTACCCGG CAAGGCTGAG TACCCTTGCA GCGGCCGGCC TGCCGATGCT GCAGAAGGCG AATGAGGATG CTCTCGTCGC CACTCGTACA TTAGCAAAGC AGCTTGATAT TGGAATTTTT TTTGATACCG TGGATGAGCT CGCCGCACAG TTGCGCGACT GCAAGCAGTT GCGAGCGACG CGTGAGCGGG TCTGGCAGCA GCGGCACCTG TTCACATTCG ATCATCATGT CCCAGAACTG GTTGATTTCT TCCGCCTCGT AATAGCGTCC ACTTCCAGGA AACAGACACG CACCGGAACT ACAGTAGAAT CTGCTCATCC GCAAAGCCGC AGGCTGGCAT GA
|
Protein sequence | MPQTYSPSVS VVVPTYEQAR FIGRALDSLQ AQVLTDWEAV VIDDGSRDAT AEVVSAYLGD TRIHYYRFPE NRGLGRTLNE GIAKAKAPLI AYLPTDDVYY RDHLGSLKTC LETQKGAVLA CSGVRHHYNR EAIGQIPEFP LQLVQCMHRK MPVRWVERAE LESDDLERLY WSLLRPLGAF AESGTLTCEW VSHPMQRHKI MQEPEGGINT FRSHYRVKEP LRFHTTVGHR IDEADHYRKM RERPDTPRAD NGLKILLVGE LAYNAERVLA LEERGHKLYG LWMQNPYWYN TVGPMPFGHV EDLPRDNWRE AVKQVQPDVI YALLNWQAVP FAHEVLMATH GIPFVWHFKE GPFICLEKGT WPQLIDLHRY SNGQIFSSPE MRDWFDTIIP GLSLDKPTHV LDGDLPKRDW FDQPRAPLRS ESEGDIHTVS PGRPIGLHPH HVAELAHHGI HLHFYGEITH GQWLQWIEKT QAIAPDYLHL HPNVDQSHWT AEFSQYDAGW LHLFESSNRG EIRRANWDDL NYPARLSTLA AAGLPMLQKA NEDALVATRT LAKQLDIGIF FDTVDELAAQ LRDCKQLRAT RERVWQQRHL FTFDHHVPEL VDFFRLVIAS TSRKQTRTGT TVESAHPQSR RLA
|
| |