Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1413 |
Symbol | murD |
ID | 3903394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1702379 |
End bp | 1703794 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637878750 |
Product | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase |
Protein accession | YP_480519 |
Protein GI | 86740119 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase |
TIGRFAM ID | [TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0329429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0755004 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGCG ACGGTCAGGG CCCGGGCGAC GGTCAGGGCC CAGGCAACCT CACGACCTCG ATCTCCTGGG CGGGCCTGCC CGTGCTGGTC GTCGGGATCG GGGTCTCCGG GCTCGCCGCG GCCCGGGCGC TGCTTGCCCG CGGTGCGCGG GTCCGGGTCG TCGACGCCGG CGACTCCCCA CGTCACCAGG GTGCGGCCGC GACGCTGCGG GCGCTCGGGG CCGAGGTGAA CCTCGGCGGC CTGCCGGCCG GCCCCGGGGA CAGCGCCCTC GTCGTCACCT CGCCCGGCGT CCCGCCGACG GCCCCGCTGA TCACCGGGGC GGCCGGCGCC GGGATCCCGG TCTGGGGGGA GGTGGAGCTC GCCTGGCGGT GGCGGGGTGG GAGCCGGTGG CTCGCCGTCA CGGGAACGAA CGGCAAGACG ACGACCACCG AGATGCTCGG CGCGATGCTC GCCGCCGGTG GCCGGCGATC CACCACCGCC GGGAACATCG GGACCCCGAT CGTGGACGCG GTTGCCGCCG AGCCGCCCTA CGAGACCCTC GCGGTGGAAC TGTCCAGCTT CCAGCTCCAT TACACCCACA CCATGGCCCC CCTCGCCGCG GCCGTGCTCA ACGTCGCCCC CGACCATCTC GACTGGCACG GGGGCGCGGC GGCCTATGCG GCAGCGAAGG CGGGGATCTG GCGTCGTCCC GGGACGACCG CGATCGGCAA CGCCGACGAC GCAACCAGCG CGGACCTGCT CGCCGCCGCT CCGGGGCGGC GGGTCCTGTT CGGTCTCGAC CCGGCGGCCC GGCCGCGGCC CGGCCTGACC GTCGTCGACG GCCACCTCGT CGACGACGCC TTCGGTGGCG GGCGGCTGGT CGCCGTCCGT GACCTCGTGC TGACAAGTCC GCACATGATC TCCAATGCGC TGGCCGCCGC CGCGCTTGCC CGCGCCGAGG GCGTTGGCCC GGCCGCGATC GGCGCGGCGC TCGTCGCTTT CCGGCCCGGC GCCCACCGCA ACGCCGAGGT GGCCGTCATC GACGGGGTCC GCTGGGTGGA TGACAGCAAG GCGACGAACC CACACGCCGC CGCCGCGTCG CTGGCCGGCT ACCCCTCGGT GGTGTGGATC GCGGGCGGCC TCAACAAGGG CCTCGCCTTC GACGATCTCG TCCGCGACGC GCGCCGGGTG CTGCGCGCCG CGGTCCTCAT CGGGCGCTGC GCCGATGAGA TCGCCGCCGC TCTCGCCCGA CACGCCCCCG ATGTCCCCGT GGAACGGGCT GACGGCATGG ACGATGCGGT GAAGGCTGCG GCGGCGTTCG CGACCACCGG TGACACTGTG CTCTTGGCGC CGGCCGCGGC CTCGATGGAC ATGTTCCGGG ACTACGCGGC GCGCGGAGAT CTGTTCGCGG CGGCGGTCCG CGCACGAGAG GGGTAA
|
Protein sequence | MTGDGQGPGD GQGPGNLTTS ISWAGLPVLV VGIGVSGLAA ARALLARGAR VRVVDAGDSP RHQGAAATLR ALGAEVNLGG LPAGPGDSAL VVTSPGVPPT APLITGAAGA GIPVWGEVEL AWRWRGGSRW LAVTGTNGKT TTTEMLGAML AAGGRRSTTA GNIGTPIVDA VAAEPPYETL AVELSSFQLH YTHTMAPLAA AVLNVAPDHL DWHGGAAAYA AAKAGIWRRP GTTAIGNADD ATSADLLAAA PGRRVLFGLD PAARPRPGLT VVDGHLVDDA FGGGRLVAVR DLVLTSPHMI SNALAAAALA RAEGVGPAAI GAALVAFRPG AHRNAEVAVI DGVRWVDDSK ATNPHAAAAS LAGYPSVVWI AGGLNKGLAF DDLVRDARRV LRAAVLIGRC ADEIAAALAR HAPDVPVERA DGMDDAVKAA AAFATTGDTV LLAPAAASMD MFRDYAARGD LFAAAVRARE G
|
| |