Gene Information |
Locus tag | Franean1_2161 |
Symbol | |
ID | 5670561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2591904 |
End bp | 2593292 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241082 |
Product | glycosyl transferase family protein |
Protein accession | YP_001506503 |
Protein GI | 158313995 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGAG CGGGAGCCGC GGCGTTGCTG GTAACCCTCG CGGCCACACC GGTCGTGCTC GGCGCCATGC GACGCCTGGC CGCCATCGAC GATGTGAACG AACGGTCCTC ACACCAGGTA CCGACCCCGC GGGGCGGCGG CATCGCGGTA GCCCTCGGCC TCTTCGCCGG CGTAGTCGTC CTCATGCTCG CCAGCGGTCA CGGCGACGCC CCCGACCTGC TCCCGATGAC CGTCGGGGTG ACACTGTTCG GTCTCATCGG TCTCGCCGAG GACATCGGCG GCGACGTGGG CGGCATAGCC CCACTGCGCC GACTGGCACT GCAGCTGCTC GCCGGCCTGG CGGTGTCGAC CCTGCTCCTC ACCAGCGCGT CCCTCGACGC CGTGTCGATC CCACCCGTGG TGGTGCTGTC TGCCGCCGCA CTGATCGGCC CGCTGTGGGT GACGGGGTTC GTCAACGCCT TCAACTTCAT GGACGGGATC AACGGCATCT CGGCCGCGCA GGCCGCAGTC GCCGGCGGCG CCCTCGTCGT CGTGGGCCAT CTCCACGACG CGCCGGCCCT GGCTGCGGGC GGCGCCGTTC TTGCTGGGGC GGCGATCGGC TTCGCGCCGT TCAACTTCCC CCGCGCGCGG ATCTTCCTGG GCGACGTCGG CAGCTACACC CTCGGCGCCA GCCTCGCCGT ACTCGCGTTG CAGGGCGTCA TGTCCGGCAT CCCGGCCGAG GCGGTGCTCG CGCCGATACT GCTCTACCTC GCCGACACCG GCGCCACGCT GGTGCGTCGG GTCCGGCGGG GCGAGCGTTG GTACCTGCCC CACCGCACTC ACACCTACCA GCGGCTCACC GACGTGGGAT GGACGCACAC CCGGGTCACG TTGACGGTTG CCGGTCTCGT CGCGGCGATG TCGGGTCTGG GGCTGCTTGG CACGCGGGGT GGCACGGGTA GCCGAGTTGT GGCCGACCTC GGACTACTCG CCCTGGCCAC GGGCTACCTG AACGCTCCCC GGCTGATCGC ATCCGCCAGC GCGCGGATAC ACCCAGGACA GCCCGCTGGC ACCCGTCCGG CGTCGCCTCC AGGCCCTGTC GCAGAGCCCG ACCCAGCCGC CGCACCACCA CGTGCCGCAC CACCACTCGG CGCGTCACCA CTCGGCGCGT CACCGCCCGG CAGTCCACCG GCCGCCATCG CGGTGATAAA TGCCATGGCA GTAGGTCGGG CGCCGGCGGG GGACGGCGCC GGGAGGCTCG TTCTGCCACG CCAGCGACAG GCGGGCGATC CAGGGCTCAC CACCGATCCG CCCGAACCCG CCCGGATGAC CCAGCTACCC GACCAGGCCG GCGCCCGGCG ATACGGAGAT CAGGGAGATA CGGGAGATAC GGAGATCAAA GACGCCTGA
|
Protein sequence | MLGAGAAALL VTLAATPVVL GAMRRLAAID DVNERSSHQV PTPRGGGIAV ALGLFAGVVV LMLASGHGDA PDLLPMTVGV TLFGLIGLAE DIGGDVGGIA PLRRLALQLL AGLAVSTLLL TSASLDAVSI PPVVVLSAAA LIGPLWVTGF VNAFNFMDGI NGISAAQAAV AGGALVVVGH LHDAPALAAG GAVLAGAAIG FAPFNFPRAR IFLGDVGSYT LGASLAVLAL QGVMSGIPAE AVLAPILLYL ADTGATLVRR VRRGERWYLP HRTHTYQRLT DVGWTHTRVT LTVAGLVAAM SGLGLLGTRG GTGSRVVADL GLLALATGYL NAPRLIASAS ARIHPGQPAG TRPASPPGPV AEPDPAAAPP RAAPPLGASP LGASPPGSPP AAIAVINAMA VGRAPAGDGA GRLVLPRQRQ AGDPGLTTDP PEPARMTQLP DQAGARRYGD QGDTGDTEIK DA
|
| |
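The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: it assumes Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete (the listed gene sequence ends in the stop codon TGA). The function names span_length, expected_protein_length, and gc_fraction are hypothetical helpers introduced only for this example.

```python
# Values copied from the record above.
START_BP = 2591904
END_BP = 2593292
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: total codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """GC fraction of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Both checks pass for the values in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates and lengths are consistent")
```

Under these assumptions, 2593292 - 2591904 + 1 gives 1389 bp, and 1389 / 3 gives 463 codons, of which one is the stop codon, leaving 462 residues. The gc_fraction helper accepts the 10-base blocks exactly as printed in the Gene sequence field, so that field can be pasted in to compare against the reported GC content.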