Gene Information |
Locus tag | Franean1_6483 |
Symbol | |
ID | 5674798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7881303 |
End bp | 7882808 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641245331 |
Product | hypothetical protein |
Protein accession | YP_001510726 |
Protein GI | 158318218 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
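
The coordinate and length fields above agree with one another, and the arithmetic can be checked directly. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both assumptions are inferred from the listed values, not stated in the record. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
length_bp = gene_length(7881303, 7882808)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1506 501
```

Under these assumptions the computed values match the Gene Length (1506 bp) and Protein Length (501 aa) fields.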
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.159074 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACGA TGTCCCGCAG ACGCTCCGCA CGCGGCCCGC GCCGCGGCCC GCCTCGCCCG GCGGGCCGTT CCGGGCCGGC GGGAGCCGCG GAACGCCACG AGTCACCGCC GGCGGCGGGT CACAACGACC CGCCGCCGGC GCGGCACGAC CCGCCGCAGG CGCGGCTCCT GGTGGCGGCG GTCTGCGTCC TGGCGATGGC GGCGCTGGGC CTCGGGCTCT TCTCCGATGT CCTGACCCGG CACGTCCTGA CCCTCGGCCC TGGCCGGGCC GGTGGCCCGC CGCCCGCAGG CAGCGGCGGG GGGCCACCCA CGCGGGGCGC CACGACGGAA GAGGCGGAAC AGCCCCCCTG GGGCGCTGCG ACCGGTCAGC GGCCGGGGCC GGTGTGGCCA CCGCGGGCCG GCAGCCCGTC CGATGCCGGG GACACGACCG AGCTCACCCT GACCTTCAGC GGCGATCTGG TCCTCGACCC GTCCAGCGCC CGCGCGGCGC TGGCACCGCT CGGCAGCCTG TTGTCCTCCG CCGACCTCGC GATCTGCCGC GGGCCGGCTC CCACCGCCCA CCCGGAGGTC ATCGCCGAGG CCCTGCGGCG GGTGGGCTTC GGCGCGTGCG CCACCGCGTC CGGCCGGGCG GCCCGGCTGG GCGGTGCTGG GGTGCGCGGC CTGCTGGACG CCCTCGACGG CGCGGCGATC GACCACAGCG GCACGGCCCG CGAGCCGCTG GACGCCGCCA CCCTGTCGCT GCTGCCCGTG CGCGGCGCGC AGATCTCGCT GCTGTCCTAC ACCGAGGACG CCGGCACCGA CCCCGCTCCC GGCTCACCCG GAGCGGACCC GCCCGGCTGG ACGGTCAACG AGCTGGACCC GGCGCGGATC CTGCGGGACG CCGCCCGCGC CCGCCAGGCC GGCGCCGACC TCGTCGTGGT CGCGCTGTCC TGGGCCCCCG ACCAGGCCGA ATCCGCGCGG AGCGCGCCGA CGGGAACAGC GCCCACGGGG ACGGCTCCGA CGCAGCGGCA GCGGATGACC GCGCGCGAGC TGCTCCGCTC CCCGCTCGTT GACCTGGTCG TGGGCACCAG CGCCGGCACG GTGCGGCCGG TCGAACGCGT CGACGGCAAG TACGTCGCCT ACGGGACGGG CTCGATCACC ATCCCGGCAG CGGGCGGCCT CAGCGGCGGT GCCGGCGGCG TCCCTGGCGG GGAAGCAGGC GCCGAACCGG GCGTGGACGC CGCCGGGCGG GACCGGGAGC GGGACGGCGC GCTCCTGCAC GCCCGGGTAC GGCGCACGGC GCTCGGCTGG ATGGTCGTCG GTCTCACCTA CAGCCCGATC TGGACGGGGC CGGACGGCGT CGTCCGCCCG GTAGCGGACG CCCTCGACGA CCCGGGCACG TCCGAGGCGG CACGGGCCGA GCTGACGGTG TCCTGGCTGC GCACCGTGGC CGCGCTGACC TCGCTGGGGC AGGTCGACGG GGTCCGTCCG GAACGGGTGC CGCGCCAGCC CGGGGCCGGT GCCTGA
|
Protein sequence | MGTMSRRRSA RGPRRGPPRP AGRSGPAGAA ERHESPPAAG HNDPPPARHD PPQARLLVAA VCVLAMAALG LGLFSDVLTR HVLTLGPGRA GGPPPAGSGG GPPTRGATTE EAEQPPWGAA TGQRPGPVWP PRAGSPSDAG DTTELTLTFS GDLVLDPSSA RAALAPLGSL LSSADLAICR GPAPTAHPEV IAEALRRVGF GACATASGRA ARLGGAGVRG LLDALDGAAI DHSGTAREPL DAATLSLLPV RGAQISLLSY TEDAGTDPAP GSPGADPPGW TVNELDPARI LRDAARARQA GADLVVVALS WAPDQAESAR SAPTGTAPTG TAPTQRQRMT ARELLRSPLV DLVVGTSAGT VRPVERVDGK YVAYGTGSIT IPAAGGLSGG AGGVPGGEAG AEPGVDAAGR DRERDGALLH ARVRRTALGW MVVGLTYSPI WTGPDGVVRP VADALDDPGT SEAARAELTV SWLRTVAALT SLGQVDGVRP ERVPRQPGAG A
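
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below shows one way to strip the spacing, compute GC content, and translate the coding sequence. It is an illustration under stated assumptions, not the portal's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so a standard codon-to-residue mapping is used here.

```python
from itertools import product

# Codon table in TCAG order. Table 11 shares these amino-acid assignments
# with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
fragment = "ATGGGAACGA TGTCCCGCAG ACGCTCCGCA"
print(translate(fragment))  # MGTMSRRRSA
```

Passing the full Gene sequence field to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields. The mapping raises `KeyError` on ambiguous bases such as `N`, which is acceptable for a sketch but would need handling in real use.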