Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5320 |
Symbol | |
ID | 5673654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6408825 |
End bp | 6411098 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244177 |
Product | hypothetical protein |
Protein accession | YP_001509584 |
Protein GI | 158317076 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.323329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.16375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCCAC CAGCCGAAGG GCGGTCCAGG CCGCTCGGTA CCGTGCCGAG GGTGAGCAGG TCGAGTGATC CCGGCGCCGG CGCGGTCTCC GCGCCGGACG AGCCGTCCGA CCGGATCGGC AAGGCCGCCG CGGACCAGAT CGGCACGGTC GTCCCGGACC GCCTGGTCGT GACGCCTGAG CCGCTGCCCC GGCGGCTGTG GCGTCGCTGC CAGGAGGTCA GCACGGCCGG CCGGTACGTG GCGGTCCTGG CCTTGACCAG TGCGACGGTG GCCGTCCTGC TGCGGCACAA CCTGTTCCCG TACCTGTCGG TCAACAACGA CGAGGTCATC TACCTCCTGC ACGCCCGGAC GCTGGCGGAC GGCCACCTCT TTCCGTCTGC GCCTGATCCA GCCGCGTCGT ACGCACCCTG GCTCGCCGCC ATCTCGGGTG ATCACTTCGT TCTGAAGTAC ACGCCGTTCG TCCCGGGGTT GTTCGCGTTG GGCCTCATGC TGACGGGCAG CGTCTCGCCG GTGCTGGCGG TTATCGCGGC GGCGGCGGTG ATCGTTACCT ACCTGCTCGG TGTCGAGCTG GCGGGGGAGC GGAGGGTCGC GGCGCTGGCC GCGACGCTGC TGGCCCTCTC CCCGTTGGTG ATCGTGCAGA GCGCGCTGGT GCTGAGCTAC CTGCCGGTGC TCGTGCTGAT GGAGCTGACG TTGCTCGGCC TGCTCCGAGG GCTGAGAGCC GGCGGGCTGA GAGCCGGCGG GCTGAGCGAT GGCGGTCGGC GCTCCGCGCG GCACGGCGGG CGGGCGCTTG CCGGAGCGGG GCTCGCGGTC GGCGTGGCCG TGGCGGTGCG GCCGTATGAC GTGGTCCTCC TGCTCGCCCC GGTGGCGGTC TGGGGCGTCG TGACGGCGCG CAGGTCCGGG CGGCTCGGGT GGGCCCTGCG CTGGACGGCT GCCGGGCTCG TGCTCCCGGC CGCGATCCTG CTGGCGTCCA ATGCCGCGGC GACCGGCAGC CCGTTCAGGT TGCCGTTCGC GCTGCTCGAA TCGGACGACA AGCTCGGGTT CGGCGTGCGC AGGCTGTACC CGTCGGACGG CGGGCACGAC TTCGGGCTCG GTGACGGACT GGCGTCGGTC GGGGATCATC TCTGGCTGCT CGGTGGCTGG GCCTGCGGCG GGGTGGTTCT CGCCGTCGCC GCGATCGTGG CGGCCGCGCG GCGCCGGCTG AATGGCCCGG GGTACGCCCT CGGCGTCGGC ATGGTGCTGT TCGTGGTCGG TTACATCGGC TTCTGGGGGG CGTGGAACGC GGCGGAGCTG TGGGGCGGCA TCCGATACGT CGGGCCGTTC TACCTGATGC CTGTGCTGAT CGGGCTCGTG CACCTCGGTG CGCGCGGGCT GGTCGACCTC GCCGGCTGGT CACGGCGGCG GGCGGCACGG ACGGTGACCG GGGTGTGCGC CGCGGGTGTC GTCGGGCTGA CGACGTTCGT CCTCGTCGGC GCGATCGACG CGAACGCGAC GATGACGGAC CACGACCGTG ACCTGGCGGC GATGCTGCGG GCCCTGCCCG GGCGGTCCCT GGTACTGGTG GCCGCCAGCC CGCCCTACCT GGGCCATCCG AGCGGTGTCA CCACCAACGG GGCCGATCTC GACGGTGCCG ACGGTGACGG GCCGTTGCTG TTCGCGGTCT CGCGGGGGGT GGCCGACCTG GAGGTCGTCG CCGACCACCC CGACCGCACG CCGTACCTGC TCCGCATGCC GCCGGCCTAC AACCGGTCCC CGGGCTCGGT GACGCGCTCG CGGGTGGACG CGCTCACGGT GGCCACCGGT CGTACGGTCG GCGTCGAGGT CAGCGTGGAC GCGCCGCCGC GAGGGACGCG CGCGGCCCAG CTGGTGTTCG AAGCCGGTGG GGTCCGGCTG ACCTACCCGG TCTCAGCGAA CGGGCCCGTG ACCGCGCGGC TCACCCTTGA CGCGGACGGC CTCGACACCG ACGACGTCAC GGAAGTCGTG ATCTACGGTG GCGGCGAGAC CAGGGGATCT CCGGCGGGCG CGGCCACCGG GCGCCCGGCC GGCCGGGCGA AGATCACGAA GGTGCCCGGG GTGGGGACGT CCGTGACGGT GTCGCTGCTG GCCATCCCGG CGTCCGGTGG CCGTGCGCGG ACCGTCGACC GGCAGGTGAT CCCCGTCCTC GTCGAGGATC CCAGCGGCGA GACGCCGGGC GATGTGGCAG TCCTCGCACC GAGCGCCCAC GTGGACGAGA CCGGCCAGGG CCCGCGGCCC GCCGTCCGCA TCGCGCTGTC CTGA
|
Protein sequence | MGPPAEGRSR PLGTVPRVSR SSDPGAGAVS APDEPSDRIG KAAADQIGTV VPDRLVVTPE PLPRRLWRRC QEVSTAGRYV AVLALTSATV AVLLRHNLFP YLSVNNDEVI YLLHARTLAD GHLFPSAPDP AASYAPWLAA ISGDHFVLKY TPFVPGLFAL GLMLTGSVSP VLAVIAAAAV IVTYLLGVEL AGERRVAALA ATLLALSPLV IVQSALVLSY LPVLVLMELT LLGLLRGLRA GGLRAGGLSD GGRRSARHGG RALAGAGLAV GVAVAVRPYD VVLLLAPVAV WGVVTARRSG RLGWALRWTA AGLVLPAAIL LASNAAATGS PFRLPFALLE SDDKLGFGVR RLYPSDGGHD FGLGDGLASV GDHLWLLGGW ACGGVVLAVA AIVAAARRRL NGPGYALGVG MVLFVVGYIG FWGAWNAAEL WGGIRYVGPF YLMPVLIGLV HLGARGLVDL AGWSRRRAAR TVTGVCAAGV VGLTTFVLVG AIDANATMTD HDRDLAAMLR ALPGRSLVLV AASPPYLGHP SGVTTNGADL DGADGDGPLL FAVSRGVADL EVVADHPDRT PYLLRMPPAY NRSPGSVTRS RVDALTVATG RTVGVEVSVD APPRGTRAAQ LVFEAGGVRL TYPVSANGPV TARLTLDADG LDTDDVTEVV IYGGGETRGS PAGAATGRPA GRAKITKVPG VGTSVTVSLL AIPASGGRAR TVDRQVIPVL VEDPSGETPG DVAVLAPSAH VDETGQGPRP AVRIALS
|
| |