Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6403 |
Symbol | |
ID | 5674718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7772815 |
End bp | 7775067 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641245251 |
Product | hypothetical protein |
Protein accession | YP_001510646 |
Protein GI | 158318138 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.187686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTCG ACTCCCCGGT CGGCGCGTCG ACCGGAGGTG GTCCGACCAC GGCGGACAGC ACCGCCGGGA CGCTCCCGGG TGCACCCGTC GACCTGGCCG ACGGCACCGG TCACGCGGAG CCGTCCGGAG CGGGCACGCA CGCAGGCCCA CGAGCCCGGC GGTTCCGGTG GTTCCGGCAA AGCCGGCACT TCCTGCGGGT CCAGTCGTTC CGCTTCCGGT CGCCGGCCGG CGTCGTCGCT GTGGTCGCGC TCCTGTCCTT CGTCGCGAGC CTGGTCCTGC GGGCCACGCT GTACCCGGAC GGCTCCGGCG ACGCGGACGA GGCCGCCTAC ATCCTCCAGG CGCGGATGCT GCTCGAGGGG CGGCTGACCC TGGACGCGGG CGCCGTCGAA CCGTTCTTCC GGCCGTGGCT GACGGGTGTG CACGACGGTC ACGTGTTCAC CAAGTACCTG CCCGGCTGGC CGGCGTTGCT CGCGCTGTCC CAGGTGCTGT TCGACACGAT GGCCGTGGCG CCCGCCGCCG TCGCCGGGGT CTGGGTCGTC GGGACCTACC GGCTTTCCCG CGAGCTCTTC GACCATGCCT GGAGCGCGGT GGCCGCCGCT GTGGGCGTCG CGCTCTCACC GCTGGTCCTC CTGCATACGG CACTGCCACT CGCCTACGCC CCGGGGGCGG CCGTGCTCGT CCTGGCCAGC GCGGAGCTGT TGCGCGGTGC ACGGACGGGG GCGCGGGCCG CCCTGGCCGG CGGCGGCGCG GGCCTGGGGC TGGTGCTGCT GATCCGGCCG TTCGACGTCG TGCTCGTGCT CCTCCCGCTG GCCGCGCTCG CGGCCGCGCG GCGCCGGCGG GAGCTCGGGA CGCTGCTGCG GCGCTCGGGC TGGGCGGTGC TCGGCGCACT GCCCCTGGTC ACCGCCCTGC TCGCCTACTG CTGGCGGGTC ACCGGATCGC CCCTGCGTAT GCCGCTGTCG GCCTCGGACC CACTGGACCG GTTCGGTTTC GGCCCGCGCC GGATCCTGCC GTCCGAGTCG AGTTTCCTGT TCACCCGCCG GCTGGCGCTC GACGCCCTCC AGGAGACGCT CGAGGTGGCG CCGAGCTGGT TCTTCGGCGG TGCCGCGCTG ATCGCGCTGG CCGCCGTCGG CCTGGTGGCG CCGCGGCGGC GACTCGAGCG CCTGTTCCTG CTCGCGACGA CGGGCGTGGT GCTGGCCGGC TACACGTTCT GGTGGGGGTC GGCGTTCGCC ATGCCGGGCC TGCGCAACGG CCTCGGCCCG CACTACCATC TGGCCGCGTT CACCCCGGTC GTCATCCTCG CGGCCGACGG CGCCCGATGG CTGTGGACGT TCCTCCCGGC ACAGTTGCCG CTGTTCCGCC GCCCGGGGGC ATCCGTGCCG GGTACCGCGC GCGGCCTGGC CTGGCGCATG GTCCGTCCGG GGGCTGTAGC GCTCGCCGTC GCCGGGCTCG TCGCGATCAC GGTGCCGACC CTGCAGCCCC GGATCGACGT GCAGCGCGGG GTCAACGAGG GCAACGACTT CCTGGCCGCG CTGCTCCCGG ACAACCTGGG CGGGCCGGCC GTGGTGCTGG TGACGCCGAC AGTCCCGAGC CGCTACACGC AGGTTCCCTA CCATTCGCTG CGGAACTCCC CCGACCTCGA CGGACCTGTC GTCTTCGCGG CGGACATCGG GCCGGGCTCG GCCGCGCTGC CTGACCGGAT GCCGGACCGG GCGATGTTCC GCCTGCGGCC GGACGAGATC GCAGACCCGG CGGTCCCGGG CAGCTTCCGG GGGTCCTTCG TGCCGCTGAG GCAGGTCACC GGGAGCCGCG TCGAGATCCG TGTACAGGTA CGGATTCCCG GTGACGCTGG GACGGCACCC GTGCGGTCGG GGGATGCGCG GCTGTACGTC CGCCTCGGCG GGGAGGTCCG CACCCTGCGA ACAGCCGTCC CGGTGACGAT CACGCACACG TTCGTGCTCA CCACCGGTCC CGGCACCGGC CCGGACGAGA TCGGGACCGC CGGCGCGTCG CTGCCCGCCG AGCTCGTCGT CGGGTTCACC GACGGCACCG GCCCGGCGAG CGGGGCCTGG GAGGAGCGGT TTCCCCTGGT ACGCCGCCCC GGTGGCGACC TCTCCCTGCT CGCTCCCGGG CTGGGCTGGC GCCGACTGTC CAAGGTGGCT GGCAGCGGCG CGGCCGGCGG CGACGGCCAG TGGCTCCCGG CCACCGCGAA ACCCACCTTG GACGTCTCGC TGACCGGCGC CGCTGCCCGC TGA
|
Protein sequence | MTVDSPVGAS TGGGPTTADS TAGTLPGAPV DLADGTGHAE PSGAGTHAGP RARRFRWFRQ SRHFLRVQSF RFRSPAGVVA VVALLSFVAS LVLRATLYPD GSGDADEAAY ILQARMLLEG RLTLDAGAVE PFFRPWLTGV HDGHVFTKYL PGWPALLALS QVLFDTMAVA PAAVAGVWVV GTYRLSRELF DHAWSAVAAA VGVALSPLVL LHTALPLAYA PGAAVLVLAS AELLRGARTG ARAALAGGGA GLGLVLLIRP FDVVLVLLPL AALAAARRRR ELGTLLRRSG WAVLGALPLV TALLAYCWRV TGSPLRMPLS ASDPLDRFGF GPRRILPSES SFLFTRRLAL DALQETLEVA PSWFFGGAAL IALAAVGLVA PRRRLERLFL LATTGVVLAG YTFWWGSAFA MPGLRNGLGP HYHLAAFTPV VILAADGARW LWTFLPAQLP LFRRPGASVP GTARGLAWRM VRPGAVALAV AGLVAITVPT LQPRIDVQRG VNEGNDFLAA LLPDNLGGPA VVLVTPTVPS RYTQVPYHSL RNSPDLDGPV VFAADIGPGS AALPDRMPDR AMFRLRPDEI ADPAVPGSFR GSFVPLRQVT GSRVEIRVQV RIPGDAGTAP VRSGDARLYV RLGGEVRTLR TAVPVTITHT FVLTTGPGTG PDEIGTAGAS LPAELVVGFT DGTGPASGAW EERFPLVRRP GGDLSLLAPG LGWRRLSKVA GSGAAGGDGQ WLPATAKPTL DVSLTGAAAR
|
| |