Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5863 |
Symbol | |
ID | 5674186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7112784 |
End bp | 7115963 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641244713 |
Product | hypothetical protein |
Protein accession | YP_001510115 |
Protein GI | 158317607 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0962596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.213292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGAC CAACGCGGGA TGCCGACCAG CACCGGTGGG CGGTTCGTTC CCGAGGAGCC GTTCGTCCCG GTCAGCGGGT CGTAGCGGTC GTCGTGACGC GGCGGGATCA GCCCGCGGCG CCCGGCGGCC ACGACCCGCG ACGAAACGGT CTCGGCAGTA CTGGCACCGC TGCGGCGGTC GCTGCGGGCC CGGTGGGTCC GGTGGTCGCG CTGACCGCGT CGACCCGCCC GCCCGAGCGG ATCGTGGTCG TCCGTCTCGG CGGCCCCGCG ATCCCGGCGC AGCGAATCCC GGCGCAGTGG GTTCCGGCGC AGTGGGTTCC GGCGGCAGAC AGCCCCACGC AGCCGAGCCC GGACCCTGAC GTCCCGAGCC TTCCCGCCCG AGACGTCCCG GGCCTGCCCG CTGGGGACGT CCTGAGCCTG CCTGCCGGGA CGTCCTTCGG TGTCGCGGTC GCGGCCGGTA TCCGGCACGG CGGGCTGGGC GGCGCGCCCG ACACCTTCTA CTGGCTGATC CACGACGGCG TCCTGCCACG GCCCGACGCC CTGGAACGAC TGCTCACCTA CGCCCGCGTC GACCCGGGCG CGGCGGTCCT CGGTCCCAAG GTGCTCGACG CCGCCCGACC GGAATTCCTG CTCGAGGCGG GTGTGACGGT CGACCGGGCC GGCCGGCGGA TCACCGGTGT CGTCCCGGGC GTGCCCGACC ACGGCCAGCA TGACGCCGTG CGGGACGTGC TGGCCGTCTC CTGCACGGGG ATGCTCGTGC GAGCCTCGGC ATGGGAGCGG CTCGGCGGCC TGGCAGCGGA CATGACCGCC GGCCTCGATC TGGATCTCGG CGCCCGCGCG GCGCGGGCGG GCCTGCGGGT CGTCGTCGTG CCCCCGGCGG CGGTGCTGCT CGCGCCGCCC GCACGTACCG ACGCGCCCCT CGCTGACAGG CCCCTCGCGT CCGCCGCGTC CCGCGCGGGC GACCGTGTCG GGAGGGTCGC GGGAGCGCGG GTGCGACTCG CGCTGACGGC GGCGCCCTTC CTCGTACCGG CCGTGCTCGC GTTGGTGGTC GCGGGCGCGG CGCGGGCCGG TGCCCGCCTG CTCCTCCCGC CCCGGGACAG CGGTGATCTG AGCGGCCCGC GTTCCCGCGG CCGGTGGCGC GACGCGTTGA CCGAGCTGTG GATCGCCGGT GCCGTCCTCG CCGGCCCGCG GCCCCTGTTC CGGATGCGCG TGCGGTTCGG CCGGTGCGTG ACGGTGCCGC GGCGCGCCGT GCGCGAGCTG CTGTCGGCCC GACCCCGTGC GGTCGCCGAC CGGGCGCCGA GTGGCCCCCG CGCGGCCGCC CCGCCGCGGG CGCTCGCGGT GGTGATGGCC CTGTTCCTGG GGGCGGCTGG CCTGGCCGTC CGCCGGTTGC CACCGGACGC CGTCGGCGGT GGCGTGCCGA TGCCCGAGTA CGCCGGTGAC CTGTGGTCCG TGGTGTGGTC CGGTTGGCAG GGCTCCGCGG GCGGCGTGCT CGGCTGGCCG GGCTCCGCAC CGCCCTGGAC GGCTCTGCTC GCCGCGCTGT CGTCCGTCGC CGCCCCGGTC GGTCTCAGCG TGGCCGCGAC CTGCTCGGTC GTGCTCGTCG TCGCTCCGGC CGCGGCGACC TACCTCGCGT ACCGGGCCTC GGGACGGCTC GTCGGCTCGC GCGGCCCGAG GCTCGGCCTG GCGGCCCTGT ACGGCGTGTC CCCGCCGGTC ACGGGATCAG TGCTGGCCGG CCGGATCGAG ACCGCCGTCG CCCTGGCGGT GCTGCCCGCC GTTCTCGCGG CGGGGGACGA GTTCCTCCGT GGCCGGACGG TCCGGGCCGG AGCGAGCGGT TCCGGTTCCG GAACCGGCCT CGACGCCGAC GCCGGTGGGC GCGGGCGCGC GGGCTGGCGG CTCGCCGTCT CCCTGGCCCT CGTCGTCGCG TGCATGCCGG TGCTGGCACC GATCGCGCTG GTCGGCCTGC CCACGGCGGC CTGGGCCGTC CGGAGATCAC GGCCGCCGGG CACGCAACCG CCGCCGGTCG GGCCGGTGCC GGTACTTGGT GTGCTGATCG CGGGAGCCGT CCCCCTGCTG CCCGGCCTCG TCACGGGGGG CGTCGGGTGG TGGCCCGCCG CTGTGTCGCC CCTGTCGGGC TCGGAGCTGA CCGGCCTGGT GGCCGGCGCT CCCGGTGACG AGAGGCGGGC CGCGGCGCTC TGCCTCTTCG TGGCGCTGTG CCTCTTCGCG GCGCTGTGCC TGCTGCTCAC CGCCTGCCCC CGCCTGCTCG GCCGTCGGGC GTCCTCCGGG GTCGGTGGTG TGGCCACCGG CTGGGCGCCG GCCACGGCGC GGGTCGGGGC GGTCCTGCTC GGCGGATGTG TGCTGGCGTT CGCCGCCGGC CTGGCCACCG CGACGCCGAT TCCGGGTCCC GCGCAGGAGG AGAGCCCGGC CCGCGCGGTC GCCGCGCTGA GCCGGACGGG CGGACCAGGC GCGCGCGTCC TGGTGCTGCG CCGGTCCGGC CCGTCGGGGC CGGTCGCGTA CAGCCTGGCC GCCCAGGGCG GGCCGAGGTT CCCCGCCGCC GCCCCGCACC GGCCGCCCTC CGCTGCCGAC CGGGCGCTCG CGACGCTCGT GGCGGACATC TCCGCCGGTC TCCCGGACGC CGCCGACGCG CTGCCGGCGT TCGGTGTGGC CGCGGTCGTC GTCCCGGCGG GTTCGGCCGA CCCGGCGCTG GTGGCCGCAC TCGACGCGGT GGACGGCCTC TCCCGGGAGA GGCGCGGCCC TGACGTGCTG CTGTGGCGCC CGGTCGCCGC CGCCCCCGGT GCCGGGGAAG GCACGACCCT GCTCCGGCTC GGCTCGCCCG GTCCGGCCGG ATCGACCCGG CTGCCCGGCG GCGCCGCGGG GCGCCGGGTG GTGCTCGCCG AACCCGCCGA TTCCGGCTGG CGGGCGACCC TGGACGGCGC GCCGCTGCCC GCGGCGGTCG CGGACGGCTG GGCGCAGGCG TTCGTCCTGC CGGCCGAGGG CGGCCTGCTC GAGGTCTCCT ACGATCACCA CCGCCACCGC GCCGCCGTGC TGGCGACCGC CGCAGCCGGG GCGCTCCTGC TGGCCGCGGG CCCCTTGGCC GCCGTCCCGC GCCGGCGCAG TGGCCGCCGC GCGGCGGCCA CGCGGGCGGC GGCCACGCGG GCGGCGGCCA CGCGGGCGGA GACCACGTGA
|
Protein sequence | MTRPTRDADQ HRWAVRSRGA VRPGQRVVAV VVTRRDQPAA PGGHDPRRNG LGSTGTAAAV AAGPVGPVVA LTASTRPPER IVVVRLGGPA IPAQRIPAQW VPAQWVPAAD SPTQPSPDPD VPSLPARDVP GLPAGDVLSL PAGTSFGVAV AAGIRHGGLG GAPDTFYWLI HDGVLPRPDA LERLLTYARV DPGAAVLGPK VLDAARPEFL LEAGVTVDRA GRRITGVVPG VPDHGQHDAV RDVLAVSCTG MLVRASAWER LGGLAADMTA GLDLDLGARA ARAGLRVVVV PPAAVLLAPP ARTDAPLADR PLASAASRAG DRVGRVAGAR VRLALTAAPF LVPAVLALVV AGAARAGARL LLPPRDSGDL SGPRSRGRWR DALTELWIAG AVLAGPRPLF RMRVRFGRCV TVPRRAVREL LSARPRAVAD RAPSGPRAAA PPRALAVVMA LFLGAAGLAV RRLPPDAVGG GVPMPEYAGD LWSVVWSGWQ GSAGGVLGWP GSAPPWTALL AALSSVAAPV GLSVAATCSV VLVVAPAAAT YLAYRASGRL VGSRGPRLGL AALYGVSPPV TGSVLAGRIE TAVALAVLPA VLAAGDEFLR GRTVRAGASG SGSGTGLDAD AGGRGRAGWR LAVSLALVVA CMPVLAPIAL VGLPTAAWAV RRSRPPGTQP PPVGPVPVLG VLIAGAVPLL PGLVTGGVGW WPAAVSPLSG SELTGLVAGA PGDERRAAAL CLFVALCLFA ALCLLLTACP RLLGRRASSG VGGVATGWAP ATARVGAVLL GGCVLAFAAG LATATPIPGP AQEESPARAV AALSRTGGPG ARVLVLRRSG PSGPVAYSLA AQGGPRFPAA APHRPPSAAD RALATLVADI SAGLPDAADA LPAFGVAAVV VPAGSADPAL VAALDAVDGL SRERRGPDVL LWRPVAAAPG AGEGTTLLRL GSPGPAGSTR LPGGAAGRRV VLAEPADSGW RATLDGAPLP AAVADGWAQA FVLPAEGGLL EVSYDHHRHR AAVLATAAAG ALLLAAGPLA AVPRRRSGRR AAATRAAATR AAATRAETT
|
| |