Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6499 |
Symbol | |
ID | 5674814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7900977 |
End bp | 7903238 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245347 |
Product | glycosyl transferase family protein |
Protein accession | YP_001510742 |
Protein GI | 158318234 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.945206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.608011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACCT CACCGACGCC GTCACCGCCG ACCGACCAGT TCGACCACGC GGCCTACGCG AGGGCTGACC GTGCGGCCTA CGCGAGGGCT GACCACGCGG CCTACGCGGG GGCTGACCAC GCGACCGTTC CGGGCACGCA CTGGCCCGCC GGTGCTCCCG GGCGGGCGGG CGGGGCGTCC GGGCGGCATC TGGCCGGCGA CGGTCGGATC GGCCGGTTCG CGCGGGGACG TCCCGACGAC CCGCGGTGGG TGCGGCCCGC GCTGCTCGGG CTGCTCGCGG CGACGGCCGT CCTCTACCTG TGGGGGCTGG GCGCCTCCGG CTGGGCGAAC GCCTTCTACT CGGCGTCGGT GCAGGCCGGC TCGGTGAGCT GGAAGGCGAT GTTCTACGCC TCCTCGGACG CCGGGAACTC CATCACCGTC GACAAGCCCC CGGCCTCGAT CTGGGTGATG GCGCTCTCCG CGCGCGTCTT CGGCGTGAAC GCCTGGAGCA TCCTCGTCCC GCAGGCGCTG ATGGGCGTCG CCACGGTCGG CCTGCTCTAC GCCGCGGTGC GCCGGGCCTT CCCCGCCGGC GCGGCGCTGC TCGCCGGGGC CGTCCTCGCG ATCACCCCGG TCGCGACGCT GATGTTCCGG TTCAACAACC CGGACGCCCT GCTCGTCCTG CTGCTGGTCG CCGCGGCGTA CGCGACGCTG CGCGCCGTCG AGACCGCGAG TACCCGCTGG CTGGTCTGGG CCGGGGTGTT CGTCAGCTTC GGCTTCCTGA CGAAAATGCT GCAGGCGCTG CTGATCGTCC CGGTGCTGGC CCTCGTCTAC CTCGTCACCG CGCCGACGCG GTTCACCCGG CGGCTGTGGC AGGTCGGAGC GGGCGCGCTC GGGCTGATCG TCCCCTCCGG GATCTTCATC GCGATCGTCG AGCTCGTCCC CGACTCGGCC CGCCCGTACA TCGGCGGGTC GCAACACAAC AGCATCCTGG AGCTGACTCT CGGCTACAAC GGCCTGGGCC GGCTGACCGG TAACGAGTCC GGCAGCGTCG GCGGCGGCGG CGCGGCGGGC GGCGCCGGTG GTGGCGGCAT GTGGGGCTCG ACCGGGTGGG GCCGGATGTT CGGCTCCGAG GTCGGCGCCC AGATCTCCTG GCTGCTGCCT ACCGCGCTCG CTCTGCTGGT GGCGGGCCTG TGGATCACCC GGCGCGCTCC CCGCACCGAT CCCGGACGGG CCGCGCTCGC GGTCTGGGGC GGCTGGCTGC TGGTCACAGG CATCGTGTTC AGCCAGATGC AGGGCATCTT CCACGCCTAC TACACGGTCG CGCTCGCCCC GGCGGTCGGC GCCGTGGTCG GCATGGGAGC CGCGACCCTG TGGCGCCGCC GTGAGCACCC GATCGCCGCG GCCACCATGG CCGGCATCCT GGTACTGACC GCGCTGTGGT CCTACGTCCT GCTCGACCGC ACACCCGACT GGAACCCGTG GATGCGCTGG GTCGTGCTGA TCGTCGGCTT CGCGGCGGCA CTGCTGCTCA TCGTGCTCTC GCGGCTGCCC CAGGCGGCCC GCGTGGCGGT CGTCGCGGCC GCGCTGGTGG CGGCGCTGCT CGGGCCGTTC GGCTACTCGG TCGCGACCGC GGCCACCCCG CACACCGGCT CCATCCCGTC CGCGGGCCCC GCAGGCTCGG GCTTCGGCGG CCCGGGCGGC GGACCTGGCG GAGGCGGCGG ACGGCAGCTG TTCGGCGGTC CTGGCGGCGG CAACGGCGGG GCCCAGCAGG GCACGACGCA GGTCCCCGGC GGCGGCACGG CGCCCGGCGG TACCACGCCG GGTGGGGCGG CTCAGGGCGG CACGCAGGGC GGGCCCGGCG GCGGCATGGG CGGTGGCATG GGCGGCCTGC TCGACGCCGG TAAGCCCAGC GATGAGGTCC TCGCGCTGCT GAAGGCGGAC GCCTCGTCCT ATACCTGGGT GGCGGCGTCC GTCGGGTCGA ACACCGCGGC CGGCTACCAG CTCGCCAGCG GCGATCCGGT GATGGCGATC GGCGGCTTCA ACGGCAGCGA CCCCTCCCCC ACCCTTGAGC AGTTCAAGCA GTACGTGGCG GACGGCCGCA TCCACTACTT CATCGGCGGC GGCGGCTTCG GCGGGCAGAA CGGCGGAAGC CGGGCGTCCA GTGACATCGC CGCCTGGGTG GCGGCGAACT TCACCGCCAC CACCGTCGAC GGCACAACCC TCTACAACCT GACCACCCCA ACCACCACCG GCACCACCAC CAGCGGCACC ACCACAGCCT GA
|
Protein sequence | MPTSPTPSPP TDQFDHAAYA RADRAAYARA DHAAYAGADH ATVPGTHWPA GAPGRAGGAS GRHLAGDGRI GRFARGRPDD PRWVRPALLG LLAATAVLYL WGLGASGWAN AFYSASVQAG SVSWKAMFYA SSDAGNSITV DKPPASIWVM ALSARVFGVN AWSILVPQAL MGVATVGLLY AAVRRAFPAG AALLAGAVLA ITPVATLMFR FNNPDALLVL LLVAAAYATL RAVETASTRW LVWAGVFVSF GFLTKMLQAL LIVPVLALVY LVTAPTRFTR RLWQVGAGAL GLIVPSGIFI AIVELVPDSA RPYIGGSQHN SILELTLGYN GLGRLTGNES GSVGGGGAAG GAGGGGMWGS TGWGRMFGSE VGAQISWLLP TALALLVAGL WITRRAPRTD PGRAALAVWG GWLLVTGIVF SQMQGIFHAY YTVALAPAVG AVVGMGAATL WRRREHPIAA ATMAGILVLT ALWSYVLLDR TPDWNPWMRW VVLIVGFAAA LLLIVLSRLP QAARVAVVAA ALVAALLGPF GYSVATAATP HTGSIPSAGP AGSGFGGPGG GPGGGGGRQL FGGPGGGNGG AQQGTTQVPG GGTAPGGTTP GGAAQGGTQG GPGGGMGGGM GGLLDAGKPS DEVLALLKAD ASSYTWVAAS VGSNTAAGYQ LASGDPVMAI GGFNGSDPSP TLEQFKQYVA DGRIHYFIGG GGFGGQNGGS RASSDIAAWV AANFTATTVD GTTLYNLTTP TTTGTTTSGT TTA
|
| |