Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7195 |
Symbol | |
ID | 5675496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8785229 |
End bp | 8787895 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641246032 |
Product | putative glycosyl transferase |
Protein accession | YP_001511420 |
Protein GI | 158318912 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.80662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCATACG ACTACGCGAC CTTCTCGACG CTCGCCGGGC CGTTGACGCG CCCGGCCGAA GGTGCGGTGC ACCACGTCGA GTACCGGCGC ATCATGGCAG GTCGTGTGCT CTGGCAGGTC ATGCCGCTGC TCATCGCGAC ACTGGGCCTG TCGTCGTTGT TCTTCGTCTG GCTTCTCCAG CCCCAGCACT ATCCCAGCAC GAAATACATA CCGACGGCGG TGGCCGTCGG GAACCACATC ATGTTCTGGC TGATCGTGGT GACCGAGGGC ATCCGCCTGC TCAGCTCCAT CATCCTGTGC TGGTCATCAG TGATCATGCG CGACCCGGTG CCGGTGCTGC CACCCCCGGG ACTTCGAGTG GCGTTCACCA CGACGATCGT GCCGTCCAAG GAACCCGTCG ACATCGTCCG CGACACCCTG ATCGCCGCCC GCAACATTCA GTACAGCGAG CAGATCGACG TCTGGCTGCT TGACGAGGGC AACGACCCGG CGGTGAAGGC AATGTGCCGG GAGACCGGCG TGCACCACTT CTCCCGCAAG GGCGTGGAGA AGTGGAACAC GCCGAAGGGA CGGTTCCGCG CCAAGACCAA GCACGGCAAC CACAACTCGT GGCTTGACGC GAACGGCCAC AAGTACGACG TCGTGCTCTC CGTCGACCCG GATCACATCC CGCTGCCGAA CTTCGCCGAC CGGATGCTCG GGTACTTCCG CGACCCGAAT GTGGCCTTCG TGGTCGGCCC GCAGGTCTAC GGGAACTTCC GGCACTATCT GACCCGGGGC GCCGAGGCCC AGAACTACAT GTTCCACTCG GTGATCCAGC GGGCCGCGAA CCGCTTCGCG GCCGGCATGT TCGTGGGTAC CAACCACGCC TACCGGGTGT CCACCTGGGA TCAGATCGGC GGGTTCCAGG ACTCGATCAC CGAGGACCTG GCGACATCCT TCGCCGTGCA CGGCGCGTTC AACGAGGTCA CCGGGCACCG CTGGACGTCG GTCTACACCC CGGACGTGGT CGCCGTGGGC GAGGGCCCGG CGAACTGGAC GGACTTCTTC AGCCAGCAGC TGCGGTGGGC GCGGGGCGCC AACGAGGTGA TGGTCACCGA GGCCCCGCGG CGGCTGAAGG CGCTGAGCTG GGGACCGCGC CTGCACTACC TGACGCTGAT GGTGCACTAC CCCACCGTCG CGATCACCTG GATCGTCGGC AACCTGCTCA CCGTGCTCTA CATGGCGCTC GGCTCGACCG GCGTCCTCGT CAACGTCTCG TTCTGGCTGG CGCTCTACGT CGACGTGTTC GTCGCGCGGA TGCTGCTCTA CTTCTGGCTG CGGCGGTTCA ACATCAGCCC GCACGAGGAG AAGGGCAGCG CGGGCATGAG CGGGATCTTC GTGTCCGTGC TGTGCACGCC CTTCTACTCG ACGGCCTTCG TCGGCGCGCT CACCCGCCGC AAGCTCGGCT TCGTGGTCAC CCCCAAGGGG AACGCGGCCA GCCCGGACCG CCTGATGACC TTCCGCAAGC ACCTGTTCTG GGCGGCGGTC TCGGGCGGAT CCGTGGCCGG AGCCGCGGTC TTCGGTCACC TGTACCCGGC GAACATGGTC TGGGCCTCGC TGTCGATCAT CACCTGCCTG ATTCCCATCG GGCTGTGGCT GATCGAGCCG ATCCTGGCCC CGCGGCGGGC GGTCGCTCCC AACCCGTTGC CGGCCGCCCA CATCCCGCCG CAGCGCGCCG GCGACCGCGC ACCGGCCGAC GCCCGCCCCG GCGCCCAGCG CCACGCCGAC CGCATCCCCG CCGGTCGCGG CGGCGTCGAC CCGGCCACGG CCGAGACGTC CACCCTCGAC CCGAGCGCCG CGGACACGAC CGTCATGGGG GCGATCGGCG CAGGGAGCGC CAGTGGGAAC AGGGCCGGCG GGAACGGGCC CGGCGGAAGG GGACCCGGCG GGAAGGGATT CGGTGCCAGG CCGTCCGACG TCGACCCGGC GACCGTCGGC ACGCCGCTCC CGGAGGCACC CGGCACCGAG CGGACGGTTC CGCCTGTTCC CCCGCCGGCG CCGCCGCTCA CCCAGCCGCA CCCGGTACGG CCGGCACGCC CGAACCAGCC TCGGCCCGCC CAGCCGGGAC GTCCCGCCGC ACCCGGGCGG CGGCGTGCCA GGCCGGCAGG CTCCCCGGAG CACGCCGGCA CCGGGCGGGC CGGGTGGGAG CGCGACGATC CGACGCTGAC CGGCCTCGAG CCCGTCGGCG CCCGCGGCGG CGCGGAGGGC GGCGAGGATG CCGAGCAGTC GCGGCCGCGG GACCCGGGCT GGTTCAGCGA CGACACCGTG CGTACGGAGA AGCCGAGGAT CGCGGCCATC GTCGCCGCCG CCGACGCCAA CAGCCCCGGC ACCGGCACTC CTGAGGACAC GGTCCGGACG TCCCGGCCGG TCGTCGGCGC CGTCCTGGCC GCCGCCGGGC TGCGCCGTGA CTCGGCTAAC CCCGACGACG TCGATCAGGA CACGGTCGTG ACTCTCCGTT CCCGGCAGGG GTCGCCGGAG GAGGTCCTCG CCGCCCGGCG GGCGGCGCTC GCCCGGGAGG GCCGCTCCCC CGCCGGCGTT CCGCACTACA CCGAGGACGT CACGATGACC CTGCATCCCC GCCGCCGGCG GGTGTCACTC GACGGCCTGT TGGAAGAGAT CGGCTGA
|
Protein sequence | MAYDYATFST LAGPLTRPAE GAVHHVEYRR IMAGRVLWQV MPLLIATLGL SSLFFVWLLQ PQHYPSTKYI PTAVAVGNHI MFWLIVVTEG IRLLSSIILC WSSVIMRDPV PVLPPPGLRV AFTTTIVPSK EPVDIVRDTL IAARNIQYSE QIDVWLLDEG NDPAVKAMCR ETGVHHFSRK GVEKWNTPKG RFRAKTKHGN HNSWLDANGH KYDVVLSVDP DHIPLPNFAD RMLGYFRDPN VAFVVGPQVY GNFRHYLTRG AEAQNYMFHS VIQRAANRFA AGMFVGTNHA YRVSTWDQIG GFQDSITEDL ATSFAVHGAF NEVTGHRWTS VYTPDVVAVG EGPANWTDFF SQQLRWARGA NEVMVTEAPR RLKALSWGPR LHYLTLMVHY PTVAITWIVG NLLTVLYMAL GSTGVLVNVS FWLALYVDVF VARMLLYFWL RRFNISPHEE KGSAGMSGIF VSVLCTPFYS TAFVGALTRR KLGFVVTPKG NAASPDRLMT FRKHLFWAAV SGGSVAGAAV FGHLYPANMV WASLSIITCL IPIGLWLIEP ILAPRRAVAP NPLPAAHIPP QRAGDRAPAD ARPGAQRHAD RIPAGRGGVD PATAETSTLD PSAADTTVMG AIGAGSASGN RAGGNGPGGR GPGGKGFGAR PSDVDPATVG TPLPEAPGTE RTVPPVPPPA PPLTQPHPVR PARPNQPRPA QPGRPAAPGR RRARPAGSPE HAGTGRAGWE RDDPTLTGLE PVGARGGAEG GEDAEQSRPR DPGWFSDDTV RTEKPRIAAI VAAADANSPG TGTPEDTVRT SRPVVGAVLA AAGLRRDSAN PDDVDQDTVV TLRSRQGSPE EVLAARRAAL AREGRSPAGV PHYTEDVTMT LHPRRRRVSL DGLLEEIG
|
| |