Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4174 |
Symbol | |
ID | 5672529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4962464 |
End bp | 4965115 |
Gene Length | 2652 bp |
Protein Length | 883 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243047 |
Product | apolipoprotein N-acyltransferase |
Protein accession | YP_001508464 |
Protein GI | 158315956 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG0815] Apolipoprotein N-acyltransferase |
TIGRFAM ID | [TIGR00546] apolipoprotein N-acyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0124093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.508891 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCGACA CCGCCCCCGT AACCGACACT GGACCGGTGA CCGACGCGCC CGCCACACCC GCCCTCGAGC CCGGCCCGCT CGGCCCCGCC CCGGCGGGCG CACCCCCCAC ACCCCCACAC CCCACCGGCC GCGCCCTACC GCGCCGCCTC ACCCGGCCCG CCCTGGCCGT CCTCGCCGGC GTCCTGCTCT ACCTCGCCTT CCCCCCGGTC GGCCTGTGGC CGCTGGCACC GGTCGCCCTG GCCGTTCTCA CCCTGACCGT CCGCGGGCGG CGGCTACGCG CCTCCTACGG GCTGGGAATG CTGTTCTCCC TCGCGTTCCT GCTACCCCTG CTGCGCTTCG TCTCCTTCGT CGGCGCCGAC GGCTGGATCG TGCTGTCCGC CGCCGAGGCC GCCCTACTCG CCCTCGTAGC ACCCGCCACC ACCCTCGTTC AGCGGCTACC CGCACCGTGG CTGTGGACGG GCGCCATCTG GGTCGCCCAG GAGGCGCTAC GGGGCCGTGC GCCCTTCGGG GGCTTCCCCT GGGGGCGGAT CGCGTTCAGC CAGCCGAACA GCCCCTACAC CGCCCTCGCA GCGCTCGGCG GCGCGCCTCT GGTCACCTTC GCCGTCGCCA CCACCGCCGC GCTGCTCGCC ACAGCCGTCA CCCACGCCAC CACGACCACC GCCGCCCGCG CCACGGGTAC AGATGCCGCC GGCCACGGCG CGGCAGGCGG CGCGGCGACC CACATCCGGC CACTACTGGC CACTCTCACC GGCGCTCTCG CGCTCACCCT CACCGGCCTC GCCGTTCCCC TGCCCACCAC CGCCCAGCAC GGCACCCTCA ACGTCGCCGC CGTCCAGGGC AACGTCCCCG AAGCCGGCGG CCTGGGCGCC CTCGGCGAGG CCTTCCAGGT CACCGACAAC CACGTCACCG GCACCGAGAA CCTCGCCGCC GCCGTCCGCG CCGGCCGCAC CCCCCAACCC GACCTCGTCC TGTGGCCGGA GAACTCCTCC GACATCGACC CCCTCACCAA CCCCACCGCC CACGCCGCCC TCACCCACGC CGCCACCGTC GCCGGCGCAC CACTGCTCGT CGGCGCCGTC CTCGACGGCC CCGGCCCCCG CCACGTCCGC AACGCCGGCC TCATCTGGAC CACCGACGGC CCCACCGGCG CCATGTACGT CAAACGCCAC CCCGTCCCCT TCGCCGAATA CCTCCCCGGC CGCGCCCTGC TCGAAAAACT CATCAGCCGC TTCGCCGACG AGATGCCCAA CGACTTCCTC GCCGGCACCG CCACCGGCGC CCTACCCGTC GCCGGCACCG TCATCGGCGA CGTCATCTGC TTCGAAGTCG CCTACGACGG CCTCGTCCGC GACAACGTCA ACCGCGGCGC CGAACTCCTC GTCATCCAAA CCAACAACGC CTCCTTCGGC CGCAAAGGCG AAAGCCAACA ACAACTCGCC ATGAGCCGCC TGCGCGCCAT CGAACACGGC CGCGCCACCA TCCAGGTCTC CACCAGCGGC CAGAGCGCCC TCATCACCCC CGACGGCACC ATCCTCACCC AGACCGGGCT CTACGAAGCC GGCGTACTGT CCGCACAACT CCCACTACGC ACCACCCACA CCCTCGCCAC CCGCCTCGGT ATCGTCCCGG AGGCGGTGCT CACCACCCTC GGCGCACTCG CCATGATTGC TGGACTGACC CACCCCCGAC GCACCACCCA CCAACCCACC CCGACATCCA CCGACCGGAC CGGCGACGAC CACCAGCACC TGGGGGAGAC GCACCAACCC GCCGCCGGCA CCGACCAGAA GAGGAGAGGC GTGGAAGCCA CCCGCGTCGT CGTCTGCGTC CCGACCTACA ACGAACGGGA GAACCTGCCG GACACCACGC GCCGGCTACG CCAGGCGAAC CCCGCGGTCC ACCTGCTCGT CATCGACGAC GCAAGCCCCG ACGGCACCGG GAAAATCGCC GACGAACTCG CCGACGACGA CGACCACATC CACGTCCTGC ACCGGCCCGG CAAATCCGGC CTCGGCTCCG CCTACATCGC CGGCTTCACC TGGGCCCTGC AACACGGCTA CGACATCATC GTCGAAATGG ACGCCGACGG CTCCCACCAG CCCGAACAGC TACCCCGCCT ACTCGACGCC CTCACCGACG CCGACCTGGC CATCGGCTCC CGCTGGGTCC CCGGCGGCAC CGTCCACAAC TGGCCCCGCA GCCGGCTCGT CCTCTCCCGC GGCGCCAACG CCTACGTCCG CGCCGCCCTC GGGGTGCCCC TCCACGACGC CACCGCCGGG TTCCGCGCCT ACCGCGCCGA CGTCCTGCGC GCCCGCGACC TCGACCAGGT CGCCTCCCAG GGCTACTGCT TCCAGGTCGA CCTCGCCTGG CGCTCCTGGC AGGCCGGGTT CCGCGTCGTC GAAGTCCCCA TCGACTTCGT CGAACGCGAA CGCGGCGCGT CGAAGATGAG CCGCGCGATC GTCGCCGAAG GATTCTGGCG CGTCGGCTGG TGGGCCCTGA CCTCCCTGCG CCGCGGCCCC GCCAGCACCA GCCAGCACAC TGGCGCGGAC GCGGCGATCC CCGCCCCCGC CCGGCCCACC GACCCCACCG CGGACAGCCT CACCACCCCG ACCGCCACCG GACCCGACAC CGTCGACGCC GGCCGGCCCT GA
|
Protein sequence | MVDTAPVTDT GPVTDAPATP ALEPGPLGPA PAGAPPTPPH PTGRALPRRL TRPALAVLAG VLLYLAFPPV GLWPLAPVAL AVLTLTVRGR RLRASYGLGM LFSLAFLLPL LRFVSFVGAD GWIVLSAAEA ALLALVAPAT TLVQRLPAPW LWTGAIWVAQ EALRGRAPFG GFPWGRIAFS QPNSPYTALA ALGGAPLVTF AVATTAALLA TAVTHATTTT AARATGTDAA GHGAAGGAAT HIRPLLATLT GALALTLTGL AVPLPTTAQH GTLNVAAVQG NVPEAGGLGA LGEAFQVTDN HVTGTENLAA AVRAGRTPQP DLVLWPENSS DIDPLTNPTA HAALTHAATV AGAPLLVGAV LDGPGPRHVR NAGLIWTTDG PTGAMYVKRH PVPFAEYLPG RALLEKLISR FADEMPNDFL AGTATGALPV AGTVIGDVIC FEVAYDGLVR DNVNRGAELL VIQTNNASFG RKGESQQQLA MSRLRAIEHG RATIQVSTSG QSALITPDGT ILTQTGLYEA GVLSAQLPLR TTHTLATRLG IVPEAVLTTL GALAMIAGLT HPRRTTHQPT PTSTDRTGDD HQHLGETHQP AAGTDQKRRG VEATRVVVCV PTYNERENLP DTTRRLRQAN PAVHLLVIDD ASPDGTGKIA DELADDDDHI HVLHRPGKSG LGSAYIAGFT WALQHGYDII VEMDADGSHQ PEQLPRLLDA LTDADLAIGS RWVPGGTVHN WPRSRLVLSR GANAYVRAAL GVPLHDATAG FRAYRADVLR ARDLDQVASQ GYCFQVDLAW RSWQAGFRVV EVPIDFVERE RGASKMSRAI VAEGFWRVGW WALTSLRRGP ASTSQHTGAD AAIPAPARPT DPTADSLTTP TATGPDTVDA GRP
|
| |