Gene Information |
Locus tag | Franean1_6555 |
Symbol | |
ID | 5674870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7972968 |
End bp | 7974209 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641245404 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001510798 |
Protein GI | 158318290 |
COG category | |
COG ID | |
TIGRFAM ID | |
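
The length fields above follow from the coordinates by simple arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. A minimal sketch of that check (the function names are hypothetical, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(7972968, 7974209)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values from this record, the sketch gives 1242 bp and 413 aa, matching the Gene Length and Protein Length fields.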
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.381566 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAGC CTTTCGATCC TGGCCTGGTA AAAGGCTCAT CCGTGGCGCT CGCGGCTTAC CAGGGGGAGT CGGAAATGGA CGTAGTGACG GTTTGTCGAA CGGTTTTTCG CCGGTGGTAC GTGGTTGTTC CGGTCCTGGT TGCCACCGTG GTCGTGCTCT GGGTCACAAC GTCCAAGGCA GAACCGGTCT ACCAGGCCGA GGCCCAGGCC ATCGTCGTCG GACCGTCGGT GGAACAGAAC GGTGAAATCG TACAGCCAGT CAACCCGCTG TCCTATTTCA ACGACTCTCT CAAACTGCTC ACACTGACCA TATCGAGGAT CATCAACGGC GAGCAGGCGC GGGACGGGAT CAGGGCGGCG GGCTTCCGCG CGGATTACAC GGTGACGGCG CCACCGGAAA CGACATTCCT AGAGATCACC GCGACCGATC CCGATCCCGC CGAGGCCACT CGCACAGCGG CGCAGGTACT GGACCAGATC ACCGCCAACA CGAACGAACT ACAGATGTCG GTGCCGGCGG GTCTGCGTTA TGAGGTTCAG CGACTGTTCC AACCGGTGCG CACGAGCAAC GAGCCTGGGC GCAGCCTCCA GTTGCTGGCG ACCATCGCGG TCCTGGGAGC GCTGGCCGCG TTGGGGCTGG CACTGTCCGT CGACGCCGCG GCCAGGCGAC GGGCCCGATC GCGCCGGGAG CGACCCCGGT CACACCGCGG CGCCTGGTCG CGCCGGGATC CAAAGCGGAC CGGCGGACGT CGACACCTGG CCGGGTCGCC GAAGGCGGTG TGGCCTGAGC CGGCGGTGTG GCCTGAGCCG GCGGTGCAGT CTGAGCCGGC GGTGCAGTCT GAGCCGGCGG TGCAGTCTGA GCCGGCGGTG TGGCCTGAGC CGGCGGTGCA GTCTGAGCCG GCGGTGCAGT CTGAGCCGGC GGTGCAGTCT GAGCCGGCGG TGCAGTCTGA GCCGGCGGTG CAGTCTGAGC CGGCAAAGGC ACCGCCGGTA CCCGCCCAGG CCGCACCGGT GCCGGCCGGG CAGCCCGCGG CCACGCCGAT GCCGTCCAAG GCCCCGAAGA CGGCCGCGCC GTCGGTGCCC GCCCAGATGG CACGGGAAGA AGCCGGTGAG CGGTCCTGGT CCCGCGAAGG GGCCGGCGGA TGGTCCTTCG CCGTAGCGTC CTTCGACGGA GCGTCCGACG ACGGAACTGG CGGGTGGTCC TTCGACGGAG TCGGCAAGAG GCGGCCGGGG GCAGGCCTGT GA
|
Protein sequence | MTEPFDPGLV KGSSVALAAY QGESEMDVVT VCRTVFRRWY VVVPVLVATV VVLWVTTSKA EPVYQAEAQA IVVGPSVEQN GEIVQPVNPL SYFNDSLKLL TLTISRIING EQARDGIRAA GFRADYTVTA PPETTFLEIT ATDPDPAEAT RTAAQVLDQI TANTNELQMS VPAGLRYEVQ RLFQPVRTSN EPGRSLQLLA TIAVLGALAA LGLALSVDAA ARRRARSRRE RPRSHRGAWS RRDPKRTGGR RHLAGSPKAV WPEPAVWPEP AVQSEPAVQS EPAVQSEPAV WPEPAVQSEP AVQSEPAVQS EPAVQSEPAV QSEPAKAPPV PAQAAPVPAG QPAATPMPSK APKTAAPSVP AQMAREEAGE RSWSREGAGG WSFAVASFDG ASDDGTGGWS FDGVGKRRPG AGL
|
| |