Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5499 |
Symbol | |
ID | 5673830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6657931 |
End bp | 6659637 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244354 |
Product | hypothetical protein |
Protein accession | YP_001509760 |
Protein GI | 158317252 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.102476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAAG GTCCAGCCAC CGAGTCTCTT CCCCCGCAGG TGCGCGGGCG GCACCGTTCG GGCCGGCGGG ATTCCGCGCT GGCACCGTCC CAGCCGCTCG GGTCGGCGCA GGTAGCGGGC CAGGGGCTGC AGGAGGAGCC GCCGCCCGGC ACGCGGGGAG TGCCCGGTCC GGCGGTGATT CCCGGGCAGG CGACACAGCC GCGTCGGCAG CACCCGAGCC TCGCGGCCTT CCCCGGGCTC GCCGGCCTCG CGCCCGAGCG GCTGCGCGGC CAGAGCGACC GGATCCGGGC GGCCATGCGG GGCGCGATCC CCCTCGCCCT CGCCGGGCTG ATCGCCAACG CGGCCAACCT CGGGGTCACC CTGGTCATCG CGCGGGCCAT GAGCACCCGC TCCTACGGCG CCGTCGCGCA GCTTTTCGCG ATCTTCTTCG TCGTCTCGAT GCCGGGCAGC GCCCTGCTCG TCGGTGTCGT CCGGCGGATC ACGAACTGGC AGCACACCGG CCAGGCCGAC CTGATCGACG AGTGGATCGG CCGGGTCCGC CGGGCCGGCG TCATCATGGT CCTCGCCGTC GCCGTCCTGG CGATCATCGC GCGCGGGTTC GTCGCCCGCG AGCTCTCGCT GCCCGGAGCC GGCGGGGTCG CCGAGATCAT CATCGCCGGA GCGGCCTGGT GCCTGCTGTG CGTCGACCGC GGGCTGATGC AGTGCGGGCG GCTTTACCCG TCGCTCGCCG CGAACCTGCT GGTGGACGCG GCGGTCAAGA GCGGCTCGAC GATCGCGCTG GTCCTGGCCG GACTGGACGA GGCCGGCGCG GCCATCGCCG TGCTGCTCGG GGTACTCGCC GCACTCGCGC ACACCCGTTA CAGCCTGCGC CGGCACCCGT CGGCGATCAT GCAGGCCGAC CCGACCCGCC CCCAGGCCCA AGCGCCGCCG GCCCCCACCC AGCTCCCGGC CCCCACCCAG CTCCCGGCAC CGCCGGCCCG CCAGCGCCGG CTCACCGGCG GCCGGTTGGG ATCGCATCCG GCGGCGGGGC GTGGCGACGC CGACACCGGG GCGACGACGC TGCCGCTGCC GGTCGCCGGG CCGGCGGCCA CCGCGGAGCC GCGGCGGCTC GCCATCGAGG TCGGCGCCGC GCTGGTGACA CTGGCGTTCC TCGGTGTGCT GCAGAACATC GATGTGCTGC TGCAGGGCCG GCTCGCCCCG GACGAGTCGG GCTCCTACGC GGCCGTCTCC GTGGCGGCGA AGGTGATCGT GCTGGCCGCG ATCGTCCTGG CCGGGTTCCT CCTGCCCGAG GCCGCGGACC GCAACCATCT CGGGCAGCAT GCGCTGCATC AGCTCGGTGC GACGCTGGCG ATACTCGCGG TGCCCGCGGT GGGGCTGCTC ACCGTAGCGG CGATCGCACC CGACACACTG CTGTCGCTGG CTTTCGGGCC GCGTTTCACC AGTGCCTCCG GCGCTCTCCT CCCGCTGGCC GGAGCTATGA CCTGTCTCGG AGCGACCGTG TTGTTCTCCC ACTATCTGCT GGCTCTCGGC AAGCGTGCCG TGCTGGCCGT GCTCGCCGTC GCGACCGGCA CCGCCGTCGC CCTCATGGCC TGGGCGCAGG GCTCCCCCGT GTCGACCGCG CGGGCGAACT TCGGCTGCCA GGCCGTGCTC GCCGTCGTCA CCGGGCTGAT GGTGCTCGCC GCCGCCCGCC GGACGGCCCG CGCGTGA
|
Protein sequence | MNQGPATESL PPQVRGRHRS GRRDSALAPS QPLGSAQVAG QGLQEEPPPG TRGVPGPAVI PGQATQPRRQ HPSLAAFPGL AGLAPERLRG QSDRIRAAMR GAIPLALAGL IANAANLGVT LVIARAMSTR SYGAVAQLFA IFFVVSMPGS ALLVGVVRRI TNWQHTGQAD LIDEWIGRVR RAGVIMVLAV AVLAIIARGF VARELSLPGA GGVAEIIIAG AAWCLLCVDR GLMQCGRLYP SLAANLLVDA AVKSGSTIAL VLAGLDEAGA AIAVLLGVLA ALAHTRYSLR RHPSAIMQAD PTRPQAQAPP APTQLPAPTQ LPAPPARQRR LTGGRLGSHP AAGRGDADTG ATTLPLPVAG PAATAEPRRL AIEVGAALVT LAFLGVLQNI DVLLQGRLAP DESGSYAAVS VAAKVIVLAA IVLAGFLLPE AADRNHLGQH ALHQLGATLA ILAVPAVGLL TVAAIAPDTL LSLAFGPRFT SASGALLPLA GAMTCLGATV LFSHYLLALG KRAVLAVLAV ATGTAVALMA WAQGSPVSTA RANFGCQAVL AVVTGLMVLA AARRTARA
|
| |