Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0184 |
Symbol | |
ID | 5668609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 222282 |
End bp | 224009 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239113 |
Product | hypothetical protein |
Protein accession | YP_001504557 |
Protein GI | 158312049 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00494452 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACGA CTGTCCCACG TCGGCACCGC GCTCGCCGCC GCGACCGGTA CCTTCCGGGG ATCGTGAACA GCGCGGGAGT AGCAGGTCCG GCAGGACGGG CGCGCGGCCC GGCACGCTGG TACGCGGCCC TGCGAGGACG GCCGCTGATC GTCCGGTGGG TTGTCAGCCG GGCCATGCTG CTCACCCTGG CGCTGGCCGG CCAGGTGTTC GGCGCGCAGC AGAGCGTTCT CGGTGATGTC GATCTCTACC GGGAGTGGGG CCACACGCTC GTCGGTGACG GGACGGTGCC CGGGGACGAG AAATGGCAGT ACCCGCCGGG CGCGGCGGTC GTCCTGGCGC TGCCCGCGGT GCCGCGCGAG CTGGCCGGCG TCCCGTATGA GGTGTCCTTC TACCTGCTGA TGCTCGTCGT GGACGCGGTG CTCACCCGGG CGCTGGCCCG CCGCTCACCC GCGGCAGCCC GGTACTGGCT GCTGGCCACG CTCGCCCTCG GCCCGGTGAT GCTGACCCGG TTCGACCTGG TGCCGGCGGC CGCGGCGCTG GCCGCGGTCC TCGCCCTGGA CGGCTCCGCG CCCGGTGGCG CCCATGGCTC CGGTGACGCT GAGCCTCCTG ATGGCGCTGG TCGGCCGGGG CACTCCCGGC GCCTCCGCCC GTTCAGTTCT TTCGGTGCGC TTGGTTCACT CGGTCGTTTC GGTGGCTGGG TGGTGCTGGG CGTGGCGGTG AAGGTGTGGC CCGGGTTGCT CCTGGTGGCC CTGGGACGAC GCGGGCTGAC GCGCCCGGGC TCGCCGGTGC TGGGCCGGGT GGCGCGGATC GTGGCGGGCG CGGGTGTGAC GGCGGCCGTC CTGGCCGCCA TCCTCGTGCT CGCCGGTTGG TGGCGGGGCG CGCTGGGGTT CCTGGACGCG CAGAGCGCCC GCGGCCTGCA GATCGAGGCG GTGCCGGCCA CCCCGTTCGT CGTGGCGCGG ATGCTCGGGA TCGGCTCCGC CCCGGAGTAC TCCTACGGGT CACTCCAGTT CGACGGCGGG CTCGCCAGGG CAGTGGCGAC GGCCTGCTCG CTCGCCGAGG TGATCGTGAT CGCCGCCGCA GTCCTGTGGT GGTGGGCGCG GCGGTCCCCT GGTCGGGCCG TGGACGCCTC CCCCCGGCCG GGCGGGGACG GTTCGATGGA CACGGAGGGC ACATCGTCGG TTGCCGGGCG TGGCCTCGCG CTGGTCCTGC TGATTGTGAT CACCTCGCGG GTGCTCAGCC CGCAGTACCT GGTCTGGCTG CTCGTCCTCG CCGCCGCCGT CCGCCCGTCC ACCGCCGGCG CCGAGCGTGT GCGGGCCGAG TCATCGGACC ACTCCGGGAG CACAGGCGGC CGGCTCGGGT GGTCCGGGTG GCGTGGGCGG AAGGTCGACG CGGCGGGGCT GCTCGCCGTC TGCGCTGTGC TGTCGCAGGT TGTCTACCCG TGGCGCTACA ACGACGTCGT GCAGGGGCGG GTCGTCGGCG GGCTGCTGCT GGTCGCGCGC AACGCCGTGC TCGTCACCGC GGCCTGGTAC GCGCTTCGGG CCGCGGCGCG GGAATCGTCC GGCGACGGAC AGTCGTCCGG CGACGGACCG CCGGCACAGT CGCCGCCGAC ACCACTGCCG CCAACACCAC CACCAGGGCA GCGCCGAGCA GCGGGTCGGC CCGGCCGAGC AGTCGCCAGG TGGCGGCGAG CGGCGACAGC GTCGCGAGGC ACAGCCAAAC CGACGTGA
|
Protein sequence | MATTVPRRHR ARRRDRYLPG IVNSAGVAGP AGRARGPARW YAALRGRPLI VRWVVSRAML LTLALAGQVF GAQQSVLGDV DLYREWGHTL VGDGTVPGDE KWQYPPGAAV VLALPAVPRE LAGVPYEVSF YLLMLVVDAV LTRALARRSP AAARYWLLAT LALGPVMLTR FDLVPAAAAL AAVLALDGSA PGGAHGSGDA EPPDGAGRPG HSRRLRPFSS FGALGSLGRF GGWVVLGVAV KVWPGLLLVA LGRRGLTRPG SPVLGRVARI VAGAGVTAAV LAAILVLAGW WRGALGFLDA QSARGLQIEA VPATPFVVAR MLGIGSAPEY SYGSLQFDGG LARAVATACS LAEVIVIAAA VLWWWARRSP GRAVDASPRP GGDGSMDTEG TSSVAGRGLA LVLLIVITSR VLSPQYLVWL LVLAAAVRPS TAGAERVRAE SSDHSGSTGG RLGWSGWRGR KVDAAGLLAV CAVLSQVVYP WRYNDVVQGR VVGGLLLVAR NAVLVTAAWY ALRAAARESS GDGQSSGDGP PAQSPPTPLP PTPPPGQRRA AGRPGRAVAR WRRAATASRG TAKPT
|
| |