Gene Information |
Locus tag | Franean1_6364 |
Symbol | |
ID | 5674680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7725346 |
End bp | 7726779 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245213 |
Product | hypothetical protein |
Protein accession | YP_001510608 |
Protein GI | 158318100 |
COG category | |
COG ID | |
TIGRFAM ID | |
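The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 7725346
end_bp = 7726779

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon,
# which does not contribute an amino acid to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1434 477
```

Under these assumptions the result agrees with the Gene Length (1434 bp) and Protein Length (477 aa) fields.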
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTGA AACGAGCAGT CCGGAGCGCG GCGCTGGTGG CGGTGGCCGC CGCCACGTTC GCCGGCTGCG TACCGGTCAG CCCGGGCGGC GGAGGCGGCT CCGTCCCGAC CGCCTCGCCG TCCATCCCCC CGACCGCGCC CTCGTCGCCG TCCGGCACTC CGGCAGGCAC TCCGACCGCC TCGCCGTCGG TCCGGCCGGC CGTGCCGTCG TCGCCGACCG GCACCCCGGT GGGCACCCCG ACGCCTTCGC GGACCACGAC CCCGTCCGGA CCGACGTCCG GTGACTGCGG GCGGACGTCG GGAGCCTCAC CGGCGACGAG GATCACCGAG GTCGGCCTTG GGTCGCCGGT GGTCAGCTTC GCGCCGCAGG GCGACACGGA CCCGTTGCCG ACCGCGATCG CCGCCGCGCC GGGCGGGGGA TCCTGGCTGG CCTGGCTGGG CACCGACTCC AAGGTGCGCC TCGGCAAACT GGACTGCGGA GACCAGCTGG TCGGTACTCC CACATCCCTC GACGCGGTCG ACCTGCAGGA CGTCAAGGCC GATGCGGACG GTGTCGTGGT CCTGCTGACC CGTCCCGGCC CGCAGGGCAG CGGGACGCTG TGCGGCGGGA CGTCCAGCCC GACCAGGACC ATGTGGATGG TCCGCCTCGA CAACACCGGC AGACAGCTGT GGGAGCGTCA GGTCACCAAC CTGAGCAGCA GCCGCGGCGG CTACGACCCC GGGGCTCTGT TCGTCTGGTG GTACAACCAC CACGGCACCC TGGCCTACGA CGGCACCAAC TACGCCGCCT ACTTCGAGGC GGCGATCACC GTGGCCAACG GCGGCTGCGT CGACATCCAC GAGGGTGACC GGATGCAGGT GGTCAACGCC GCCACCGGCG CCCTGGTCTC CGGGCACGAC AGCTTCGACT GGGGCTGCAG CCACGCCTGG GACTCCCACA TCATCTGGGA CGCCCGCACC GGCCACTTCG CGATGGTCTG CGCCACGGAC AACAACTGCC GCATCGCCCG CCCCGGCACC GGCCAGACCG TCGTGCCCGG GGTCTGTGAC GGAACCCTGT TCGGCGGCAA CATCGTGCTG GCCGGCACCC CGGGCTACTG GACCGCGTGG AGCAACGGCA ACCAGGTACG GCTGGAGCAC TTCTCCACGG GCGCGTCCGA CCGGACGGTC CTCACCGCCG ACCGGACCCA GCACTCGCAC CTGGTCGGCT ACGGCGCCGG CAGGATGCTC CTGACCTGGA AGTCCGGAAC CTCGACCGCC GCCCAGGCGT ACGACACCAC CAGTGGCGGC ACCGTGGGCG GCCAGTTCAC GATCGCCGTG CCGGACCACA CCTACGTCGA GGCCAAGGCC TACCCCGACG GCAGCGTCGC CTTCCCCGCC GCCGGCACTT CCAACACGTC CATCCGGGTC GTTCGGATAA TGCCGCTCAC CTGA
|
Protein sequence | MSLKRAVRSA ALVAVAAATF AGCVPVSPGG GGGSVPTASP SIPPTAPSSP SGTPAGTPTA SPSVRPAVPS SPTGTPVGTP TPSRTTTPSG PTSGDCGRTS GASPATRITE VGLGSPVVSF APQGDTDPLP TAIAAAPGGG SWLAWLGTDS KVRLGKLDCG DQLVGTPTSL DAVDLQDVKA DADGVVVLLT RPGPQGSGTL CGGTSSPTRT MWMVRLDNTG RQLWERQVTN LSSSRGGYDP GALFVWWYNH HGTLAYDGTN YAAYFEAAIT VANGGCVDIH EGDRMQVVNA ATGALVSGHD SFDWGCSHAW DSHIIWDART GHFAMVCATD NNCRIARPGT GQTVVPGVCD GTLFGGNIVL AGTPGYWTAW SNGNQVRLEH FSTGASDRTV LTADRTQHSH LVGYGAGRML LTWKSGTSTA AQAYDTTSGG TVGGQFTIAV PDHTYVEAKA YPDGSVAFPA AGTSNTSIRV VRIMPLT
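The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is an illustration only, not the method the database used: `clean`, `translate`, and `gc_content` are hypothetical helpers, and the codon table is built by hand from the amino-acid assignments of translation table 11, which match those of the standard code. Only the first 30 and the last 18 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 18 bases of the gene sequence listed above.
head = "ATGTCCTTGA AACGAGCAGT CCGGAGCGCG"
tail = "ATAA TGCCGCTCAC CTGA"

print(translate(head))  # MSLKRAVRSA
print(translate(tail))  # IMPLT*
```

The two fragments reproduce the start (MSLKRAVRSA) and the end (IMPLT, followed by the stop codon) of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value that can be compared with the GC content field.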