Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0222 |
Symbol | |
ID | 5668647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 270648 |
End bp | 271739 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239151 |
Product | hypothetical protein |
Protein accession | YP_001504595 |
Protein GI | 158312087 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.338684 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTGG ATGCCCGTCG GGAAGTTCCT ACCGGAACAC GGTCGAGCGA GGTCACGTAC GACCATCTGA TCATTCCGCC GCACTGGCGG CGGCCGGCGG GGCCCGGGCC GGGGGAGCCG AACCCGCTGT TCCCCGCGGG CGAGCAGGCG ACGGCACCGC CGCCCGGCCC CGCCGTGACC GGCGCGTCTC CGATCAATCC GATGACCGGC CCGATGACCG GTCCGCTGCT GGTCGGCACG ACTGCCCGCG CCTTCGCCAA GCTGGACGAC ATGGTCCGGC TCGGTGACGA CGACCGCGCG GTGATCGACG ACCGACGGGC GGAGGCGGAG CGGGCGCTGC GGTCGATCTT CCCGCCGCGC TGCGCGCTGC CCCTGGTGGG CGTGGCCACG ATCGGGTCCG CCGGGCGCGA CACGATGATC CGTCCGCTCG ACGAGGTGGA CATCTTCGTG GTCTTCAGCG CGGCGAACAG CGCGTGGAAG CGCTTCCGGT GGGATTCTCG CGACCTGCTC GTCTGCGTCC GCAACGCCAT CGGTGGCGAC CGGGTGCAGA CGATCGGCAC CCGCGGCCAG GCGCTGCGCA TCGTCTACGA CGCCGCGCCG GACGTCCACC TCGTGCCGGC CTTCGACCAC CCCCGCGCCG GCTACGTCAT CCCGGACAGA GTGGGCGGCT GGCTGCCGAC CCGGCCGGAG CGGCACGCGA GCTGGACGAT GGACCTCGGC CCGCGGGTCA TCTCGGCGGT CCGGCTGCTC AAGGCGTGGA ACCGGGTGTG CGGCAGCCAC CTGCGCTCGT TCCACATCGA GGCGCTCGCG GGGCAGGTGC TCGCGGGCCG CGGTCTCAAC ACGCGCCAGG GCCTCGCCGA GGTGTTCCGG CACATGGACG AGGTCGGCCT CGTGGTCGGC GATCCGTCCG ACATCCGCGG TGACCTGTCC AGCTACCTCC GCCAGGACGA CCTCGAGGAT CTTGGCGCCT TCGTCCGCCA GGCACGCACC TACTCGGCCA AGGCGGTCGC GGCCGAGCGC GCCGGGGACC ACGAGGAGGC CGTCAGCCTG TGGGGCACCG TCTTCGGCCC GGAGTTCCCG ACCTTCGGGT GA
|
Protein sequence | MSVDARREVP TGTRSSEVTY DHLIIPPHWR RPAGPGPGEP NPLFPAGEQA TAPPPGPAVT GASPINPMTG PMTGPLLVGT TARAFAKLDD MVRLGDDDRA VIDDRRAEAE RALRSIFPPR CALPLVGVAT IGSAGRDTMI RPLDEVDIFV VFSAANSAWK RFRWDSRDLL VCVRNAIGGD RVQTIGTRGQ ALRIVYDAAP DVHLVPAFDH PRAGYVIPDR VGGWLPTRPE RHASWTMDLG PRVISAVRLL KAWNRVCGSH LRSFHIEALA GQVLAGRGLN TRQGLAEVFR HMDEVGLVVG DPSDIRGDLS SYLRQDDLED LGAFVRQART YSAKAVAAER AGDHEEAVSL WGTVFGPEFP TFG
|
| |