Gene Information |
Locus tag | Franean1_5742 |
Symbol | |
ID | 5674068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6978730 |
End bp | 6979665 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244595 |
Product | hypothetical protein |
Protein accession | YP_001509998 |
Protein GI | 158317490 |
COG category | [S] Function unknown |
COG ID | [COG2836] Uncharacterized conserved protein; [COG4633] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
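The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 6978730
end_bp = 6979665

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the table states) and ends in a
# stop codon, so the protein has one residue fewer than there are codons.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 936 0 311
```

Under these assumptions the result agrees with the listed gene length of 936 bp and protein length of 311 aa.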
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTCGC CGCAGGCAGG GAAGGAACCG GACTGGCGCG CGCAGCTCGG CGATGACCTC GTGCCAGTCG GTGGGTTCCT GGCCGGCAAG CTGGTCTCGC ACGTGCTGCT CGGGGCGCTG CTGGGAGCGG CTGGCGGCGC GGTGGAGCTC TCGGTCGACG CGCGCACCTG GCTACAGATC GGCGCCGGCC TGCTGATCAT GTTGTTCGGT CTCGCCCAGC TGGGCGTACC CGGATTCCGC CGCGTCGTCG TGGAACCGCC CGCGTCGTGG ACGAGGATCG TACGCAGGCG CGCCCGTTCC CAGGCCGCGT TGGCACCGGC GCTGCTCGGC CTGGCCACCG TGCTCATCCC GTGCGGTGTG ACGCTGTCGG TGGAGGCGCT GGCACTCGCG TCCGGGTCCG CGCTGGCGGG CGCGGCCACG ATGGGGGTGT TCGTCCTTGG CACCGGCCCG CTGTTCGCCG TCCTCGGCTA CGCCGCCCGC AGGGCCGCGA CCGCGTGGCG CGGCCGACTG GCCGTGGTCA CCGGACTGGT CGTGCTGGCG ATGGGGCTCT ACACCCTCAA TGGCGGCCTG CAGCTCGCCG GCTCGCCCCT GGCCGCCAAC CGGCTCGCCG AAACCCTCGG TCTCTCCCAG CCACCGGCGG CCGACGCGTC CGCCGCCTCG ACCGCCGAGG GCCGCCAGAC CGTGGAGATC ACCGCCCGCG CCGACTCCTA CAGCCCGGGC AACGTCCAGG TGCAGGCCGG TGTGCCCACC ACACTGGTCG TGCACTCGGA CGACGTGCAG GGCTGCATCC AGTCGTTCGT CATCCCTGAC CGCGATGTCG AGGAGATTCT GCCGGCCCAG GGTGACACGG CGATCGACCT GGGCGTCCTG GAGCCCGGCC AGCTGCGCTA TGCGTGCGGC ATGGGCATGT ACACCGGCCT CATCACCGTC GTCTGA
|
Protein sequence | MRSPQAGKEP DWRAQLGDDL VPVGGFLAGK LVSHVLLGAL LGAAGGAVEL SVDARTWLQI GAGLLIMLFG LAQLGVPGFR RVVVEPPASW TRIVRRRARS QAALAPALLG LATVLIPCGV TLSVEALALA SGSALAGAAT MGVFVLGTGP LFAVLGYAAR RAATAWRGRL AVVTGLVVLA MGLYTLNGGL QLAGSPLAAN RLAETLGLSQ PPAADASAAS TAEGRQTVEI TARADSYSPG NVQVQAGVPT TLVVHSDDVQ GCIQSFVIPD RDVEEILPAQ GDTAIDLGVL EPGQLRYACG MGMYTGLITV V
|
| |
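The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch only, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the amino acid string is the standard assignment used by translation table 11, and alternative start codons (which table 11 permits) are not handled because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCGGTCGC CGCAGGCAGG GAAGGAACCG"
print(translate(first_blocks))  # prints: MRSPQAGKEP
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.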