Gene Information |
Locus tag | Franean1_4871 |
Symbol | |
ID | 5673211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5844078 |
End bp | 5845043 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243726 |
Product | hypothetical protein |
Protein accession | YP_001509142 |
Protein GI | 158316634 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
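The coordinate and length fields above are related by simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (which agrees with the 966 bp length listed) and that the final codon of the CDS is a stop codon that contributes no residue. The function names are hypothetical, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(5844078, 5845043)
print(length, protein_length_aa(length))  # 966 321
```

Both values match the Gene Length and Protein Length rows of this record.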
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0785474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.13747 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGCGGCT CCACCGACCG TTTCAGCAGG CTGCTGGCCC TCGTGCCCTG GCTGCGTGCT CATCCCGGCG TGTCACTCGA GGCCGCGGCC GCGGAGTTCG ACGTCACGGT GCGCCAGCTG CGCGACGACC TCGACCTGCT GTTCGTCTGC GGGCTGCCCG GCGGCGCGCC CGGCGACCTC ATCGACGTCA GCTACTCCGG CGACCAGGTC ACCGTCGTTG ACCCCCAGAC GCTCGACCGC CCGCTGCGGC TGTCGGCGGA CGAGGCCACC GCGCTGCTCG TCGCCGCCCG GGCGCTCACC GACGTCCCCG GCCTGGCCGG GCGCGAGGCG CTCGATCAGG CACTGCGCAA GGTCGAGGAC GCCGTCGGCA ACGAGCCGGC CGGGCATGTG CGTGTCGCGC TCGACACCCA CGACGAGGTG CTCGCCATCC TGCAGGGGGC GATCAGGGAC CGCCGGCGCG TCCGGTTGCG TTACCTGGTG TGGTCGCGGG ACGAGATGAC GGTGCGCGAC GTCGACCCGA TGCGGGTGCT CGTCCGGGAC GGGCACTGGT ACCTCGAGGG ATGGTGCCAC CGGGCGTCGG CGGTGCGGCT GTTCCGGCTC GACCGCATCG ACGGCCGCGG TGGGGCCGTC GTCCTGGACG TCGTCGCGCA GCCGCCGCCG ACCGCGACGC CGCGCGACAC CGCGGACGGC GTGTACCGGC CGGGGCCGGA CGACATCCCG GTGCGCCTCG ACCTGGAGCC GCAGGCCCGC TGGGTCGTCG ACTACTACCC TGTCTCCGAC GTGTGTGAGC TGCCCGACGG GGGAGTGTCG GTGCTGCTGC GGGTCTCCGA GCTGTCGTGG CTGAACCGGC TGGTCCTGGG CCTGGGTGCG CAGGTGCGGC GGGTCGAGCC GGCCTCGGTG GCCGCGCGGG TCCACGAGCT GGCCGATCGT GCCCTCGCCG CCTACGCCGA GGAGCCCATA GGGTGA
Protein sequence | MSGSTDRFSR LLALVPWLRA HPGVSLEAAA AEFDVTVRQL RDDLDLLFVC GLPGGAPGDL IDVSYSGDQV TVVDPQTLDR PLRLSADEAT ALLVAARALT DVPGLAGREA LDQALRKVED AVGNEPAGHV RVALDTHDEV LAILQGAIRD RRRVRLRYLV WSRDEMTVRD VDPMRVLVRD GHWYLEGWCH RASAVRLFRL DRIDGRGGAV VLDVVAQPPP TATPRDTADG VYRPGPDDIP VRLDLEPQAR WVVDYYPVSD VCELPDGGVS VLLRVSELSW LNRLVLGLGA QVRRVEPASV AARVHELADR ALAAYAEEPI G
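The sequences in this record are printed in space-separated blocks of ten characters. As a rough sketch of how the GC content field could be recomputed from such a string, the helpers below strip the whitespace and count G and C bases. The names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


fragment = "GTGAGCGGCT CCACCGACCG"  # first two blocks of the gene sequence
print(len(clean_sequence(fragment)), gc_percent(fragment))  # 20 75.0
```

Passing the full gene sequence string to `gc_percent` would give a value to compare against the GC content row.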