Gene Information |
Locus tag | Franean1_5438 |
Symbol | |
ID | 5673769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6579199 |
End bp | 6581511 |
Gene Length | 2313 bp |
Protein Length | 770 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641244293 |
Product | hypothetical protein |
Protein accession | YP_001509699 |
Protein GI | 158317191 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
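The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not the database's own method: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names span_length and protein_length are hypothetical.

```python
# Values copied from the Gene Information record above.
START_BP = 6579199
END_BP = 6581511
GENE_LENGTH_BP = 2313
PROTEIN_LENGTH_AA = 770


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span 6579199 to 6581511 covers 2313 bp, and 2313 bp is 771 codons, of which 770 encode residues.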
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.180108 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0529249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCACG GGCCCATTAC GCATGTGCCG TGGACAGATG TAACGTTCAT CTGTCCACGG CATCCCCAGG GCATGTGCAG AGACCTCCTG GGGACGTCGA CTACTGCCGA TTCGGGGGAA CAGGCAGGTG ACCCGGTGAT CGCATCACAG GCGAACGCAC CCAGCGAAGA ACTCGTCGCC AGAATTTTCG GTCCGAACGA ACGCTGGGGC TGGGAATACG TGCCCCCGCA TCCCGCGACG GCGCACCACG AGGCCTTCCA GCCGCCGTCG CGGATCCAGA TTCCGGTGCC CGACCTGCGG GCGCTCGAGT TCCGCAAGCG ACAGTTGTCA GGGAAGATCT GGCGGCCGAT CGTCGGCGGT CTGTTCCTGC TCGGCGGGCT CGGGACGCTT GCCGGATCAG TAGGCGGCCT CGGTCCTCTC CTCATCGGCG TCGTCCTGCT CGCCTGGTAT TTTGTACCGA TTCAGACGGT CAGCAATCAA ATGAAGTCCA TCCAGGCGGG CTACCAGGCG GAGATCGCCC GCCGCGAGCA GGACTACCAG GTCGCCTATG CCGCCTGGCA GCAACGCATC CACCAGCACG ACCAGGCCGA GCAATACCGG GTCGGATCGG CGCTGGAGTT CTATCCGTTG GATCCGCAGC GCCCGGCGCG GATCGACGTC TTCGGTGGCA CCGGCGCCGG CTGGACAAGC CTGCTGGCCA CCGGCGGGAC GTCGCTGCTC GCGGGCGGGT CCGGAATTCT GCTGCTCGAC CTGTCGGAGC TCAGCGTCGG CGCCGGCCTG GTCATGCTGG CCAACAACGC GGCCACGCCG ATCTCGGTCG ACGTCCGGGA GCTGCCCGGC TCGCTGGAAC GGATCGGCCT GCTCGGGGAG CTCGACCTGC GGCAGGTCGC CGAGCTGCTC GCCGACGCCT TCGACGCGGA CCGCCGGGGC GGGGGGGACC AGAGCCTGCG CGCGATCGAC TTCAACATCC TGCGCAGCGT TGCCTCCTCG ATTGAGGCGC CGCTGACCTT CGCCCGGCTC GCCGCCGCGT TGCGCATCCT CGACAACCAG AGTTCGGCCG TCGGCGAGGG CGTGTTCAGC GACTACGAGG TCCAGGCGCT GCAGCAGCGA ATGTATGACC TCGGCCAGCG GGAGCGCACG GCCGACCAGA TCAGCTTCCT GCGCACGGAG TTGGAGACGC TCGCCGGCTC CGACCCCACG GCGGCCGATG TCCCGCCGGC CGCACCGTCG GCCTGGTGGC CGGGCGGCGG GCTGCGGGTG CTGGCGACGA CAAGCTCGGG GCGCGGCAGC TCCAAGCGGC GCAAGCTGCT CACCGACCGG ATTCTGGTCG AGCGGTTGCT GCACCAGCTG CGCAGTCACG AGCGCGCCGC TTCCAACGAC GTCGTCGTGA TCGCCGGCGC CGACCACCTC GGCCGGGAAA CACTGACGAC GCTGACCCGG CAGGCGGAGG TCGCCCACGT CCGGCTCGTC CTGCTGTTCA AGAACCTCAG TGACGACGCG GAAAGGCTGA TCGGCACCGG CGACAGCGCG GCGATCTTCA TGCGGCTGGG TAACGCCCGG GAGGCCTCAA GCGCGGCCGA TCACATCGGT AAGGGCTTCA GTTTCGTGCT CTCCCAGGTC ACCAACCAGA TCGGCGACAG CTTCACCGAA GGGTTCGCCA ACAGCTACGG CGAGCAGGAC GGCACCGCGT TCACCCGGGG AGAGGGCCGC ACCAGCGGTT CCGGCCCCGG CGGCGGAAGC AGCGGCCGGA ACTGGAACCA GTCGAGCACG ACGTCGCATT CGTCGACGTG GACCAACACC 
GTCAACGTCT CCAACACGGT GAGCCGGAAC CTCGGCACGA CTCTGCAGCG GGAGAAGGAC TACACCGTCG AGCCGACCGT GCTGCAGAGC CTGGCCGCGA CCGCGTTCAT CCTGGTGGGC ACGTCCAGCG GTTCCGGTCG GGTCCGCCCC GGCGACTGCA ACCCAGGTCT TTTGCTGCTG CCCAAGGTCG CGGACAGTCC CCGGGACCTC ACCGCAGCGC CGCACACCCA CGAGGCGGGG GACCCGAACC AGGGCGGCGC CACCGCCACG CACCCGGGCC CGGTGTCACA GCGGGAGTTC CAACATCAGC TGCCGCCGGG GTACTCGCAC CCTCAGTCGG GCTACCCGCA GCAGCAGCCG GGCTACCAGC ACACTCAGCC GGCCTATCCG CAGCAGCAGC CGGGCTACCC ACCGTCGGGC TACCCACCGT CGGGCTACCC GCCGTCGGGT TACCCGCAGC AGCAGCCCGG CTATCCGCAA CAGCCGCCAC ACACCTGGCC GGGACAGCAG TAG
|
Protein sequence | MLHGPITHVP WTDVTFICPR HPQGMCRDLL GTSTTADSGE QAGDPVIASQ ANAPSEELVA RIFGPNERWG WEYVPPHPAT AHHEAFQPPS RIQIPVPDLR ALEFRKRQLS GKIWRPIVGG LFLLGGLGTL AGSVGGLGPL LIGVVLLAWY FVPIQTVSNQ MKSIQAGYQA EIARREQDYQ VAYAAWQQRI HQHDQAEQYR VGSALEFYPL DPQRPARIDV FGGTGAGWTS LLATGGTSLL AGGSGILLLD LSELSVGAGL VMLANNAATP ISVDVRELPG SLERIGLLGE LDLRQVAELL ADAFDADRRG GGDQSLRAID FNILRSVASS IEAPLTFARL AAALRILDNQ SSAVGEGVFS DYEVQALQQR MYDLGQRERT ADQISFLRTE LETLAGSDPT AADVPPAAPS AWWPGGGLRV LATTSSGRGS SKRRKLLTDR ILVERLLHQL RSHERAASND VVVIAGADHL GRETLTTLTR QAEVAHVRLV LLFKNLSDDA ERLIGTGDSA AIFMRLGNAR EASSAADHIG KGFSFVLSQV TNQIGDSFTE GFANSYGEQD GTAFTRGEGR TSGSGPGGGS SGRNWNQSST TSHSSTWTNT VNVSNTVSRN LGTTLQREKD YTVEPTVLQS LAATAFILVG TSSGSGRVRP GDCNPGLLLL PKVADSPRDL TAAPHTHEAG DPNQGGATAT HPGPVSQREF QHQLPPGYSH PQSGYPQQQP GYQHTQPAYP QQQPGYPPSG YPPSGYPPSG YPQQQPGYPQ QPPHTWPGQQ
|
| |
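The gene sequence is listed in coding orientation (it begins with ATG and ends with TAG), so its relationship to the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: it builds a codon table from the amino acid string NCBI publishes for translation table 11, and the helper name translate is hypothetical. It ignores alternative start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acid string for NCBI translation table 11, with codons ordered
# by base in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Spaces (as used in the 10-base blocks above) are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGCTGCACG GGCCCATTAC GCATGTGCCG"))
    # Last 18 bases of the gene sequence above, including the TAG stop.
    print(translate("TGGCCGGGACAGCAGTAG"))
```

The first call yields MLHGPITHVP, the first ten residues of the protein sequence, and the second yields WPGQQ, its last five residues.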