Gene Information |
Locus tag | Franean1_3892 |
Symbol | |
ID | 5672253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4656378 |
End bp | 4657625 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242771 |
Product | hypothetical protein |
Protein accession | YP_001508188 |
Protein GI | 158315680 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
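The coordinate and length fields above are mutually consistent, which a short arithmetic check makes explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4656378
end_bp = 4657625

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced ("Is gene spliced | No"), so the whole
# span is coding; the final codon is the stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1248
print(protein_length)  # 415
```

Both results match the Gene Length (1248 bp) and Protein Length (415 aa) rows.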
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.361535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.109076 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTGT CACTCTGGCG ACTCGCCTCT CCTGGGTCCG CGTCCCGCGT ACCTCCCTTC CTTCCTGCGC CGAGGAGAGC AGCACACGTG ACGAGACTTC GTACGGTGGC GGCCCTGGTC TGTTCCGTGG GGCTGGCCTT CGGCCTGGCC GCCTGCGGCG ACTCCCAGGA CGAGCCCGAG AACGCCGGTG TTGCCGGCGG GACGGCCGCC GGGGCGAGCT ACACCACCCC GCTCAAGGGG GTCTGCCCGG ACACCGTCGT CGTCCAGACG AGCTGGTGGC CCGAGGTCGA CTACGGCGCC ACCTACCAGC TCCTCGGCGC GAACCCGAAG ATCGACGCGG GCAAGTTCCG GGTGACAGGG CCGCTCGGCG CGACCGGCGT GAACCTCGAG ATCCGGTCCG GCGGCCCGGC GGTGGGCTTC GACTCGGCGT CGTCGTTGTT CGAGACGGAC GACGACATCC TGCTCGGCTA CCTGGACATG GACGAAATGA TCAGCAACTC GGCGGACCAT CCGTCAGTCG CGGTCCTCGC CCCCTACGCC AAGTCGCCGC TGATGTTCTT CTGGGGTGAC CCGTCGCTCG ACTTCACCAC GCTGGCCGAC ATCGGCCGGT CAGGGAAGAC GGTGCTCGCC AGCGACGAGC CCTATCTGGA CGTCCTCATC GGCGCGGGCC TGCTCCAGCG TCCGCAGGTC GACACCTCGT ACGACGGGGA GATCAGCCGG TTCGTGGCCG AGGACGGCAA GCTCATCCAG CTCGGGTTCG TCACCGACGA GCCCTACCGG CTCGAGCACG ACGTCAAGGA GTGGTCCAAG CCGGTGAAGT ACGTCCTCGT CGGGGACGAC TACCCGGCGT ACGCGAACGT GCTCGCCGTC CGCAAGGACA AGCTGGCGGC GAACCGGGCG TGCCTGGACG CGCTGGTGCC GCTGTTCCAG CGGGCGATGG TGGACTACCA GGCCGACCCG AAGCCGGCCA ACGATCTTAT GATCGAGATC ACCTCGAAGC TGGACACCGG GGGCTACGAG CTGTCGACCG GCCTGCTCGA GGACGGGAAC ACCAAGCAGC GGGAGCTCGG GCTGGTCGCG AACGGGGCCG ACGGCGTGTT CGGGAGCTTC GACACCGCGC GGGTCCAGAA CCTGATCGGC CGGCTGACGC CCGTCCTCAC CGCCGCCGGC ACGGCGCCCG CCGCCGGGCT GACCGCGGCT GACGTCGTCA CCGACGAGTT CATCGACCCG TCCGTCTCCC TGAAGTAG
|
Protein sequence | MRLSLWRLAS PGSASRVPPF LPAPRRAAHV TRLRTVAALV CSVGLAFGLA ACGDSQDEPE NAGVAGGTAA GASYTTPLKG VCPDTVVVQT SWWPEVDYGA TYQLLGANPK IDAGKFRVTG PLGATGVNLE IRSGGPAVGF DSASSLFETD DDILLGYLDM DEMISNSADH PSVAVLAPYA KSPLMFFWGD PSLDFTTLAD IGRSGKTVLA SDEPYLDVLI GAGLLQRPQV DTSYDGEISR FVAEDGKLIQ LGFVTDEPYR LEHDVKEWSK PVKYVLVGDD YPAYANVLAV RKDKLAANRA CLDALVPLFQ RAMVDYQADP KPANDLMIEI TSKLDTGGYE LSTGLLEDGN TKQRELGLVA NGADGVFGSF DTARVQNLIG RLTPVLTAAG TAPAAGLTAA DVVTDEFIDP SVSLK
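The protein sequence can be reproduced from the gene sequence, and the GC content recomputed, with a few lines of code. The following is a sketch rather than the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in the permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G; first base
# varies slowest). Table 11 shares these codon assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "ATGCGGTTGT CACTCTGGCG ACTCGCCTCT"
print(translate(prefix))  # MRLSLWRLAS
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` should allow the Protein sequence and GC content rows to be checked in the same way.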