Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4130 |
Symbol | |
ID | 5672488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4913097 |
End bp | 4914023 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243006 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001508423 |
Protein GI | 158315915 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.745162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.353762 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCCA CCACCACTGT CAGCTCGGGA GATCTCCCCG CCGGTAACGC GCCCGCCACC CGCGGCAACG GACGGTTCAT CATGGTCGCT GTGGCCCTAC CCGATCCGCT GCGCTGCGAC TGCGGGAACT GCTTCGGCCT GTGCTGCGTG GCACCGGCCT TCGCCGCCTC CGCGGACTTC GCCATCGACA AGAAAGCCGG GCAACCCTGC CCCAACCTGC AGGCGGACAA CCACTGCGGT ATCCACGGCC AGCTACGCGA CCGCGGCTTC ACCGGATGCG CCGTCTTCGA CTGCTTCGGT GCGGGACAAC ACGTCAGCCA GGTCACCTTC GACGGACAGG ACTGGCGGCA CACCCCCCGC CTCGCCCGAC AGATGTTCGC GGTCTTCCCC ATCATGCGCC AACTGCACGA ACTGCTCTGG TACCTGACCG CCGCACTCGA CCTCGGGCCG GCCCAGCCGA TACACCCCGA GCTGGCCGCC GCCCACGACC AGACCCGACG ACTCACCCAC GCCACCCCAG AGCAACTCAC CGCGCTGGAC ATCACCCGCC ACCGCCATGA CGTCAACCTG CTCCTGCAAC AGGCAAGCGA CCTGGCACGC GCACGGGCGG GCCGGCGACG GCCGCAGTAC CGCGGTGCTG ACCTCACCGG CGCCAACCTG CGCCGCACCG ACCTGAACGG GGCGAACCTG CGCGGAGCAC GACTCATCGG CGCCGACCTG CGCAACGCAG ACCTGACCAC AGCCGATGTG ACCGGCGCCG ACCTACGCGC CGCGAACCTG CGCGGCGCAC GCCTCGCCGA CACCCTCTTC CTGACCCAGA CACAGCTCGA CTCCGCGCAC GGCAACCTCG ACACCACCAT CCCCGCAACG CTGAGCCGGC CCCGCCACTG GCACGCCAAG CCTTCTCCGA TAGCGGATGC GGTCTGA
|
Protein sequence | MTPTTTVSSG DLPAGNAPAT RGNGRFIMVA VALPDPLRCD CGNCFGLCCV APAFAASADF AIDKKAGQPC PNLQADNHCG IHGQLRDRGF TGCAVFDCFG AGQHVSQVTF DGQDWRHTPR LARQMFAVFP IMRQLHELLW YLTAALDLGP AQPIHPELAA AHDQTRRLTH ATPEQLTALD ITRHRHDVNL LLQQASDLAR ARAGRRRPQY RGADLTGANL RRTDLNGANL RGARLIGADL RNADLTTADV TGADLRAANL RGARLADTLF LTQTQLDSAH GNLDTTIPAT LSRPRHWHAK PSPIADAV
|
| |