Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5873 |
Symbol | |
ID | 5674196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7128302 |
End bp | 7129561 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244723 |
Product | hypothetical protein |
Protein accession | YP_001510125 |
Protein GI | 158317617 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0436897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.694772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGGAA ACAGCGCACC CACGCCGAAC GGTGCCGAGC CCGACCCGTT CGCCGACCCG TTCGCCGATC TGGTCCTTGA CGAGGCCTTC ATCGCCGGAG CCACCCGATA CGAGGCCCCG GCCCGGACGA GAGCCGCGGT CGCCCGCTTC GGGCCGCTCG AAGACGGTTC AACGCCCTGG CGCTCGTACG GCGGCGGCGG CCGGCCACGG CGGGCGGGAG GTGCCTCCGG ACCACGGCTG GGCCGCGGAC GGGCGTCCCA GTCGATTCCG GAGATCTCGC CGCGGCCGCG GTCCCGGGGG CGGATCGTGC TGGCACTGAT CAGCGTCGTG CTGCTCTCGA CCACCGTCTA CGGGTTCGTG GCCTCGACCG TCGACAGTCC GCCGTCCACC ACGCTCTCGG CGGACCCGCC GACCCCGGCA GGGAGTGCCT CCGGTCGCGA CGCCATCGCC GAGGAATCGC TACCGGATCT CTACCGGCGG GACTGGACGG CCGGCCACTG CTACACCTGG CTCCAGCGGG ACGCCTCGAC CGCGGTCAAC GACGTCCCCT GCGCCGGCCC GCACCTGTTC GAAGCGGTCG GGCCGCTCGA CATCGGCTCC GCCTATCCTG AGGGGACGAC ATACCCGTCG CCCGCCGAGT GGGTGGCGCT CGGCAGGCGG CAGTGCGAGC CGCTCATCAC CGCGTATCTC GGCTACGGGC TCGACCCGTT CGGGCGCTTC GGCACGAGCG TCCTCCACTC CAAGGAAGCG GAGTGGAACA CCGGCGAGCG GGACATCGTC TGCGGCCTGT CACTCCATCC CTCGCCCACG GCACCGTACG AGGTGCCCGA CCTCGAAGGA CAGGTCCGCG GAGCCGACCA GGCGCTGACC TATCCCACCG GGACCTGCTT CCGTACGAAC ACCGAGGGCC GCAACGAGGT GGTCCCCTGC GAGCAGAGCC ACCATTCGCA GAGCGTGGGC ACGGCGACCC TGGCGGACAC CGCCGAGGGC GCACCGAGCA GTGCGCCCCT CTCGGCGGAG CGGCTCGGCG AACTCATCGA CGCGGCCTGC GCACCCCGCA TCGCGCCCTA TCTCGACCGA GGATTCGGGG GCGCATCCGT ACAGGGCGGA TCGCATTCCC TCCCCCCGGA AAGCTGGTGG GCGGGAACCC GATCCACCAC CTGCACCGTG AGCCTCGTCG ACCGGAACGG CAGAAAACTG GAGACGACCG GCCTACTCAG CCCGGACGAT GAATCGGGGT CGTTCAGCGC GAACGTGTGA
|
Protein sequence | MSGNSAPTPN GAEPDPFADP FADLVLDEAF IAGATRYEAP ARTRAAVARF GPLEDGSTPW RSYGGGGRPR RAGGASGPRL GRGRASQSIP EISPRPRSRG RIVLALISVV LLSTTVYGFV ASTVDSPPST TLSADPPTPA GSASGRDAIA EESLPDLYRR DWTAGHCYTW LQRDASTAVN DVPCAGPHLF EAVGPLDIGS AYPEGTTYPS PAEWVALGRR QCEPLITAYL GYGLDPFGRF GTSVLHSKEA EWNTGERDIV CGLSLHPSPT APYEVPDLEG QVRGADQALT YPTGTCFRTN TEGRNEVVPC EQSHHSQSVG TATLADTAEG APSSAPLSAE RLGELIDAAC APRIAPYLDR GFGGASVQGG SHSLPPESWW AGTRSTTCTV SLVDRNGRKL ETTGLLSPDD ESGSFSANV
|
| |