Gene Information |
Locus tag | Franean1_3454 |
Symbol | |
ID | 5671825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4083569 |
End bp | 4084990 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242342 |
Product | hypothetical protein |
Protein accession | YP_001507762 |
Protein GI | 158315254 |
COG category | [S] Function unknown |
COG ID | [COG5361] Uncharacterized conserved protein |
TIGRFAM ID | |
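
The coordinate and length fields above are consistent with each other, which the short sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The record does not state either convention, so both are assumptions, and the function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 4083569
END_BP = 4084990

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included in the gene length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon

gene_length = gene_length_bp(START_BP, END_BP)
protein_length = protein_length_aa(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1422 bp) and Protein Length (473 aa).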
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGAC AGCGGGATCA GGTGGCTGCC ATAGCAGTCG AAGCGTACAT TTTCGCCTAT CCGCTCGTGA CCATGGAGCT GACCCGACTC CAAGCCACCA ACGTCGAGCC CGGAGTGGCG CCGGGCCGGG CCCCGATGAA CCAGTTCGCA CATATTCGGG AGTTTCCCGA CGCCGATTTC AGGATGGTCG TCCGGCCGAA CTTCGACACC CTCTACTCCT CGGCCTGGGT GGATCTGACC GAAGGACCGG TGGTGGTCTC CGCACCCGAC ACGGATAATC GCTACTACAT GTTGCCCATT CTCGACATGT GGACGGATGT CTTCGCCACC CCCGGAAAGC GCTCCAGCGG CACGGCCGCG GCGGACTGGG CGCTGGTGCC GGCCGGGTGG AGCGGGCGCC TGCCGGCGGG CGTGGGACGC ATCGACGCTC CGACTCCGCA CGTCTGGATC ATCGGCCGGA CGCAGACCAA CGGCGAGGCC GACTACGACA CCGTCCACAA GGTGCAGGAC GGATTCCAGC TCTCCCACCT CGCGGACTGG GGGCGCGCTC CGATCGCCGC GACCGCCCGG GCTGTCGACC CGGACATCGA CATGACGACG CCGCCCCTGG ACGTCATCAA CGCCATGACC GGCGAGGAGT TCTTCAGGCG CGCGGCAGAG CTGATGAAGC TTCACCCGCC ACATGTCACG GACTGGTCAC AGATCCGGAG AATGCGCGCG CTCGGCCTGG TTCCCGGCGA GTCCTTCGAC CCGAACCGCC AGGGCCGGGC CGTTCGGGAT GCCGTCGCAG CGGCGCCCCG GACCGCTCAG AAAGCGATGA CCACGCGAGT TTCGACAATA GCGACCGTGT CCGACGGATG GCAGACCAAC ACGGACTCGA TCGGCGTCTA CGGCAACTAC TACATGAAAC GGGCCGCCGT CGCGATGATC GGTCTCGGCG CCAACCCCGC AGAGGAAGCC GTCTACCCGC TGCTGCTCAC TGACGCGGAC GGCGACCCGC TCGACGGATC CGTCGACTAC GTGCTCCACT TCGAGCGCGA CGAGCTCCCT CCGGTCTCCG CGTTCTGGTC GATCACGATG TACGACGAAC GCGGCTTCCA GGTGGCCAAC CGGCTCAACC GGTTCGCCCT CGGAGACAGG GATCCGCTGA CGTACAACGC TGACGGATCG CTCGATCTCC ATATCCAGAT GCGTCCCCCG GATCCGTTCG GGAATCGAAC TGGCTGCCGG CCCCGCTCGG CCCGCTGGGT GTCACGATGC GGCTCTACGC ACCCGACCCC GCGGTCCTGT GCGGAGCATG GTCACCGCCC CCGGTACGGA AGGCCGCGAG CCGCCCCGGC TGACAGCTCC CCGGCGGATC CCACCGACGG GCCGAAGGTC CGGCGTCCCA GGCTTCGGCC CCCGGCAGGA TCGTTGTCGT GA
|
Protein sequence | MTGQRDQVAA IAVEAYIFAY PLVTMELTRL QATNVEPGVA PGRAPMNQFA HIREFPDADF RMVVRPNFDT LYSSAWVDLT EGPVVVSAPD TDNRYYMLPI LDMWTDVFAT PGKRSSGTAA ADWALVPAGW SGRLPAGVGR IDAPTPHVWI IGRTQTNGEA DYDTVHKVQD GFQLSHLADW GRAPIAATAR AVDPDIDMTT PPLDVINAMT GEEFFRRAAE LMKLHPPHVT DWSQIRRMRA LGLVPGESFD PNRQGRAVRD AVAAAPRTAQ KAMTTRVSTI ATVSDGWQTN TDSIGVYGNY YMKRAAVAMI GLGANPAEEA VYPLLLTDAD GDPLDGSVDY VLHFERDELP PVSAFWSITM YDERGFQVAN RLNRFALGDR DPLTYNADGS LDLHIQMRPP DPFGNRTGCR PRSARWVSRC GSTHPTPRSC AEHGHRPRYG RPRAAPADSS PADPTDGPKV RRPRLRPPAG SLS
|
| |
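
Although the gene is annotated on the minus strand, the gene sequence listed above begins with ATG and appears to be given in coding orientation, so it can be translated directly without reverse-complementing. The sketch below is a minimal illustration, not the database's own method. It builds the codon table from the standard amino-acid assignments (translation table 11 uses the same assignments as the standard code and differs in permitted start codons) and translates short excerpts copied from the start and end of the sequence. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}

def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    seq = dna.replace(" ", "").upper()  # tolerate the 10-base blocks used in the record
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First 30 bases and last 15 bases of the gene sequence above.
first_residues = translate("ATGACCGGAC AGCGGGATCA GGTGGCTGCC")
last_residues = translate("GGATCGTTGTCGTGA")
print(first_residues, last_residues)
```

The two excerpts translate to the first ten residues (MTGQRDQVAA) and the last four residues (GSLS) of the protein sequence listed above. Passing the full gene sequence to `translate` would be the natural way to check the remaining residues.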