Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0821 |
Symbol | |
ID | 5669237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 959033 |
End bp | 960151 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239750 |
Product | signal peptide |
Protein accession | YP_001505185 |
Protein GI | 158312677 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.582134 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGGCAG AGCGCGGACG CGGCGACGGA CTCGGAACCC ACCGGCGCCG GCGGTGGGTC GCCGCGGCCG CGGTGGCGGC GGCACTATGC TCGGTCGCCC CGACCGTCCC GACGCTGGCG GCCGCCGCCG CCGGTGACCT CCCGGCCGGC GGTGCCACCG AAACCGTCCC GCCCGCGCCC TTCTCGGACC TGCCCGGCCC CTTCGCCGGC CCGTACTCCG AACCGCTCGG ACGTCCCTAC GCCGAGCCGA TGTCGGCGCC GGTCGCGGCG AGCCCCTACG AGCAGTCCCG GATCGAGCGG TACTGGTCCG CCGACCGGCG CGCCCGGGCG CAGACAGCGG ACACGGCCGA GCGCGGCGAC CGGCCGGCTC CGTCCGCGCT GGACGGGGCC GCCACGGCCG GCGAGGACCA GCGCGACGCC ACGCCGCCGC CGCCGAGCAC CGGCGCCCCC TACGTCTACG GCGGGCTGGC GACGAAGACG GTCGGCCGGT TGTTCACCAC GCTGCGCGGC GTCGACTACG CCTGCTCGGC GACGGTCGTG TCCAGCCCGG GCCGCGACCT GGCCGTGACC GCGGGCCACT GCCTGCACGA GGGCACGGGC GACCAGTTCG CGACGAACGT CGTCTTCATG CCCGGATACT CCGAGGGACG GATGCCGTAC GGGCTGTGGA CGGCCCGCCG GATCACTGTC ACCCCGGGCT GGGGCCTGGA CGGGGACTTC GACTACGACA CCGGGTTCGT CCTGTTCAAC GCGCGCGGGG GCCGTCACCT CGAGGACGTC GTCGGCGCCC AGCGCATCGC CTTCAACCAG CCCCGCACCT TCGCGCAGTA CGCGTTCGGA TACCCGCGGC TGGCGCCCTA CGACGGCAAC CGGCTGGTCT ACTGCGCCGG GGCACCCTCC CCCGACCCGT ACGGGACGGT GTCGCTCGGG CTGAACTGCG ACATGACCGG TGGCGCCAGC GGCGGACCGC TGATCATCGG GCTCGGCCGG GCCGGGCCCG GGGCCGGCTG GGTCGACAGC GTGGTCAGCT ACGCCTACGT CGGCGAGTCG CAGACCATCT ACGGCACCTA CTTCGGGCGG GCGATCGAGC TGCTGTACTA CCAGGCGATG GAACTCTGA
|
Protein sequence | MVAERGRGDG LGTHRRRRWV AAAAVAAALC SVAPTVPTLA AAAAGDLPAG GATETVPPAP FSDLPGPFAG PYSEPLGRPY AEPMSAPVAA SPYEQSRIER YWSADRRARA QTADTAERGD RPAPSALDGA ATAGEDQRDA TPPPPSTGAP YVYGGLATKT VGRLFTTLRG VDYACSATVV SSPGRDLAVT AGHCLHEGTG DQFATNVVFM PGYSEGRMPY GLWTARRITV TPGWGLDGDF DYDTGFVLFN ARGGRHLEDV VGAQRIAFNQ PRTFAQYAFG YPRLAPYDGN RLVYCAGAPS PDPYGTVSLG LNCDMTGGAS GGPLIIGLGR AGPGAGWVDS VVSYAYVGES QTIYGTYFGR AIELLYYQAM EL
|
| |