Gene Information |
Locus tag | Franean1_4011 |
Symbol | |
ID | 5672370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4790490 |
End bp | 4791815 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641242888 |
Product | gas vesicle synthesis GvpLGvpF |
Protein accession | YP_001508305 |
Protein GI | 158315797 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
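The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    # Inclusive coordinates: both endpoints belong to the gene.
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the Gene Information table.
gene_len, protein_len = cds_lengths(4790490, 4791815)
print(gene_len, protein_len)  # 1326 441
```

Under these assumptions the result agrees with the reported Gene Length (1326 bp) and Protein Length (441 aa). The strand does not affect the arithmetic, since the start and end positions are given in replicon coordinates.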
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.923293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCCCGG AAGACGCTGA CTTCGGCAGC CTCGACGAAC TGGCGGTCCA GCTCGCGCCG GGCGTCCTGG ACGAAGCTGT GGCCGAAGCC CGCGAGGTCG CGCGTGGGCA ACTCGCCCGA CTCCTGGCGC GGGAGATCAC CAGAGCCGCC TGCGAGCGGG GGGTGGCGAG CGCGACATGC TCCTCGCCGG CGGGCGCAGG CCGACGGGAG CACAGTGAGC ACCGCCAGGC CCACCGCGAC GGCCCGGTGC CAGCCGGCCA CGGCGGCCCC GGGCCAGGCA GCCGCGACGG GCAGTCGCCC GGGGGCGTCG AGCGGGCCCG GGCGCTTTAC GCCTACGGGA TCGTCCCCGC CGGTACCGAC GTTTCGGGCC TGCCAGGCCT CGCCGAAGGA ACGACTGTCT GTGCCGTCAC TCGCGGCCAG GTCTCGCTCG TGGTCTCGGC CATCGACCCG GAGCTGCTGC GAGACGTCGA GGAGGACCTG TCGGAGACGG GACGTCTCGC CACCCTGGCC CGCGGCCACG ATCAGGTCCT GCGGGAACTG CAGGATCTCG CGCCCGTGCT GCCGCTTCGG TTCGGGACCG TCCTCCCCGG CGAGAGCGAG GCAGCCGTCG TCCTCGACGA TCCCGATACC GAGCTGCCGC GCGCCCTCGA CGCGCTCCGC GACGCGCGCG AGTGGGGATT CCGGATCGAC GCCGCGGGAC CGACCGAACC CCCCGCGAGC GGCGTCTTCC GGTCCACAGG CGCGGGCGAG TCGACCAGGG CCACGCCCTG CGCGGCGGGC GCCGCGGACG CGGGGAACAT CGCTCGTCCG GGTGCGGGCA CCGCGTACCT CTCGGCCCGC CGGGACGAAC TGCGTGAGGA GGAGCGACGA CGCGAGGAGA CAGCCCGGCT GGTCGAGTGG ACGCACCGGG AGCTGCTGGT CCACGCGCGT GACGTGGCCC GCCGTCCTGG CCGGCCGGAC AGGGTCTTCG ACTGCGCTTA CCTCGTCGAC CGCGACGAAG AGGAGGGATT CCTCGACACG GCGGAGCGGC TGGGTCCGCC GCTGGAGGAG GCCGGGTACG TCGCCGCGGT GACCGGGCCG TGGCCGCCCT ACTCCTTCGT CCATCTGACA CTGGGCGGGG ACGGCAGGAG CGGTTCCGAC CGCGGGTCGG GTTCCGCCGG GGCGGGCCCG GCGGAAGCGC CCCGCGCGAC AGCGCCCCGC GCGACAGCGC CCCGCGAGGG AACAACTCGC GTTGCGGCGG AAGCGGCGCC CGTCCGGTTC TCCGTCCCGA CCGGGGATCG GCCGGGTGAC GATCCAGAGT TCGAACCAGG TGAGGAGCGT GGCTGA
|
Protein sequence | MAPEDADFGS LDELAVQLAP GVLDEAVAEA REVARGQLAR LLAREITRAA CERGVASATC SSPAGAGRRE HSEHRQAHRD GPVPAGHGGP GPGSRDGQSP GGVERARALY AYGIVPAGTD VSGLPGLAEG TTVCAVTRGQ VSLVVSAIDP ELLRDVEEDL SETGRLATLA RGHDQVLREL QDLAPVLPLR FGTVLPGESE AAVVLDDPDT ELPRALDALR DAREWGFRID AAGPTEPPAS GVFRSTGAGE STRATPCAAG AADAGNIARP GAGTAYLSAR RDELREEERR REETARLVEW THRELLVHAR DVARRPGRPD RVFDCAYLVD RDEEEGFLDT AERLGPPLEE AGYVAAVTGP WPPYSFVHLT LGGDGRSGSD RGSGSAGAGP AEAPRATAPR ATAPREGTTR VAAEAAPVRF SVPTGDRPGD DPEFEPGEER G
|
| |
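The protein sequence above follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below shows one way to reproduce that relationship without external libraries. It assumes the gene sequence is already given in coding orientation (it begins with GTG and ends with TGA, so no reverse complement is applied even though the gene lies on the minus strand), and that an alternative start codon such as GTG is read as methionine. The names `translate_cds` and `gc_content` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# The amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS given in coding orientation."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An initiating codon is read as methionine regardless of its usual meaning.
    if codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop the terminal stop codon, which encodes no residue.
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustration only: the first 30 bases of the gene sequence above,
# with the gene's stop codon (TGA) appended to form a short complete CDS.
excerpt = "GTGGCCCCGG AAGACGCTGA CTTCGGCAGC TGA"
print(translate_cds(excerpt))  # MAPEDADFGS
```

The excerpt reproduces the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_content` would allow the Protein sequence and GC content fields to be checked in the same way.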