Gene Information |
Locus tag | Franean1_6543 |
Symbol | |
ID | 5674858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7958399 |
End bp | 7959388 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641245392 |
Product | glycosyl transferase family protein |
Protein accession | YP_001510786 |
Protein GI | 158318278 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
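The coordinate and length fields above are mutually consistent, which a short calculation can confirm. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 7958399
end_bp = 7959388

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 990 329
```

Both results agree with the listed Gene Length (990 bp) and Protein Length (329 aa).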
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.118819 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTCG ACGTCTCAGT CCTGATCGTC TCATACAACA CCGGTGAAAT GACGGCGACG TGCCTGGAGT CCCTCGAAGC CACGTCCGGC GGCCTGCGCA TCGAGGTCAT AGTCGTGGAC AATGCCTCGA CGGATGGATC CGCCGAAATC GTCCGTAAAC GTTTTCCCTC GGTCAAGCTG ATCGAGCTGA GTTCGAACAT CGGTTTCGGC CGAGCGGTCA ATCTCGCGGC ATCTAATGCC CTGGGAAACT ATCTGCTCCT GCTCAATCCC GACGCCGTCG TGCTCACTGG CGCGGTTCAG AATCTGTTGG AGTTCGCCCG GGCCAACCCA CGGCACGGCA TCTACGGCGG CCGCACCTTC GACCCGCAGG GCGCCGCCAG CCACACCTCG TGCTTTGGTG CGCCCACCGT CTGGAGTCAC GTCTGCTTCG GCATGGGCCT GTCTACCGTC TTCCGGCGCT CACGTGTCTT CGATCCGGAA TCGCTGGGAC GGTGGGAGCG TGACAGCGTC CGGACTGTGG GCGTGGTGAC CGGCTGCCTG CTGCTCGTTC GGCGGGCACT GTTCGAGCAG TTGGGAGGCT TCGACCCCCG CTTCTTCATG TACGGCGAAG ATGTCGACCT GTCGGTGCGC GCCCGCCGAG CCGGCTGGGA TCCGGTGATC ACGCCCGACG CGGTCGTGAT TCACCACGGT GGCGCGTCGT CGTCCAACTG GACGGGCAAG CATGTCCTGG TGATGAAGGG GAAGACGACG CTCGCACGGG TGCACTGGAC CGGATGGCGT AGCGGGCTGT GCCTGACGAT GCTGTGGCTC GGGGTGACGC TACGGGCGAT GCCCACGGTG GCATCCGGTG GCCGGTCGGC GGGTAGTGGG ACCAGCGACT GGCGTGGTCT GTGGCACCGC AGAGCCGACT GGTGGTCCGG GTACGAGCAG GTCGCACCCG AGCCGGCCGA GACCGGACGC GAAGAAAGCG CGGCCCGGCC ACCGGTGTGA
|
Protein sequence | MAVDVSVLIV SYNTGEMTAT CLESLEATSG GLRIEVIVVD NASTDGSAEI VRKRFPSVKL IELSSNIGFG RAVNLAASNA LGNYLLLLNP DAVVLTGAVQ NLLEFARANP RHGIYGGRTF DPQGAASHTS CFGAPTVWSH VCFGMGLSTV FRRSRVFDPE SLGRWERDSV RTVGVVTGCL LLVRRALFEQ LGGFDPRFFM YGEDVDLSVR ARRAGWDPVI TPDAVVIHHG GASSSNWTGK HVLVMKGKTT LARVHWTGWR SGLCLTMLWL GVTLRAMPTV ASGGRSAGSG TSDWRGLWHR RADWWSGYEQ VAPEPAETGR EESAARPPV
|
| |
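The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so it can be checked against the protein sequence and the GC content field directly. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons). Only the first and last 30 bases of the gene are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the record.
first_30 = "ATGGCTGTCG ACGTCTCAGT CCTGATCGTC"
last_30 = "GAAGAAAGCG CGGCCCGGCC ACCGGTGTGA"

print(translate(first_30))  # MAVDVSVLIV
print(translate(last_30))   # EESAARPPV
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the complete protein sequence and the GC content value in the record.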