Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3663 |
Symbol | |
ID | 5672029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4339543 |
End bp | 4341162 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242546 |
Product | glycosyl transferase family protein |
Protein accession | YP_001507966 |
Protein GI | 158315458 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.870133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.832443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGCAG CAACAACAGC TGATTTCTCC CGCTCCGGAA CCGACCAGGC CGGTTCCGAC CGGCGGCGCC CGCCCGGGTC CGCGGCCGAC GCGCTGCGGA TGCGGCTGTG CCCGCCGATG CCCGGTGACC GGGTGATCGG CTGGGTGGCC GCGCTCGCGG TCACCGCCGT CGCGGGCATC CTGCGGTTCT GGCAGCTCAC CGAGCCGCGG GGCATGAAAT TCGACGAGGT CTACTACACC AAGGACGCCT GGGGCCTGAT GACCTCGGGC TACGAGGTGA ACAGCGAGAC CTGCACCGGC CCCGCCTTCG TGGTGCATCC GCCCCTGGGC AAGTGGTTCA TGGCCGCCTC CGAGAAGATC TTCGGCTACA CCGACTGCGC CGGCGTCGCG CACGGCAGCC CAGAGCTCGG ATGGCGGTTC GCCTCGGCGC TGTTCGGCAC GCTGGCGGTG CTGGTGCTCA CCCGCACCGC ACGGCGGATG TTCCGGTCCA CCGTGCTCGG CTGCTTCGCG GGCCTGCTGC TGACCCTGGA CGGGCTGGAG TTCGTCCAGA GCCGGATCGG CATCCTCGAC ATCTTCCTGA TGACAGGGCT CGTGCTCGCG CTGGCCTGCC TGGTACTCGA CCGCGATCAC GGCCGGGCCG CGCTCGCCGC GCGCGTCGCC GCGGGCCCGC CGTCCGGTGG CGCGCCGTCC AAGGCGACCG AACGCTTCGT CCGCTACGGG CCGCGCGCCG GGCTGCGGCC CTGGCGGATC GCCGCCGGGC TGTGCCTCGG GGCGTCGATG GGCGTGAAGT GGAGCGCGCT CTACACGCTG GTCGGTCTCG CCGCGCTGGC CCTGGCCTGG GATGTCGGTG CCCGGCGCAC CGCGGGCGCG CGGCGGCCGG TGCGCGGGGC GCTGCGCCGC GACCTGCCGG CCTGGTCCGG CTGTTACATC CTGCTGCCGA TCGTGACGTT CCTGGCCACC TGGACGGGCT GGTTCGTCAC CGACGGCGGT TACAACCGGC ACAGGTACGG CGACGGGTTC TTCGCCGCCT GGCACGGCTG GTGGGACTAC CAGCAGGACA TCCTCGACTT CCACGAGCAC CTGAGCGCGC CGCACGTGGC GCAGTCCACG CCGCTGAGCT GGCTGGTGCT CGCCAGGCCG GTGGTCTACG CCTACGACAG CCCGAAGCTC GGCGAGCGGG GTTGCCACGC CGCCGCCGGC TGCTCCCGCG AGGTGCTGGC CCTGGGCAAT CCGGCGGTCT GGTGGGTCGG CACAGCCGCG CTGGTCGCGA TGCTCGCGCT GTGGGTCAGC CGGCGCGACT GGCGGGCCGC CCTGGTACTC GTCGGCTTCG GCTCGTCGTT CCTGCCGTGG CTGGCGTTCC CCAACCGGAC GATGTTCTTC TTCTACGCCC TGCCGTCGCT GCCGTTCCTG ATCCTCGGGA TCACCGCGTC CGCCGGCCTC GCGCTCGGCC CCCGCGACGC GTCGGACACC CGCCGCATGA TCGGGGCGTT GTCGTTCGGG CTCTACCTGG CCGCCGTCGT GCTGATGTTC GCCTACTTCT ACCCGATCCT CGCCGCCCAG ACGATTCCGC TGAGCTCCTG GCGCGACCGC ATGTGGTTCC CCGGCTGGAT CGTCGCCTGA
|
Protein sequence | MTAATTADFS RSGTDQAGSD RRRPPGSAAD ALRMRLCPPM PGDRVIGWVA ALAVTAVAGI LRFWQLTEPR GMKFDEVYYT KDAWGLMTSG YEVNSETCTG PAFVVHPPLG KWFMAASEKI FGYTDCAGVA HGSPELGWRF ASALFGTLAV LVLTRTARRM FRSTVLGCFA GLLLTLDGLE FVQSRIGILD IFLMTGLVLA LACLVLDRDH GRAALAARVA AGPPSGGAPS KATERFVRYG PRAGLRPWRI AAGLCLGASM GVKWSALYTL VGLAALALAW DVGARRTAGA RRPVRGALRR DLPAWSGCYI LLPIVTFLAT WTGWFVTDGG YNRHRYGDGF FAAWHGWWDY QQDILDFHEH LSAPHVAQST PLSWLVLARP VVYAYDSPKL GERGCHAAAG CSREVLALGN PAVWWVGTAA LVAMLALWVS RRDWRAALVL VGFGSSFLPW LAFPNRTMFF FYALPSLPFL ILGITASAGL ALGPRDASDT RRMIGALSFG LYLAAVVLMF AYFYPILAAQ TIPLSSWRDR MWFPGWIVA
|
| |