Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0721 |
Symbol | |
ID | 3903511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 829084 |
End bp | 830688 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878054 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_479834 |
Protein GI | 86739434 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.920452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.491923 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATC CCGCGAGAGC CGGCCAACCG GCCGAGAACG ACCAACCGGC CGACGGGGTG CGGGCGGCGG TCTACAACCG GTTCTGGCAC TCCATGGGCG GCGGGGAACG CCACAACGGC ATGATCGCCC AGGTACTGGC CGCCGAGGGC GCCGTGGTCG ATCTCCTCGG GCACTCCGAG GTCGACCTCG CGGCGCTCGG CGAGCATCTC GGGCTGGATC TCGCCGACTG CCGGTACGTG CGGTTGCCGG ACCGGGGCGA GGAGGCCATC GCCGTCCTCT CCAGACGCTA CGACCTGTTC GTAAACGGTT CGTACATGAG CCGGATCATG CCGAAGGCCC GCCACTCGGC GTACCTGTGC TTCTTCCCGA CGCCGTTCGA CCACGACATG GCGGCCTGGC GCAAGGCCGC GGTCCGCACG GCCGGGCCGC TGCTGCGCGG GGTTCGCCCA GCGGTGAGCT TCGGCCAGGG CTGGTACCCG CCGGAGGGCG GCCGCCGCCG GCAGTGGACC TGGACGAACG GGGCGGGCAT CCTCGCCGTC AGCCCGGGCA AGCGGCGCAC CCTGCGCGCC GATATCGGCC GGCCCGGCGC GCCCGTCGGT ACGGCGTTGC GCCTCCTCGA CGCCGACGGC GGCGTGCTGG CCAAGCTGCG GATCGGCACC GACTTCACCC CGTTCGAGGT ATCGCTGCCG CCGTCGGCGA ACGGTACCGA GCTCACCCTG GTCAGTGACG CGTTCTCGCC AGGCGCGGCG GACGTGCGCG AACTCGGCGT GGCGGTGAGC CGCCCCCGGG TCACCGATGG TGGCGAGGGG CCGATCGCCC GGCTCCCGCT GCGTTTCCCC TGGCTGCTGC GGGACCCGGC CGACCTCGGC TACCTCGACG GCTACGACGT GGTCATGGCC AACTCGCGGT TCACCCGCGG CTGGATCCGC CGGTTGTGGA AGCGCGACGC CGACCTGCTG TTCCCCCCCA TCCAGGTGGA ACGGTTGCAT CCGGCGCCGC GGCGGGAGAA GGCCGTCGTC ACCGTCGGCC GGTTCTTCGC CCCCGGCCTC GGCCACGCCA AGCGACAGCT GGAGATGGTG CAGTGGTTCG GCGACCTGTA CCGTGCGGGC AACCTGCCCG ACTGGACGAT GCACGTCGTC GGCGGCTGTG AGGACTCTCA ACTGCCCTAC CTGGAACAGG TCCGGGCGGC CGGCGCGGGG CTGCCGGTGG AGATCCATCC CAATGCCCCG CGCGCCGAGG TCGAACGGCT GCTCTCGACC AGCTCGGTGT TCTGGTCGGC CACCGGGTAC GGCGAGGACG ATGACCGGCG TCCCTGGACG GCGGAGCACT TCGGGATGAC CACCGTCGAG GCGATGGCCG GGGGCTGCGT TCCCGTCGTC ATCGACCGGG CCGGCCAGCG AGAGATCGTC CGGCACGGAA TCGACGGCTA CCGATGGACC GGCCCGGAGC AGGTTGCCTC CTTCACTCGC CGGCTCGCCG CCGAGGACGG TCTACGCGGT CGGCTCGCCG CCGCGGCGAT CGAACGCGCC CAGACCTTCT CCGATGCGGC GTTCGCGCGG CAATGGCGGG AGATCGCCAT CCGGCACGGG TTGTACGAGC GGTGA
|
Protein sequence | MNDPARAGQP AENDQPADGV RAAVYNRFWH SMGGGERHNG MIAQVLAAEG AVVDLLGHSE VDLAALGEHL GLDLADCRYV RLPDRGEEAI AVLSRRYDLF VNGSYMSRIM PKARHSAYLC FFPTPFDHDM AAWRKAAVRT AGPLLRGVRP AVSFGQGWYP PEGGRRRQWT WTNGAGILAV SPGKRRTLRA DIGRPGAPVG TALRLLDADG GVLAKLRIGT DFTPFEVSLP PSANGTELTL VSDAFSPGAA DVRELGVAVS RPRVTDGGEG PIARLPLRFP WLLRDPADLG YLDGYDVVMA NSRFTRGWIR RLWKRDADLL FPPIQVERLH PAPRREKAVV TVGRFFAPGL GHAKRQLEMV QWFGDLYRAG NLPDWTMHVV GGCEDSQLPY LEQVRAAGAG LPVEIHPNAP RAEVERLLST SSVFWSATGY GEDDDRRPWT AEHFGMTTVE AMAGGCVPVV IDRAGQREIV RHGIDGYRWT GPEQVASFTR RLAAEDGLRG RLAAAAIERA QTFSDAAFAR QWREIAIRHG LYER
|
| |