Gene Information |
Locus tag | Francci3_3680 |
Symbol | |
ID | 3905364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4414980 |
End bp | 4416350 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881006 |
Product | putative pep2 protein |
Protein accession | YP_482761 |
Protein GI | 86742361 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02457] trehalose synthase-fused probable maltokinase |
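The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Francci3_3680.
n_bp = gene_length(4414980, 4416350)
print(n_bp)                  # 1371
print(protein_length(n_bp))  # 456
```

Under those assumptions both results agree with the Gene Length (1371 bp) and Protein Length (456 aa) reported in the record.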
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGACC ACCACGCGGA ACTGACCGGC CTGTTGCGCG AGTGGCTCCC CCGGCAGCGC TGGTTCGCCG GGAAGGGCAG ATCCGGGGGC GGCCTGCGGA TCCGTCACGA CGTCATGGTC TTCGAACCGC TACGGCTGCT CGTCGTTGAC GTCGACTACG ACCACGGTCC GTCGGACCAT TACCAGGTTC CGGTGGTCAT CCGCTCGGAC GCGCCCTTCG GCCACGAGGG CTTCCTGATC GGCGAGAGTG CCGGCGGCCT GGTCTACGAC GGCCTGCACG ACCCCGATGG CAGCTCGGCC CTGCTGAGTT TCCTCCGCCG GTCGACACAC CGCGACGGCC TCGTCGCCGA GGCGCTGGAA CCACTGGACA TCCTGCCCGC CCACGCCGTC GGCGCCGAGC AGTCCAACAC CTCGATCGTC TACGGCGACG CCTACATCCT CAAGGTGTTC CGGCGGCTCT GGCCGGGCGT CAACCCCGAT CTGGAGGTGA CCCGGGCGCT GGCGGAGGCG GGCAGCACAC ACGTGGCCCG CCCCGTCGCC TGGTACTCCG GCCGTGTCGA CGGGACACCG ACCACCTTCG GATTCCTCCA GGAGTACCTG CGCAGCGGTA CCGAGGGCTG GCGACTGGCG CTGGCCAGCG TGCGCGACCT GTACGCCGAG GCCGACCTGC ACGCCGACGA GGTCGGCGGC GACTTCGCCG CGGAGGCCGA GCGGCTGGGT ACCGCCACCG CCGAGGTGCA CGCCGACCTC GCCCGGACGC TGCCGACGAG GCCCGCGACC GCCGATGCCC TGCGCTCCCT GGTGACCTAC CTGCATGGCC GGCTGGATGC GGCGGTCGAG GCCGTCGCCG AGCTGCGTCC CTTCGAGGCG GCGATCCGCA AGAGCTACGA CGAGGTGACG GGGGCGTCGG CGCAGGCCGT CCATCCCCGG CCCTTCCAAC GGCTACACGG CGATCTGCAC CTGGGTCAGG TCCTGCGGGT CGACTCGGGC TGGGTGCTGT TCGACTTCGA GGGGGAGCCA GTCCGCCCGG TCGCCGACCG TGTGGGGCTG GAATCGCCGC TGCGCGACGT CGCCGGGATG CTGCGCTCGT TCGACTACGC GGCCCGGTCG ATGCTGCTCG AACGCGGTGA CGAGCCGTCG TTGGCCTACC GCGCCCAGGA GTGGGCAGAT CGCAACCGGG AGGCGTTCTG CCGGGGCTAC GGTGCCGCGG CCGGCCGCGA TCCTCGTGCG GATGCCGTGG TGCTGCGTGC GTTCGAGCTC GACAAGGCCG TGTACGAGAT CCTGTACGAG GCCCGCCACC GACCGGGGTG GATCGGGATC CCGCTGTCCT CGGTGGAACG ACTGACCCGG TCCAGCCCGT CGGCGGGGTA A
|
Protein sequence | MGDHHAELTG LLREWLPRQR WFAGKGRSGG GLRIRHDVMV FEPLRLLVVD VDYDHGPSDH YQVPVVIRSD APFGHEGFLI GESAGGLVYD GLHDPDGSSA LLSFLRRSTH RDGLVAEALE PLDILPAHAV GAEQSNTSIV YGDAYILKVF RRLWPGVNPD LEVTRALAEA GSTHVARPVA WYSGRVDGTP TTFGFLQEYL RSGTEGWRLA LASVRDLYAE ADLHADEVGG DFAAEAERLG TATAEVHADL ARTLPTRPAT ADALRSLVTY LHGRLDAAVE AVAELRPFEA AIRKSYDEVT GASAQAVHPR PFQRLHGDLH LGQVLRVDSG WVLFDFEGEP VRPVADRVGL ESPLRDVAGM LRSFDYAARS MLLERGDEPS LAYRAQEWAD RNREAFCRGY GAAAGRDPRA DAVVLRAFEL DKAVYEILYE ARHRPGWIGI PLSSVERLTR SSPSAG
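As a quick consistency check between the two sequences above, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch using only the standard library: the codon table is built from the usual NCBI amino-acid string, which translation table 11 shares with the standard code (the two differ only in permitted start codons), and the `translate` helper is a hypothetical name introduced here for illustration.

```python
from itertools import product

# Amino acids for codons in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 21 bases of the gene sequence listed above.
print(translate("ATGGGTGACC ACCACGCGGA ACTGACCGGC"))  # MGDHHAELTG
print(translate("TCCAGCCCGT CGGCGGGGTA A"))           # SSPSAG*
```

The two outputs match the first ten residues and the final six residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.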