Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2252 |
Symbol | |
ID | 3905020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2629297 |
End bp | 2630706 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637879583 |
Product | extracellular solute-binding protein |
Protein accession | YP_481349 |
Protein GI | 86740949 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00844632 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0063434 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAGCCG TTTCCCGGCA CGACGATGCT GCTGGACATG AACGGGACGC AGGACAAGAA CGGGACGCAG GACAGGAACG CCCGGGCCGC CGTCCGCGGC GGCACCGGGC CCGGCTGGTC GTCACGGCGT TGTCCGCCGC GGTCGGCCTG CTCGCCGTGG CGGCCTGCGG CTCGGACAGC GACACCGCCT CCGGTACCCC GGCCGCGACA CCGCAAGGTG ACACCCCGGC GACCATCACC TTCCTGTCCT ACAACTACGG CACCCCGGGT CTCGGTGGTA CGGGAACCCA GGCACTGCTC GACGCCTTCG CCAAGGCGCA CCCGAAGATC ACGGTCAAGC CGCAGGGCGT CGCGGTGAAG GACGTCCTCA CCCGGCTGCG CACCGACACC GCCGCCGGTG ATCCGCCCGA CGTCGCCCAG ATCGGCTGGA GCAAGATGGC CGAGGCGGTC GACGCCCTGC CTATCACCCC GGTCCAGAAG GTCGCCGGCA GCGAGTGGGA GTCGGCCACC GCCGGCATCT CGAAGAGCAT CCTGTCGGCC GTCTCCACCA ACGGCGTCGT CGCGGCGATG CCGTTCACGA TGTCGATCCC GGTCATGTAC TACAACGCCG ACCTGTTCCG CGCCGCCGGC CTGGACCCGC AACACCCGCC GACGACCCTC GCCGACGTCA AGGCCGCCGC TTTGAAGATC AAGGCGACCG GTAAGCAGGG CGTCTACATC AGCGTCGTCG ACAGCGGGAA GTCGGACTAC CTGACCCAGT CGGTCGTCAA CTCCAACGGC GGCTCGCTGG TGGACAAGAA CGGCGGCGTC ACCCTCGACA AGCAGCCGGC CGTCGAGGCG CTGGCCACGA TCGCCGACCT GACCGCCTCG GGTGCCCAGC CCGGAGTCAA GGCCGAAGCG GCCCTGGCCG CGTTCACCAA GGGTGACCTC GGCATGTTCG TCACCAGCAC GGCGCTGCTC GCCAGCGCCC AGAAGGCGGC GGCGGGCAAG TTCGAGCTGC GCACCGCGGG TCTGCCGTCC TTCGGCACCA AACCCGCCCG CCCGACCTAC TCCGGCGCCG GGCTCGCGGT GCTGGCCAAG GACCCGGCCA AGCAGCGCGC CGCCTGGGAG TTCATCAAGT TCCTCACCTC CGACGAGGGC TTCGAGATCA TCACCTCGAA GATCGGTTAC CTGCCGCTGC GACAGAGCGT GGCGACGAAG CTCGCCGGCA CCCCGATCGT GAAGCTGCTG GAACCGGCCC TCGACCAGCT CGACACCGTC ACCCCCTACA CCTCGTTCCG CGGGGCGAAG GCCAACCAGG CCGTCGTCGT GCTGCAGGAC GAGGCCGTCG AACCGATCGT CCTGCGCGGG GCCGATCCCC AGGCGACCCT GAGCAAGGCC GCCGAGAAGA TCCGCGCACT CTCCTCCTGA
|
Protein sequence | MTAVSRHDDA AGHERDAGQE RDAGQERPGR RPRRHRARLV VTALSAAVGL LAVAACGSDS DTASGTPAAT PQGDTPATIT FLSYNYGTPG LGGTGTQALL DAFAKAHPKI TVKPQGVAVK DVLTRLRTDT AAGDPPDVAQ IGWSKMAEAV DALPITPVQK VAGSEWESAT AGISKSILSA VSTNGVVAAM PFTMSIPVMY YNADLFRAAG LDPQHPPTTL ADVKAAALKI KATGKQGVYI SVVDSGKSDY LTQSVVNSNG GSLVDKNGGV TLDKQPAVEA LATIADLTAS GAQPGVKAEA ALAAFTKGDL GMFVTSTALL ASAQKAAAGK FELRTAGLPS FGTKPARPTY SGAGLAVLAK DPAKQRAAWE FIKFLTSDEG FEIITSKIGY LPLRQSVATK LAGTPIVKLL EPALDQLDTV TPYTSFRGAK ANQAVVVLQD EAVEPIVLRG ADPQATLSKA AEKIRALSS
|
| |