Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2767 |
Symbol | |
ID | 3906478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3261053 |
End bp | 3262720 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637880090 |
Product | extracellular solute-binding protein |
Protein accession | YP_481856 |
Protein GI | 86741456 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.26917 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCC CACGTCCCCG ATTCCACCCC GCCGTCCGCG TCGCGGCCGT CGCGGCCGCC GCCGCCATGC TCGGCGCCTG CGCGTCCGAT CCGGCACCCG CCGGCTCCAC CAGCGCGAAC GGGGCGGGCG GAACACCGCA CCGGGGCGGC ACGCTCACCT TCGCGGTGGC CACCGAGGGC GACTGCCTCG ACCCGCACGT CAGCCCGGCC GACGTCACCG CCGTCATCCA GCGGGGTGTG TTCGACTCGC TCGTGACGCA GCAGCCGGGC GGCGGGGTCG GCCCGTGGCT GGCCACGTCG TGGACGGTCT CAGCCGACGC GAAGACGTTC ACCTTCAAGC TCAAGGACGG CGTGACGTTC CACGACGGCA CACCGTTCGA CGCGGCGGCG GTGAAGGCGA ACTTCGACCA CATCGCGGCG AAGTCAACGA AGTCGCAGTA CGCGGTGGCG CTGCTCGGCC CCTACGTCGG CACCGAGGTC GTCGACCCGC ACACGGCCAA GGTCACCTTC TCCAAGCCGT ACGCGGCATT CCTCGCCGCG GCCGCCACGA CCTACCTCGG CATCGAGTCG TCGAAGAAGA TCGCCGACGC GCCGGACACC CTCTGCTCGG GCGGGCCGAA CTCGGTGGGT ACCGGGCCGT TTCGCTTCAC CCGTTATGTG AAGGGATCGG TCGCCGAGTT CACCCGCAAC GACGGCTACA CCTCGGCCCC CGGCGGCGCG AAGCACACCG GCGCCGCCTA CCTCGACAAG CTGGCCATCC GCTTCCTGCC CGAGGACGCG ACGAGGGTGA GCGCGCTGCG CAGTGGGGAG ATCGACGCCG CCGACGGCGT GCCGGCGCAG AACGTCGCCG CGATCGAGGG CGATACCTCC CTGCGGCTAC TCAAGGTGAT TCCGCAGAAC GCCAACTACT CGATCTACTT CAACACCAGG CGTGCGCCGT TCTCCGACGA GCGGGTCCGT AAGGCCGTCC AGCGCGCCGT CGACGTCGAC ACGATCGTGA AGACGGTCTA CGCCGGCCAG TACGCCCGCG CGTGGAGCAC GCTGACGCCA AAGAACATCG CCTACGATCG CAGCCTCGAG AAGGCCTTCC CCCATGACGT CACCGCGGCG AACAAGCTAC TCGACGAGGC CGGCTACACC GGCCGCGACG CCGACGGCTA CCGCACGAAG AACGGCGCGC GTCTCGTCCT CGACTGGCCC TACGTCGGCG CCTTCAACCG CGAGCAGCGC GACATCGTCA GCCAGGCGGT ACAGGCCGAC CTGAAGAAGG TCGGCATCGC CACCAAGCTC TACTCGATGG ACTCGGGCGC CTACACCACC CAGCGCAACG CCGGCGGCTA CGACCTGATC GCCTACAGCT GGGGCAAGTC CGACCCCGAC CTGCTCCGTA CACTGTTCGC CTCGAATCTG GCGTTCACCC AGGGCGGGGC GAACAGTTCC GGGATCGCGG TGCCGGAGGT CGACGACTGG CTCGCGACCG GCGCCGGCAC GACGGATCTC GCCGCCCGCA AGAAGGCCTA CGGGCAGGTG CAGAAATATG TGATCGACCA CGCGTACACC CTGCCGATCT ACGTGTTCAC CCGGACCGTC GGCACGGCGT CGAAGGTCCA CGACATCACC TTCGACGCGG ACGCCTTCCC GCTGTTCTAC GACAGCTGGG TGGGCTGA
|
Protein sequence | MKRPRPRFHP AVRVAAVAAA AAMLGACASD PAPAGSTSAN GAGGTPHRGG TLTFAVATEG DCLDPHVSPA DVTAVIQRGV FDSLVTQQPG GGVGPWLATS WTVSADAKTF TFKLKDGVTF HDGTPFDAAA VKANFDHIAA KSTKSQYAVA LLGPYVGTEV VDPHTAKVTF SKPYAAFLAA AATTYLGIES SKKIADAPDT LCSGGPNSVG TGPFRFTRYV KGSVAEFTRN DGYTSAPGGA KHTGAAYLDK LAIRFLPEDA TRVSALRSGE IDAADGVPAQ NVAAIEGDTS LRLLKVIPQN ANYSIYFNTR RAPFSDERVR KAVQRAVDVD TIVKTVYAGQ YARAWSTLTP KNIAYDRSLE KAFPHDVTAA NKLLDEAGYT GRDADGYRTK NGARLVLDWP YVGAFNREQR DIVSQAVQAD LKKVGIATKL YSMDSGAYTT QRNAGGYDLI AYSWGKSDPD LLRTLFASNL AFTQGGANSS GIAVPEVDDW LATGAGTTDL AARKKAYGQV QKYVIDHAYT LPIYVFTRTV GTASKVHDIT FDADAFPLFY DSWVG
|
| |