Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2423 |
Symbol | |
ID | 3906406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2814377 |
End bp | 2815408 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637879753 |
Product | extracellular solute-binding protein |
Protein accession | YP_481519 |
Protein GI | 86741119 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0450755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.051876 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATCA TCCGGGTCGC GACGGCGCGA GCGGTGACGG TACTGATTAC CGTCGCCCTG GCGGTCGGCG TCACCGCCTG CGGCACGGTC GACAGTCCGC TGCCGGCCCA GGCCCGACCC CTGGCCCCCG CGACGCCGAC GCCCCCGCGG CCCGCCGCCC TCACGCCTGC CGCCCTCACG CCTGCCGCCG CGGCCGTGAC GCCGACACCC ACCTGCGACG ATCCGCAGGC CAGCTGGCGC CCGCCGGGCC GGCTGCCCTC GCCGGGCGCG ATGCCCACGG GCACCGTCCT GGAGACGATC GAACGCCGCG GCTACCTCAT CGCCGGGGTG CTGGCCGACG TCCCGCCCTT CGGGTCGATC AGCCCGTTCA CCGGCCAGTT CGAGGGCTTC GACGTGGAGA TCGCCAACCT GGTCGGCCGG AGGATCTTCG GCGCGGACGG ACACGTGCGG TTCCGCGCGG TCACCTACGC CGAGCGCATC CCGGTCCTGC GCGACGGCGC CGTCGACGTC GTCGTGGCGA CCATGACGAC GAACTGCGAG CGGCGCGCGC TGGTGGACTT CTCCGCCGTC TACTACAACG AGACGCAACG GGTCCTGGTC CCCCGCGACT CGCCGTACCA GGGGATGGAC GATCTCGGTG GGCGACGGGT CTGCACGGCG GCCGGCGCGG CGGCCGGCGC GGCGGCGACC ATCCGGCGGG CACCGTCGCG CCCGGTGCTG CGCACCGTGC CGAACATCGC CGACTGCCTC GTGCTGCTCC AGGCCGGGGA GGTCGACGCC GTCATGACCA CCACGGCCAT CCTCAACGGG ATGGCCGCCC AGGACCCGCG GCTGTACGTT GTCGGACCGG CCCTGTCGGA CGAGCCGGAC GCGGTCGCCG TCAGCCTCGA CCATCCGGAA CTGACCCGCT TCGTCAACGG CGTGCTGGCC CGCGCGATCG CCGACGGCAC CTGGAAACGG CTGGCCCGGC GGTGGCTGTC GGCACCGTTC GCCCCGCCGC CGACGCCACC GGTCGCCCGC TACCGGGACT GA
|
Protein sequence | MMIIRVATAR AVTVLITVAL AVGVTACGTV DSPLPAQARP LAPATPTPPR PAALTPAALT PAAAAVTPTP TCDDPQASWR PPGRLPSPGA MPTGTVLETI ERRGYLIAGV LADVPPFGSI SPFTGQFEGF DVEIANLVGR RIFGADGHVR FRAVTYAERI PVLRDGAVDV VVATMTTNCE RRALVDFSAV YYNETQRVLV PRDSPYQGMD DLGGRRVCTA AGAAAGAAAT IRRAPSRPVL RTVPNIADCL VLLQAGEVDA VMTTTAILNG MAAQDPRLYV VGPALSDEPD AVAVSLDHPE LTRFVNGVLA RAIADGTWKR LARRWLSAPF APPPTPPVAR YRD
|
| |