Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1641 |
Symbol | |
ID | 3905920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1973361 |
End bp | 1975127 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637878979 |
Product | extracellular solute-binding protein |
Protein accession | YP_480746 |
Protein GI | 86740346 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0558371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGTGC TCGCCACGAT CCTGCTCGCC ACGGTCCTGG CTGCCACGGT GACCGCCTGC AGCGGTGGCT CGGGGGACGG ATCGCGTGAG CTCGCTGCCA GTCCCACCCC GGCGACGACC ACCACGCCCC CTCCGTCCGG CAAGCCGGGC GGCACGCTGC GGATCGTCAC CCAGTGGATG CCCAGTGGCG ATCCGGGCTG GGCCGACCAG CCCGGCGAGC GGGCGGTCAG CCGGCTCGTG ACCCGCCAGC TGTACAGCTA TCCCTCCGAC GAGGACACGA CGAAGTCGAC CATCCCGCGG CCGGACCTCG CCGTCGGCGC GCCGGTCGTC ACCGAGAACG GGCTGGTCTA CACGGTGCGG CTGCGTCCCG CGGCGCGCTG GGACACCCCC AACCAGCGCC GGATCACCGC CAACGACGTC GCCCGCGGCA TCAAGCGACT GTGCACGCCC CCGAACCCGT CACCGCTGCG CGGCTACTTC ACGGCGACCA TCGTCGGATT CCGGGAGTTC TGTGCCCAGC TGGCGGCGAC CCCGGTCGCC GACGCCGCGG CCTTCGTCGA GAGCAGCACC GTGGAGGGCA TCGAGATCGT CGGCGACGAC ACGCTCGCGT TCCACCTACT GGCCCCGGTG AACGACTTCG TGGACGTGCT GGCGCTGCCG GCGGCCTCCC CGGTGCCGCT GGAGGCCCTG GCCTACCCGC CGGACTCGCT GGAGTACCTG CACAACCTGG TCTCGGCCGG GCCGTACCGG TTCACGGTCG CCCCCGGCGA GGGGTACCGG CTGTCGCGCA GCCCATCGTG GAGCGCGTCC TCGGACGGGA TCCGCCGCGC CCTGCCCGAC CATATAACGA TCTTCGACGG CCTGAGTCCG GAGGCCATGC AGCAGGAGCT GGAGAGCGGC GACGCGGACA TGTCGCTCGA CGGGAAGATA CCCGACAGCC GGGCCGTGGA GCTCGCCAGG GCGAACGACC CCCGTCTCGT GGTCGACGGG GTCGGGGTCA CCCTGGCGCT CACCGTCGGA TTCAATGGGC CCTCCGCGGC CGCGCTGCGT GAGCTGTCGG TCCGCCGGGC GCTGCCCTAC TGCATCGACC GGGTGAGCCT CGCGGCGGCG CTCGGCGGCC CCGAGTTCGC CGCTGCCGCC ACCGGGTTGC TGCAGGAGAC GATGACCGGC TACACCGACG CGGATCCCTT CCCGAGCCCG GCGGGACTCG GCGACGCGGC CCGTTGCCGG GAGGCGCTGA GCCATACCCC CGGCGGACCC GTCACGGCGC TGTCCCTGCT CACCACGGAC AGCGCCACCG ACGTGGCGGC CGCCGAAGCA CTGCGGACCG CCTTCGCCCG TTCCGGTATC CGCCTCGACA TCCGGATCCG GACCGGCGAG CGCTACCGTG CGGCAGCCGT CCACCCGACC GGGCAGTTCT GGGACCTGGC GCTCACTACT ATCGCCCCGG ACTGGTACGG CGACGCGGGC CGCACCGTCT TCCAGCCGTT GCTGGACGAG ACGTGGGCCG GCCCCAGGCC GGCCGACGGC GGCTACCGGG ATCCGGGCGC CCTGCATCTG CTCGCCACGG CGCTGCGGGC CACCAGCGAG GCGACCGCCG CGAGCAACTG GGCCGACCTG GAGCACACGC TGGTCGAGCA GGTCGCGGTG ATCCCGCTGG CCGTCGTGCA CACCCCGCAG TTCCACAGCA CGAACGTCAC CGCGTTCACG ATCGTGCCGT CGATCGGCAC CGCCGATCCG ACCGCGGTCT CGCTGGGGTC CGGATGA
|
Protein sequence | MSVLATILLA TVLAATVTAC SGGSGDGSRE LAASPTPATT TTPPPSGKPG GTLRIVTQWM PSGDPGWADQ PGERAVSRLV TRQLYSYPSD EDTTKSTIPR PDLAVGAPVV TENGLVYTVR LRPAARWDTP NQRRITANDV ARGIKRLCTP PNPSPLRGYF TATIVGFREF CAQLAATPVA DAAAFVESST VEGIEIVGDD TLAFHLLAPV NDFVDVLALP AASPVPLEAL AYPPDSLEYL HNLVSAGPYR FTVAPGEGYR LSRSPSWSAS SDGIRRALPD HITIFDGLSP EAMQQELESG DADMSLDGKI PDSRAVELAR ANDPRLVVDG VGVTLALTVG FNGPSAAALR ELSVRRALPY CIDRVSLAAA LGGPEFAAAA TGLLQETMTG YTDADPFPSP AGLGDAARCR EALSHTPGGP VTALSLLTTD SATDVAAAEA LRTAFARSGI RLDIRIRTGE RYRAAAVHPT GQFWDLALTT IAPDWYGDAG RTVFQPLLDE TWAGPRPADG GYRDPGALHL LATALRATSE ATAASNWADL EHTLVEQVAV IPLAVVHTPQ FHSTNVTAFT IVPSIGTADP TAVSLGSG
|
| |