Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0449 |
Symbol | |
ID | 5668871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 529072 |
End bp | 530640 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641239381 |
Product | extracellular solute-binding protein |
Protein accession | YP_001504819 |
Protein GI | 158312311 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGACG GCGCATTATC CCGCAGGCAG GTACTCGGCG CCGCACTCGC TACGGTGGCA CTCGCCGGCG CGAGTTCGTG CGCCGGCGAG GAACAATCGT CCGGCGACGG TGGCAGCGGA CGGCCGTCCG CGTCGCGCGA ACAGACATTG TCCCTTGCCA TCCAGGCCAC TCCGAATTCC TTCGATCCCG CTGAACTGAC AAGCGGCCAG TCCTCGTTCG TGTGGAGCGC CCTCTACGAC ACCCTGATAT GGAAGGACAA TAAGGGCAAG TTACAGCCCA ACGCGGCAGA GAGCTGGAAC TACTCCGACG GCGGGCGCAC ACTGACATTG AAGCTGCGCA AGGGAATGTC CTTCAGCTCG GGTGCCCCGG TGAACGCGGT CGCGGTGAAG ACCACCCTCG AGCGGAGCAA GAACACGCCA GGATTCACCG ACCAAGTCCT CGGCGCGCTC GAATCGGTCG ACGCGCCCGA CGACCGCACC GTCGTCCTCC GACTCTCCCA TCCGGACGGC GCACTGCTGG ACTCGCTGGC AGTGAGCGGC GCGGGCGTGA TCGGCGATCC AGCGACGCTG AACGACAAAC GTACCGCGCT GAACCCGGTC GGTTCGGGCC CATACGTCCT GAACACCGGA CAGACGGTGA ACGGATCGAC CTACGTGCTC GATCGCCGCG AGGACTACTG GAACGTTCAG GCGTACCCGT TCAAGACCGT CAAAATCTCG GTCATCCGGG ACCGAACCGC TGCCCTCAAC GCCCTGCAGG CGGGTGAGGT CAACGCCGGT ACCGTCGAGG TGACAAACGT GGACCGGCTG CGGGCGGCCG GCTTTGACGC CGCCGTCGTC GAGGCCCACT CGCTGGCCTC GCTGGTTCTC GCCGACCGTA CAGGGGAGTC GCTCAAACCG CTGGGCGATC CACGGGTCCG GCAAGCCATC AACATGGCGT TCGACCGCGA GAAGATCGTC GAACAGCTGC TCAGGGGCTC GGGTAAGCCG ACCGAGCAGG TGTTCAACCC CAAGGACCCG GCGTATGACC CGGCACTGAA CACGACGTAC GCCTACGATC CACAGCGCGC GAAGAGACTG CTGGCCGAGG CCGGATATCC CAACGGATTC TCGGTAACGA TGCCGGAATT TTTCTTAGCC AAGTCGTTCG CACCGACAAT CACCCAGTCC CTGGCCGCTA TCGGAATCAC GACGACGTGG GAACCGGTTC CCCCACAGCA GACCGACGCG GCGATCAGCT CGAAGAAATA TCCGGCGTTC TTCCTAATTG CCGGGCTGGA GACGACTGCG GGTGACGCGT CCAGATATTT CTCCAAGGAC GGAGCGTTCA ACCCCTTCCA CGCGGAGGAT CCGGACCTCA CGCCACAGGT GGAGCAGGCG ACTCAGACAA TTGATCCGCG GCAGGCAGCC GATGCCTACA GGCATGTCAA CGCCACCGCG GTCCGGGATG CGTGGAACGC CCCCCTCTTC TACGTCGCGG TCCACTGGGT AACCAAAAAA GGCATCACCT ATCTCGGTGA CGGCTCGCTG ACGTTCAACA CCGTTCGCGC CTTCGGCCTG TCCGGATAA
|
Protein sequence | MIDGALSRRQ VLGAALATVA LAGASSCAGE EQSSGDGGSG RPSASREQTL SLAIQATPNS FDPAELTSGQ SSFVWSALYD TLIWKDNKGK LQPNAAESWN YSDGGRTLTL KLRKGMSFSS GAPVNAVAVK TTLERSKNTP GFTDQVLGAL ESVDAPDDRT VVLRLSHPDG ALLDSLAVSG AGVIGDPATL NDKRTALNPV GSGPYVLNTG QTVNGSTYVL DRREDYWNVQ AYPFKTVKIS VIRDRTAALN ALQAGEVNAG TVEVTNVDRL RAAGFDAAVV EAHSLASLVL ADRTGESLKP LGDPRVRQAI NMAFDREKIV EQLLRGSGKP TEQVFNPKDP AYDPALNTTY AYDPQRAKRL LAEAGYPNGF SVTMPEFFLA KSFAPTITQS LAAIGITTTW EPVPPQQTDA AISSKKYPAF FLIAGLETTA GDASRYFSKD GAFNPFHAED PDLTPQVEQA TQTIDPRQAA DAYRHVNATA VRDAWNAPLF YVAVHWVTKK GITYLGDGSL TFNTVRAFGL SG
|
| |