Gene Information |
Locus tag | Franean1_5378 |
Symbol | |
ID | 5673711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6486899 |
End bp | 6488623 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641244235 |
Product | selenium-binding protein |
Protein accession | YP_001509641 |
Protein GI | 158317133 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
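The coordinates and lengths in the Gene Information table agree with each other if the start and end positions are read as 1-based, inclusive coordinates. A minimal Python sketch of that check follows. It assumes inclusive coordinates and a final stop codon that is not counted in the protein length; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codons in an unspliced CDS, minus one for the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the table above for locus tag Franean1_5378.
length = gene_length_bp(6486899, 6488623)
print(length, protein_length_aa(length))  # 1725 574
```

Both results match the "Gene Length" and "Protein Length" rows of the record.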
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGACT GGCATGCGGC AGGCGCGGAT GAATCGGACT TCCGGACCCC ACGGCCACGG TGGTCACGTT GTGCACAGGG CTTGGCCTTA CCGTTTTCCC GGCTCGTCTC CCACACCAAA CCGCTCCGCC GCCAATCGGT GAGACTTACG CACCGAATCC GAGGCCAGCC CCCGGTTTCA GTATTTTTAA CCGGGGCAAT GAAGATACCT GGACAGGCTG ACCTCAAATC GCGGACGGAC GTGGTTGGAC GGCCTTGCGA GCTGGACCGC CCTGGTCGTC CCGATCGATC CTGGTCGTCC CGATCGAGGA GGGTAGCTGT GAGCAACCCG GATCCGACTT TCTACCGCTC ACCCGGCGAC GCTGCGGCGG CCCCTCCGGA GCAGCTGGCC TATGTCGCCG CTTTCGATCC GAAGGGAGCC GTATCGGATG CGATGACCGT CGTCGACGTT GATTCCACCT CGTCCTCCTA TGGCCGGATC GTCGGTTGGA CCGACCTATC GGGACGCGAC GACGAATTGC ACCACTTCGG CTGGAACGCC TGCAGCAGCG CGCTCTGCCA CGGGCACCAC GGACACGCCG CCAACCTCGA GCGACGTTAT CTCGTCGTCC CGGGGCTGCG ATCTTCCCGG ATCCACATAC TGGATACCCG GCCGGATCCC CGCAGCCCGC GCGTGGTGCG CACGATCGAG GCCGCCGAGC TCGCCACCAA GGCCGGTTAC TCCCGGCCGC ACACCGTCCA CTGCGGGCCG CATGGTCTTT ACCTGTCGGC GCTGGGCGGG GCAAACGGCT CCGATGGACC CGGCGGGGTG GCGCTACTCG ATCATGACAC GTTCAACGTG GTCGGGCCCT GGGAACGCGA CCGCGGCCCG CAGTACCTGG CGTACGACGT GTGGTGGCAC CTCGCGCAGG GAGTGGCGGT CACCTCCGAG TGGGGTACGC CGTCAATGAT CGAGGACGGG GTCAACGCCG AACTCCTGCT CGGCCGGGAG TACGGCCACG CTCTGCACTT CTGGGACCTC GACAGCGGTC GCCACCGGCA GACCGTCGAT TTGGGTGACG ACAACCAGAT GGTGTTGGAG GTGCGCCCGG CCCACGATCC GCGTGCCTCC CACGGCTTCG CCGGCGTCGT CACAAACGTC ACGGATCTTT CCGCGTCTGT GTGGCTGTGG TACCGCGACG GTGACCGGTG GGCCGCGCGG AAGGTCATCA CGATCCCGCC CGAACCCGCG GAGCCGGCTG ACCTGCCACC GGTCATCCGT CCGTTCGGCG CGGTCCCCGC CCTGGTGACC GACATCGACC TGTCCGTCGA TGACCGATGG CTTTACGTGT CCTGCTGGGG AACCGGGGAG CTCAAGCAGT TCGACGTTCG GAACCCGTTC TCACCACGGG AGGTCGGGTC GGTGCGCCTC GGCGGGATCG TGCGCCGCAC CCCCCACCCG GCCGCCCCGG ACGAGCCGCT CGCCGGCGGG CCGCAGATGG TCGAGATCAG CCGTGACGGA CGTCGGGTCT ACCTGACGAA CTCGCTGTAC GCGGCGTGGG ACGACCAGTT CTACCCGGAC GGCGTCGGCG CGTGGATGGC ACGCCTCGAT GCGGACGCGG ACGCGGGCGG GGTCAGCCCG GACGTCCGCT TCTTCCCACG CGGCGCCGAC TTCCGCGGCC TACGCGTGCA CCAGATCCGG CTGGAAGGCG GCGACGCCTC GTCAGACTCC TACTGCTTCG CATGA
|
Protein sequence | MNDWHAAGAD ESDFRTPRPR WSRCAQGLAL PFSRLVSHTK PLRRQSVRLT HRIRGQPPVS VFLTGAMKIP GQADLKSRTD VVGRPCELDR PGRPDRSWSS RSRRVAVSNP DPTFYRSPGD AAAAPPEQLA YVAAFDPKGA VSDAMTVVDV DSTSSSYGRI VGWTDLSGRD DELHHFGWNA CSSALCHGHH GHAANLERRY LVVPGLRSSR IHILDTRPDP RSPRVVRTIE AAELATKAGY SRPHTVHCGP HGLYLSALGG ANGSDGPGGV ALLDHDTFNV VGPWERDRGP QYLAYDVWWH LAQGVAVTSE WGTPSMIEDG VNAELLLGRE YGHALHFWDL DSGRHRQTVD LGDDNQMVLE VRPAHDPRAS HGFAGVVTNV TDLSASVWLW YRDGDRWAAR KVITIPPEPA EPADLPPVIR PFGAVPALVT DIDLSVDDRW LYVSCWGTGE LKQFDVRNPF SPREVGSVRL GGIVRRTPHP AAPDEPLAGG PQMVEISRDG RRVYLTNSLY AAWDDQFYPD GVGAWMARLD ADADAGGVSP DVRFFPRGAD FRGLRVHQIR LEGGDASSDS YCFA
|
| |
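Translation table 11 assigns amino acids to codons the same way as the standard code, but it allows alternative start codons such as GTG, which is read as methionine at the first position. This is why the gene sequence above begins with GTG while the protein sequence begins with M. The sketch below illustrates this in Python, along with a simple GC content calculation. It assumes the sequence is given as space-separated blocks as shown in this record; the helper names are hypothetical, and the start codon set is a deliberately reduced subset of those that table 11 permits.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); "*" marks a stop codon.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}
# Assumption: a reduced subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a recognised start codon as M."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence in this record.
print(translate_cds("GTGAACGACT GGCATGCGGC"))  # MNDWHA
print(gc_percent("GTGAACGACT GGCATGCGGC"))     # 65.0
```

The translated prefix matches the first six residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give the value to compare against the reported GC content.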