Gene Information |
Locus tag | Franean1_3151 |
Symbol | |
ID | 5671528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3706563 |
End bp | 3707864 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641242046 |
Product | hypothetical protein |
Protein accession | YP_001507466 |
Protein GI | 158314958 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
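The coordinate and length fields above are consistent with each other, and the relationship can be checked with a short calculation. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = feature_lengths(3706563, 3707864)
print(gene_len, protein_len)  # 1302 433
```

The result matches the Gene Length (1302 bp) and Protein Length (433 aa) fields. The strand does not affect the calculation, because the length depends only on the coordinate span.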
Plasmid Coverage information |
Number of covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.134749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Number of covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAT GGCGCTGCAG GCAGTCCCAG ATAGTAGTCG GCACCGCCAC AGCCCTCATT CTTGTGGCGG GCTGCAGTGT CGAGGCGAAC GACGAGTCGA CGAATCCTTC GACAGGCACG CCAGCGGTAG CACCCGCGCC GGACATCTCA CCGGGTGTCA CAGCCAATAC CATCAAGATC GGCTTCGTGT ACCCCAACCT CTCCGCCGTC AAGAAGTTCA TAAACATCGA TCACGGCGAC TACGAGGCAA CCTTCACGGC ATTGGTCGAC AAGGTTAACT CCTCCGGCGG CATCAACGGC CGGAAGCTCC AGCCCGTCTT CGGCGCGGTT GACGTCACCT CGCCTGCTGG CGCTCAGGAG ACCTGTCTCA AACTGACCCA GGACGAGAAG GTTTTCGCTG TCCTCGGCAG TCTCAGCGGC GACGAGCCGC TCTGCTACAT CCAGACGCAT AGGACGGCGC TCGTCGGCGG CACGCTCTCG CCGGATCGCT ATGCCAAGGC CCAGGCACCC TGGTTCTCAT ATCAGCGAGG CGGCGACGAG GCTGCGGAGG GCATCAAGCT CTTTGCCGCG GACGGCGGCT TGGACGGGAA AGTCGCGATC GTCTCCTCTC TCAACGAGGA GGGTGTTATG AAGACGGCCA TCATGCCAGC TCTGAAGGAG CTCGGTATCA CACCGGTGGC AGCCGGGGTG CTCGATGCAC CGGCCACGGA TCCGGCAGCC GTTGCTCAGC AACTCAACGT CTTTGTGCAG AAGTTCCAGT CCGCTGGCGC CGACACCGTG ATCGTTGTCG GTGGGGTCTC GAGCGAGTTT CCCAAAAGCC TGGAGAAAAC CGACTATCGC CCCAAACTTC TCTTCAGCGA AATCAACCAG GCCGAACTCT ATTCGAACGA TCCCGGCGAG CATGATTTCA GCACACTGAA AGATGCGGCC GCCGTCGGCC TTGGTGTCAA CTGGAACGAC CCAGCGAACC TAGAATGTGT CAATACGCTC GTGGCAGCTC ATCCTGATCT GAAGGAGACA CTCATCGATC CGAACGACGT GGAGTCCGGA GAGCCTCAAC TGGGAGTTTC CGCGGGTATC GCCTGCAGCT CCCTCGCGCT ATTCACCGCT ATCGCAGAGA AGGCAGGCGG GACCCTTAGC TACAAGACAT TCCAGGATGC CGCTTTCTCC TTGGGTTCCT TCCATGTTCC TGGCTTCATG GACGATGCCA CATATGGCCC CTCCACACCC GATGGTCGGA TCCCGCCCCG CCTGTTCGAG TACAGCGCCA CAGAGAAGAA CTTCAAGATG TCCACAGGTT GA
|
Protein sequence | MRKWRCRQSQ IVVGTATALI LVAGCSVEAN DESTNPSTGT PAVAPAPDIS PGVTANTIKI GFVYPNLSAV KKFINIDHGD YEATFTALVD KVNSSGGING RKLQPVFGAV DVTSPAGAQE TCLKLTQDEK VFAVLGSLSG DEPLCYIQTH RTALVGGTLS PDRYAKAQAP WFSYQRGGDE AAEGIKLFAA DGGLDGKVAI VSSLNEEGVM KTAIMPALKE LGITPVAAGV LDAPATDPAA VAQQLNVFVQ KFQSAGADTV IVVGGVSSEF PKSLEKTDYR PKLLFSEINQ AELYSNDPGE HDFSTLKDAA AVGLGVNWND PANLECVNTL VAAHPDLKET LIDPNDVESG EPQLGVSAGI ACSSLALFTA IAEKAGGTLS YKTFQDAAFS LGSFHVPGFM DDATYGPSTP DGRIPPRLFE YSATEKNFKM STG
|
| |
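The sequences above are displayed in space-separated blocks of 10 characters, so the blocks need to be joined before any downstream use. A minimal sketch of that cleanup, with a basic sanity check for a coding sequence, might look like the following. The helper names `join_blocks` and `looks_like_cds` are hypothetical, and the check only accepts ATG as a start codon, which is a simplifying assumption (translation table 11 also allows alternative start codons).

```python
STOP_CODONS = ("TAA", "TAG", "TGA")  # stop codons in translation table 11


def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """Rough check: length is a multiple of 3, starts with ATG, ends with a stop codon.

    Simplification: alternative start codons are not accepted.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record.
print(join_blocks("ATGAGAAAAT GGCGCTGCAG"))  # ATGAGAAAATGGCGCTGCAG

# Toy sequence for illustration only, not taken from the record.
print(looks_like_cds("ATGAAATGA"))  # True
```

Applied to the full Gene sequence field, `join_blocks` should yield a string whose length can be compared with the Gene Length field as an additional consistency check.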