Gene Information |
Locus tag | Franean1_3159 |
Symbol | |
ID | 5671536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3721799 |
End bp | 3724198 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641242054 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001507474 |
Protein GI | 158314966 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
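The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon (consistent with the TGA at the end of the gene sequence). The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information table for Franean1_3159.
start_bp = 3721799
end_bp = 3724198
gene_length_bp = 2400
protein_length_aa = 799


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon is counted in the gene length but encodes no residue.
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)      # gene length matches the coordinates
print(computed_protein == protein_length_aa)  # protein length matches the CDS length
```

Under those assumptions both comparisons hold: 3724198 - 3721799 + 1 = 2400 bp, and 2400 / 3 = 800 codons, one of which is the stop codon, leaving 799 aa.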
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTGGGATG AGGTCAGGCT CCCTCATGAC GCGATGGTGG CTCGTGAACG TAACCGTTCC GACGCCGCGG CGGGCCAGCG CGGCTTCTAC CCGGCTGGGG CGTACCAGTA CAGGAAGTCG TTCTTCGTGC CGGAGGAGTA CCGGAACCGG CGCGTCACCT TCGAGTTCGA GGGTGTCTAC CGGAGCGCCA GGGTTTTTCT CAATGGTGGC CTCGCCGGAC AGCATGCCTA CGGCTACTCC CACTTCTACG TCCGCGCCGA CCATTTCCTG AAGTACAACG AGGACAACGA GATCCTGGTG GAGGCACACA GCGCCGATGA CACCCGCTGG TACTCGGGTG GTGGCCTGTA CCGCAACACG AAGCTCATCG TCGGCGATCT CGTGCACATC GGCCTGGACG GGGTGAAGGT CACGACGCCG GCCATCGACG CGGACCTTGC CCTAGTCGCG GTGGCGACGC AGGTGCACAA CGAGTCCTCG GTCACCCGGA CCGTCGAGGT GACCACCGAG ATCGTGGGCG CCGACGGTGT CGTCGTGGTC CGTGACGTCG CCCCGCTCAC CCTGTTCACG GACGATCCGG TCACCGTACG CCAGCGGCTG CCGGTGCCGC GGCCGCAGCT GTGGGGAGTG GAGCACCCGC ACCTCTACAC CTGCCGGACC AGAGTGACGG CGGACGGAGA GCTCCTGGAT GAGGAGGCCA CCCGTTTCGG TGTCCGGTCG CTCACCGTCG ATCCGCACTG GGGTCTGCGC ATCAATGGTG GGGTGGTGAA CCTCCGCGGC GCGTGCATCC ACCACGACAA CGGCGTGATC GGTGCCGCCA CGATAGACCG GGCCGAACAG CGCCGGGTCG AGATCCTGAA ACAGGCCGGT TTCAACGCCA TCCGTAGTTC CCACAATCCG ATCAGCAAGG CCCTCCTCGA CGCCTGCGAC CGGCTCGGCA TGCTCGTAAT CGACGAGTTG TTCGACGCAT GGACCCGGTC GAAGGTCGCG CAGGACTACG CCCTCGACTT CCCCCTCTGG TGGAAGTCGG ACGTGCAGGC GATGGTCGAC AAGGACTTCA ACCACCCCTG CGTGATCCTC TACTCGATCG GGAACGAGAT CCCGGAGACA GGCACCGCTG CCGGCGCGGC GATCAGCCGC CAGCTCGCCG AGAGGATCCG AGCCATCGAC GACACCCGTT TCGTCACGAA CGGCGTCAAC GGCCTTCTCG CCGGCGGCCC CGAGCTGCTC GCGTCGTTCT CCAGCGGTGC TCGGGAGAAA AGCAGCGAGG CGGGCGAGGC GGTGGACGTC AACGCGTTCA TGAACAGGTT CCGCGAGTTT ATGCCGATCC TCATGGCCTC CGAAATGGTC GGTTCGAAGA CCGCGGAGTC GATGGCCTGC CTGGACGTCG CCGGCTACAA CTACCTGGAG TCACGGTACG AGCTGGACCG AACACTGTTC CCGAACCGGG TGATCGTGGG GACCGAGACC TACCCGTCGG AGATCGACAG GAACTGGCGG CTCGTCCAGG ACAACAGCCA CGTCATCGGT GACTTCACCT GGACCGGCTG GGACTATCTC GGCGAACCAG GAATCGGGCG GATCGAGTAC CAAGGCGACG AGGAAAACGC CAGCACCTCC CCATCCCACG GCAGTTATCC GTGGCTGACC GCGTGGTGCG GCGACATCGA CATCACCGGC CACCGCCGAC CGGCCTCCTA CTACCGCGAG ATCGTGTTCG GCCTACGCAG CGAGCCCTAC ATAGCCGTGC ACCGCCCCGA CCGCTACGGC CAGCCAGTTA CGGTGGCGAT GTGGTGGTCG 
TGGAGTGACG CGATCTCCAG CTGGTCCTGG GACGGCCACG AGACCAGACC GGTGCGGGTG GAGGTTTACT CGGCCGCCGA CGAGGTCGAA CTTTTGGTCA ACGGCCGGCT GATCGGTACC GTCCCGGCGG GGGAGAAGAA CCGGTTTAAG GCCGAGTTCG ACACCGTTTA CGAACCCGGC GAAATCGTCG CTGTCGCCTA CACCGCTGGC CGCGAGACCG GACGCACCCT GCTGCGCTCG GCGACCGGCG AGGTCCGCCT CGCCGTCGCC GCCGACCGCA CCGACATCGT CGCCGACGAC ACCGACCTTG CCTACATCGC CATCACCCTC GTCGACGAGG CCGGCAACCT CTACAACACC GCCGACCGCA CGGTCGCCGT CGAGGTGGCG GGACCCGGCG TGCTGCAGGG CTTCGGTAGC GCGGACCCGA AAACGGAAGA GAACTTCTTC GATACCACCC GCGCCACCTT CGACGGCCGG GCACTCGCCG TCATCCGCCC CACCGCCCCC GGCACGATCA CCGTGACCCT CACCGCGCAG GGATGCGAGC CCTCCACTAT CCGCATCGAA GCCGAACTTA CGGCACGCTC AGGCGAATGA
|
Protein sequence | MWDEVRLPHD AMVARERNRS DAAAGQRGFY PAGAYQYRKS FFVPEEYRNR RVTFEFEGVY RSARVFLNGG LAGQHAYGYS HFYVRADHFL KYNEDNEILV EAHSADDTRW YSGGGLYRNT KLIVGDLVHI GLDGVKVTTP AIDADLALVA VATQVHNESS VTRTVEVTTE IVGADGVVVV RDVAPLTLFT DDPVTVRQRL PVPRPQLWGV EHPHLYTCRT RVTADGELLD EEATRFGVRS LTVDPHWGLR INGGVVNLRG ACIHHDNGVI GAATIDRAEQ RRVEILKQAG FNAIRSSHNP ISKALLDACD RLGMLVIDEL FDAWTRSKVA QDYALDFPLW WKSDVQAMVD KDFNHPCVIL YSIGNEIPET GTAAGAAISR QLAERIRAID DTRFVTNGVN GLLAGGPELL ASFSSGAREK SSEAGEAVDV NAFMNRFREF MPILMASEMV GSKTAESMAC LDVAGYNYLE SRYELDRTLF PNRVIVGTET YPSEIDRNWR LVQDNSHVIG DFTWTGWDYL GEPGIGRIEY QGDEENASTS PSHGSYPWLT AWCGDIDITG HRRPASYYRE IVFGLRSEPY IAVHRPDRYG QPVTVAMWWS WSDAISSWSW DGHETRPVRV EVYSAADEVE LLVNGRLIGT VPAGEKNRFK AEFDTVYEPG EIVAVAYTAG RETGRTLLRS ATGEVRLAVA ADRTDIVADD TDLAYIAITL VDEAGNLYNT ADRTVAVEVA GPGVLQGFGS ADPKTEENFF DTTRATFDGR ALAVIRPTAP GTITVTLTAQ GCEPSTIRIE AELTARSGE
|
| |