Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5254 |
Symbol | |
ID | 5673588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6316086 |
End bp | 6317513 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244109 |
Product | VWA containing CoxE family protein |
Protein accession | YP_001509518 |
Protein GI | 158317010 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.697295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.473952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGC TCGACCGCAC CGTCGAGTTC ACCGCCGCGC TGCGCCGCGC CAACGTCCCG GTGAGCAGCG CCGAGACGGT GGACGCCGCG CGGGCGGTCG GTGCCATCGG CTGGGCCGAC CGGGACGCCC TGCGGGCCGC GTTCGCCGCG ACGATGTGCA AGCGCCCGCT GTACCGGAGC GCGTTCGACT CGCTGTTCGA CCTGTACTTT CCGCCGCGGA TCGGCGACGG TGTCGTGCTG CCCGACGAGG GCGGGCCCGC CGGGGAGCAG CAGCCGGGGG AGCAGCGGGA CGGCGACCCG TCCGAGCTGA CCCCGCAGGA GATCGAGGCG CTGCGCCGGG CCATGCGCGA CCAGCTCCGC GACGCGCTGC TCGACGGCGA CGACGAGAAG CTCCGCGACC TGGCCCGCCG TTCGGTCAGC GCCTTCGGCG CGGCGCAGAA CGGGCCGGGG CAGCGCAGCT ACTTCATCTA CCGGGTGCTG CGGGCGATGT CCCCGGAGAC ACTCATCGCG GACCTGCTCG CCGCGATGCT CGGCGACGAC GATCGCGGCG GCCTCGAAGA GCGGATCGCG CGGCAGACGA TCGCCGACCG CATCCGGGCG TTCGAGGAGA TGATCTCCTC CGAGGTCCGC CGGCGGATGG CCGAGGAACG CGGCATCGAG GCCGTCGAGC GGACCGCGGT GAAGCCGCTC GCCGACCAGG TCGACTTCCT GCGTGCCTCC CAGCGTGATC TGGTCGAGCT GCGCCGCCAG GTGTATCCGC TCGCCCGCCG GCTGGCGACC AGGCTGACGG CGCGGCGCCG GCTCGGCCGG GCCGGCCGGC TCGACTTCCG CCGCACCGTC CGGGCATCGC TGGCGACCGG TGGCGTGCCG ATCGAGACCA AGCACCGGCC GCACAAGCCG CACAAGCCCG AGCTTGTCGT GCTGTGCGAC GTGTCCGGAT CGGTGTCCTC CTTCGCGCAC TTCACCCTCA TGCTCACGCA CGCGCTGCGC GAGCAGTTCT CCAAGGTCCG CGCGTTCGCG TTCATCGACA CGACCGACGA GGTCACCCGC TTCCTGCGCG GGCTCGAGCT GGGCGACATG ATGGCCCGGA TCGCCTCCGA GGCCGACCTG GTCTGGTTCG ACGGCCACAG CGACTACGGC CACGCGATCG AGGTCTTCGC CGAGAAGTAC CCGGACGCAG TCGGGCCGCG GACGTCGCTG CTCGTCCTGG GGGATGCCCG CAACAACTAC CGGGCCACCT CGGCCGCGGT GTTCCGCCGG CTGTGCGGGC AGGCCCGGCA CTCCTACTGG CTGAACCCGG AGCCGCGCAG CTACTGGGGC TCCGGCGACT CGGCCACCAC CGCCTACGCG GACCTCGTCG ACGAGATGGT CGAGTGCCGC AACGTCGAGC AGCTCCAGCA CTTCATCGAG CGTCTGCTAC CCACCTGA
|
Protein sequence | MNLLDRTVEF TAALRRANVP VSSAETVDAA RAVGAIGWAD RDALRAAFAA TMCKRPLYRS AFDSLFDLYF PPRIGDGVVL PDEGGPAGEQ QPGEQRDGDP SELTPQEIEA LRRAMRDQLR DALLDGDDEK LRDLARRSVS AFGAAQNGPG QRSYFIYRVL RAMSPETLIA DLLAAMLGDD DRGGLEERIA RQTIADRIRA FEEMISSEVR RRMAEERGIE AVERTAVKPL ADQVDFLRAS QRDLVELRRQ VYPLARRLAT RLTARRRLGR AGRLDFRRTV RASLATGGVP IETKHRPHKP HKPELVVLCD VSGSVSSFAH FTLMLTHALR EQFSKVRAFA FIDTTDEVTR FLRGLELGDM MARIASEADL VWFDGHSDYG HAIEVFAEKY PDAVGPRTSL LVLGDARNNY RATSAAVFRR LCGQARHSYW LNPEPRSYWG SGDSATTAYA DLVDEMVECR NVEQLQHFIE RLLPT
|
| |