Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5130 |
Symbol | |
ID | 5673464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6145802 |
End bp | 6146902 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243980 |
Product | hypothetical protein |
Protein accession | YP_001509394 |
Protein GI | 158316886 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.355874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0671024 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGCTC CCCGCGCGGA TCCGGCCGCG CCCCAGCCCG CGGCGACGTC GCCGTCGGTC GAGCGGACGC TGCGCAGCCT CGAACTCACC GTGACCCGCC GGCTGGACGG CATGCTGCTC GGCGATCATC TCGGCCTGCT GCCCGGCCAG GGCACCGAGA AGGCCGAGAG CCGGGAGTAC AACGTCGGCG ACGACGTCCG CCGGATGGAC TGGGCGGTCA CCGCGCGGAC GACCGTCCCG CACGTGCACG ACCTGATCGC CGACCGGGAG CTGGAGACGT GGGCGCTGGT CGACCTGACG GCCAGCCAGG AGTTCGGCAC CGCCTCGGTC CGCAAGCGCG ATCTGGCGAT CGCCGCGGTG GCGGCGATCG GCTTCCTCAC CGCCCGCACG GGCAACCGGA TGGGAGCCGT GGCCCTCACC CCGGCCGGGC CACGGGTCAT CCCCGCCCGG CCCGGCCGCC AGGGCCTGCG AACGCTGCTG CGGACCCTGC TGACGGTCCC CGAGGGGGCG CACGACCGGC CGCTGCGCCG GCCCGACCCG GCGGCCGCCA CCGATCTCGC CGCCGCAATC GCCGCCCTGG ACCGCCCGCG CCGGCGCCGT GGCCTCGCGG TGGTCGTCAG CGACTTCCTC TCCACCGACC TCGGCTGGGA ACGGCCGATG CGCGTCCTCG CGGCGCGCCA CCAGCTCCTC GCGGTCGAGG TCCTCGACCC GGCCGAGCTG ACGCTGCCCG CCGTGGGCCT GCTTCCGGTC GTGGACGCGG AGACCGGCGA GCTGGTGGAG GTTCCGACGT CCTCACGGCG ACTGCGTGAG CGCTACCGCC TGGCCGCGGC CGAGCACCGC TCCCAGGTCG CCCTCGCGCT GCGCCGGGCG GGCGCCGGGC ACCTGGTGCT GCGCACCGAC TCCGACTGGC TGATCGACAT CGTCCGCTTC GTCTCGGCGA GCCGGACGAG CCGCGGCGCG GCACGACGCC CACCCGTGGA CTCGACCCGG CTACCGGGCC ACCCGCGATC GCTCCCGCCG GCCACCGGCC GGGGTCGACC CGGGACAGCA GCTGTGGTCG GGGCAGGCGG CCGTCGAGGC AGGCGGGCGG CGGCGCCGTG A
|
Protein sequence | MTAPRADPAA PQPAATSPSV ERTLRSLELT VTRRLDGMLL GDHLGLLPGQ GTEKAESREY NVGDDVRRMD WAVTARTTVP HVHDLIADRE LETWALVDLT ASQEFGTASV RKRDLAIAAV AAIGFLTART GNRMGAVALT PAGPRVIPAR PGRQGLRTLL RTLLTVPEGA HDRPLRRPDP AAATDLAAAI AALDRPRRRR GLAVVVSDFL STDLGWERPM RVLAARHQLL AVEVLDPAEL TLPAVGLLPV VDAETGELVE VPTSSRRLRE RYRLAAAEHR SQVALALRRA GAGHLVLRTD SDWLIDIVRF VSASRTSRGA ARRPPVDSTR LPGHPRSLPP ATGRGRPGTA AVVGAGGRRG RRAAAP
|
| |