Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2638 |
Symbol | |
ID | 5671032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3116319 |
End bp | 3117899 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241554 |
Product | hypothetical protein |
Protein accession | YP_001506974 |
Protein GI | 158314466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.316802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACAC CCAGCCACCG CGTCGAGCAG GAGGAGCTGC GGGCGCGGAT GCGCGCGGTC GGTATGTCCC ACGACGAGAT CGCGATCGAG TTCGCCCGCC GCTACCACTA CCGTCTCCGT GCCGCTCACC GGGTCGCGCA CGGCTGGACC CAGCAGCAGG CCGCAAACCA CATCAACGCC CACGCCGCCC GCACCGGCCT CGACCCCCAG GGCACTGCCC CCATGACTGC CCCCCGGCTG TCGGAGCTGG AGAACTGGCC GCTACCGAAC AACCGCCGCC GGCCCACCCC CCAGCTCCTC GCCCAACTCG CCGAGGTCTA CGACACCAGC ATCCACAACC TCATCGACCT CGACGACCGC GAACACCTCA CCCCCGCCGA CACACTCCTC ATAGCCCAGA TCCGCAAGGG CCTCCCACAG CCAACGCAGG ACGGTTCCGC ACCCGACGCC ACCGCGATCA GCAGCCGGGG ACCGCAGATC GATCTGCCTC GCATCGGACG CTCGCGTGTC CCTGCCGTTC CGATCGGTGT ATGCGGCGCC GTCCCGACCG AGATCGACCC CGGTGCGGAC ATCGACGCGG ACACGGCACT GCGCCGCGCG CACGAATGGC TGGTGACCGA ACCACCGCAG GCCGTGGAGA CCCGCACGGG ACGGCGGATC GGGGAGGCGT TCACCCGCAA GGTCGAGGGC CGCGTCGCAC AACTGCGCCG CCTGGACGAC TTCGTTGGCG GCCGGGACCT GTATGAGCTG GTCGCGCGGG AAGTCGCCGC CACGACCGCC GTGCTCGACG ACGCCGCCTA CGACGAGCAT CTCGGACGTC GACTGCGGTC CGCTGTCGCG GAACTGTGCC AGCTCGCCGG CTGGGTCGCG ATGGACGCCG GCCACACCCA GGCCGCGCGG CGCTTCTACC TCGACGGGGT GAAAGCCGCG CACGCCGCCG GCAACAGCCC GGTCGCGGCG AACCTGATCT CGACGTTGAG CTACCAGTTC GCCAACCAGC ACGACCCGCG CACCGCCATC CTGCTGGCCC GCACCGCCCT GCGCGGAGCG GAGAACTCCG CGACGCCGGC CACCCTGGCA CTGTTGTACG AACGCATCGC CTGGGCACAC GCGAAAGCCG GCGACCGGTC CGCCACGGAG AAGGCACTCG CCGCCGTGGA GCGTCACTAC GACCAGCGAC GTCCCGACGA CGAACCGACC TGGGTGTACT GGCTCGACGA CAACGAGATC CAGGTGATGG CCGGCCGCTG CTACGTCGAA CTCGGCCTCC CGCAGCACGC CGAGCCGCTG CTGGTCGACG CGGTCGCCCG CTGCGACGAA GACCACGCCC GCGAAGCCGC CCTTTACCGC TCCTGGCTCG CCGAGGCGTA CCTGCAGACA GGCGACATCG GCCGGGCCGT CGAAGAAGCC ACGCATGTCG TCCGGCTCGA CGCCCGCGCC GGATCAGCAC GCACCTCCGA CCGGGTCCAA CACCTGCGAG CCGGCCTCGC CGCGTTTCGC ACCGACCCGG CAGTCCGCGC CTTCGAGGAC CTCTACCAAT CCGAAGCGGA TCTTCCAAGC AACCTGCGTC GACCGAACTG A
|
Protein sequence | MATPSHRVEQ EELRARMRAV GMSHDEIAIE FARRYHYRLR AAHRVAHGWT QQQAANHINA HAARTGLDPQ GTAPMTAPRL SELENWPLPN NRRRPTPQLL AQLAEVYDTS IHNLIDLDDR EHLTPADTLL IAQIRKGLPQ PTQDGSAPDA TAISSRGPQI DLPRIGRSRV PAVPIGVCGA VPTEIDPGAD IDADTALRRA HEWLVTEPPQ AVETRTGRRI GEAFTRKVEG RVAQLRRLDD FVGGRDLYEL VAREVAATTA VLDDAAYDEH LGRRLRSAVA ELCQLAGWVA MDAGHTQAAR RFYLDGVKAA HAAGNSPVAA NLISTLSYQF ANQHDPRTAI LLARTALRGA ENSATPATLA LLYERIAWAH AKAGDRSATE KALAAVERHY DQRRPDDEPT WVYWLDDNEI QVMAGRCYVE LGLPQHAEPL LVDAVARCDE DHAREAALYR SWLAEAYLQT GDIGRAVEEA THVVRLDARA GSARTSDRVQ HLRAGLAAFR TDPAVRAFED LYQSEADLPS NLRRPN
|
| |