Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3976 |
Symbol | |
ID | 5672337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4761380 |
End bp | 4762663 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242855 |
Product | hypothetical protein |
Protein accession | YP_001508272 |
Protein GI | 158315764 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02678] conserved hypothetical protein TIGR02678 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.667235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.133993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCC AGCGCCGCAA CATCCCGCGC GGGTCGCGTC CCGGGTCGTC GTCGACAGCG GTCATCGACG CACTTGACGC GCAGCGCGCC GCCGAGCGCC GCCGGGCCAT GCGCGCGATC CTGCGCCGCC CGCTGCTCGT CGCGCACGGT CCCGACGCCG ACGCCTTCCG CCTCGTCCGC CGGCACCAGA CGTGGCTGCG GGACTGGTTC ACCCGAGAGA CGGGCTGGTC GCTGCGGGTC GACCCGGAGG TGGCCCGACT GGCCAAGATC CCGGCCGACC TGACCGACGG CACCCGCCCG GCCACGGCAG GATCGGCCCA GCAGCCGTTC GGCCGCCGCC GCTACGTCCT GCTCTGCCTG GCCCTGGCCG GCCTGGAACG GGCCGACAAC CAGATCACCC TGGGCAGCCT GGCCGACGAC GTGATGATGG GCTGCGCCGC CCCTGAGCTC GCCGAGGCGG GCGTCAGCTT CAGCCTCGAC AGCCGGGACG AGCGCGCCGA CCTGGTGGCG GCCGTCCGCG TTCTCCTTGA CCTCGGTGTG CTGCGCCGGG TCGCCGGTGA CGAGACGACC TTCACCACCG GCACCGGCGA CGCCCTCTAC GACCTCGACC GCCGGGCGCT GGCCGGCATG CTGGTGACCC GGCGCGGCCC CTCGACAGTC CGCGACATCC CAGGCCCGGC CGATGTGGAA GGCCGGCTGG CCGCCGTCGT CGAGGAACTG ACGGCCGACA CCGACGACGC GCGCAACCTG GCCCGTCGCC ACGCGCTGAC CCGGCGCCTG CTGGACGACC CGGTCGTCTA CTACGTGGAT CTGGACGAGG GCGAGCGGGC CTACCTGACC AGCCAGCGGG CCGTGCTGAC CCGGCGGATC ACCGAGGCGA CGGGCCTGGT CGCAGAGGTC CGCGCGGAGG GGATCGCGAT GGTCGACCCC GACGGCGACC TCACCGACAC CCGAATGCCC GAGGACGGCA CCGATGGCCA CGCCACCCTC CTGCTCGCCG AGCATCTCGC CCGCGAGGGC ACCCGTCTCG GCCCAGGTGA GCCGATAGCT GTCGCCGACC TCGACGCCCA CATGCGCGAG CTGATCGCCC AGCACCAGAA GCACTGGCGC AAGGGCGTCA CCGAGCCCGA CGCGGAGGCC GAGCTGGTCG ACCGGGCGCT GTCGCGGATG CGAGCCCTCG GCCTGCTGCG CCGGCGCGGG GACGACGTGT TCGCCCTGCC GGCGCTCGCC CGGTTCGCGC TCGGCGATCT GCGGGACGGC GGCGGCCAGG AGTCACTGGC ATGA
|
Protein sequence | MTTQRRNIPR GSRPGSSSTA VIDALDAQRA AERRRAMRAI LRRPLLVAHG PDADAFRLVR RHQTWLRDWF TRETGWSLRV DPEVARLAKI PADLTDGTRP ATAGSAQQPF GRRRYVLLCL ALAGLERADN QITLGSLADD VMMGCAAPEL AEAGVSFSLD SRDERADLVA AVRVLLDLGV LRRVAGDETT FTTGTGDALY DLDRRALAGM LVTRRGPSTV RDIPGPADVE GRLAAVVEEL TADTDDARNL ARRHALTRRL LDDPVVYYVD LDEGERAYLT SQRAVLTRRI TEATGLVAEV RAEGIAMVDP DGDLTDTRMP EDGTDGHATL LLAEHLAREG TRLGPGEPIA VADLDAHMRE LIAQHQKHWR KGVTEPDAEA ELVDRALSRM RALGLLRRRG DDVFALPALA RFALGDLRDG GGQESLA
|
| |