Gene Information |
Locus tag | Francci3_1420 |
Symbol | |
ID | 3903401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1711502 |
End bp | 1712542 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637878757 |
Product | hypothetical protein |
Protein accession | YP_480526 |
Protein GI | 86740126 |
COG category | [R] General function prediction only |
COG ID | [COG0325] Predicted enzyme with a TIM-barrel fold |
TIGRFAM ID | [TIGR00044] pyridoxal phosphate enzyme, YggS family |
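The length fields in this table follow from the coordinates. The sketch below recomputes them, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1711502
end_bp = 1712542

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# so one codon is subtracted from the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 1041 bp
print(protein_length, "aa")  # 346 aa
```

Both results agree with the Gene Length and Protein Length rows.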
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0186857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0858519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCACC GGCCCGAGAT GCCCGCGAGG CATGGCGCAT CCGGCTCGGG GGAGTCCGAC GGACCGTTCG ACCCGGACCG GCCTGCCCCG GACCGGCCTG CCCCGGACCG GCCTGCCCCG GACCGGCCTG CCCCGGACCG GCCTGCCCCG GACCGGCCTG CCCCGGACCG GCCTGCCCCG GACCGGCCTG CCCCGGACCG GCCCGCCCCG GACCGGCCCG CCCCGATCGA GCTCGATCCG GCGCGCCTGG ACCGGTTGAC GCAGCGGCTG GCCGAGGTCC GGGCTCGGAT CGCGGGGGCG GCCCGGGCCG CGGGCCGTGA TCCGGACCAC CTCACCCTCA TTGCGGTCAG TAAAACCTAC CCACCCCAGG ATGTTGTGAT GATGCACACG CTCGGGGTGC GGCACTTCGC CGAGAACCGG GAGCAGGAGG CCGGGCCGAA GGTGAGTCTC GTCACCCGGC TGATCGGCGG GGAACGGAGC GTCCCGGCCA AGGGAACCGG TGACGGCCTG TCCTCCGGTG CCACCGGTTC CGACGATCCG ATCTGGCATT TCGTGGGACA ACTGCAGCGC AACAAGGCCA GATCCGTTCT TCGTTGGGCG GATTGGGTGC AGTCGGTGGA TCGGGTGAGC CTGGTGCCAG TGCTCTCCCG GCTGGCAATG GAACGCGGCC GCCCGCTGTC GATCTGTCTC CAGGTCTCGT TGGACCTCCC TGGTGCTTCC GATGGGAAGA TCGGCGCGTC GATCGCCGGC TCGAGGCGCG GAGGGATCGA TCCGGCCGGC CTTTCCGCCC TGGCCGATCT TGTCGAGGAG GCGCCGGGAC TGGCCCTGCG AGGCGTGATG GCTGTCGCCC CCCGGAGGGG GCAGCCACGA CCTGCGTTCG CGCGACTGCG TGAGGTGGCG GAACGCCTGA AGGTGGGGCA TCCCCAGGCC ACCGTCATCA GTGCCGGCAT GTCGGGAGAT CTTGAGGACG CTGTGGCCGA AGGCGCGACA CACCTTCGGA TCGGCACCGC TTTGTTCGGT GAACGGCCTG GTGTCCCTTA G
|
Protein sequence | MIHRPEMPAR HGASGSGESD GPFDPDRPAP DRPAPDRPAP DRPAPDRPAP DRPAPDRPAP DRPAPDRPAP DRPAPIELDP ARLDRLTQRL AEVRARIAGA ARAAGRDPDH LTLIAVSKTY PPQDVVMMHT LGVRHFAENR EQEAGPKVSL VTRLIGGERS VPAKGTGDGL SSGATGSDDP IWHFVGQLQR NKARSVLRWA DWVQSVDRVS LVPVLSRLAM ERGRPLSICL QVSLDLPGAS DGKIGASIAG SRRGGIDPAG LSALADLVEE APGLALRGVM AVAPRRGQPR PAFARLREVA ERLKVGHPQA TVISAGMSGD LEDAVAEGAT HLRIGTALFG ERPGVP
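The sequence fields are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, compute a GC fraction, and translate a gene sequence. The function names are hypothetical. The codon table is the standard one, which translation table 11 shares for amino acid assignments (table 11 differs only in the start codons it permits). Only the first three blocks of the gene sequence are used here, and the start codon is assumed to be read as methionine.

```python
# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons; drop a single trailing stop if present."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence from this record.
prefix = "ATGATCCACC GGCCCGAGAT GCCCGCGAGG"
print(translate(prefix))  # MIHRPEMPAR
print(round(gc_fraction(prefix), 2))
```

The translated prefix matches the first block of the protein sequence. Running the same helpers on the full gene sequence is one way to check the GC content and protein length reported above.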