Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1695 |
Symbol | |
ID | 3903272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2035171 |
End bp | 2036184 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637879033 |
Product | hypothetical protein |
Protein accession | YP_480800 |
Protein GI | 86740400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00508626 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0395345 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGC CCTTGCCGCA ACGCGCCCTA CGCCTCCCGC AGCGTCCCGC CGCCCGCGTC GTCGCCGACG CCAACCATCT CGCGAACCTC ACCCCTCACC TCAGCCCCCG CGACCGCTGG ATCACCCGGC TCCTTTACGA ACACCGGGTC CTGACCACTC ATCAGCTCGT CCACGCCGCC TGGACCAACC GTCGCACCGC CAACGAGCGG CTCCTCCAGC TCTACCGCTG GCGTGTCATC GACCGCTTCC AACCCCTGAG CCCCCTCGGC GAGGGCATGC CCCCGGCGCA CTATGTCTGC GACGTTGCCG GCGCCGCGAT CCTCGCCGCC GAAGACGGCA TCGACCTGGC CGCGACCGGT TACCGGCACG ACCGGGCACT CGGCGTCGCC TACTGGCCCC AGCTCGCCCA CCGCGTCGCG GTCAACGGCT TCTTCACCCA CCTCATCGCC CACGCCCGAC AGCCCAACCC GCCCGGCACG CTCACCGCCT GGTGGTCCGA GGCTCGCACC CGGGCCGCGT TCGGCGACAT CGTCCGTCCC GACGCCTACG GACGGTGGAC CAGCCGCGGC AGCGACCTGG AATGGTTCCT CGAACTCGAC TGGGCCACCG AGCCGTACGC CCGCCTCGCC GCGAAGATCG ACAAATATGG GCGGCTCGCC TCCGCGACCG GCATCACCAC CCCGGTCCTG TTCTGGTTCC CCACCATCGG CCGGGAGACC CGCGCCCGCC GCGCGCTGGC CGACGCCGTT GCCGGGCTCG ACCAGCCGCA CACCGTCCCG GTCGCGACCA CCGCGGCCAC CCTCGCCCCG CCCGACGACC AGCTCGACCC GGCCCTCGCC CGCTGGCTGC CGCTCGGCGC CAGCCGCCCC GGCCGGCTCA CCCTCGATCA GCTGCCCCGG GCCTGGCCCC GTCTGCCGGC ACCCGCGCCG GTGTCGGACC GGCCGGATGC CATCTCCGCC GGGTCGGGCC TGCGCCCGCC TGCTCCGATG CCGCCCGCCA AGTACCGGGG GTGA
|
Protein sequence | MAKPLPQRAL RLPQRPAARV VADANHLANL TPHLSPRDRW ITRLLYEHRV LTTHQLVHAA WTNRRTANER LLQLYRWRVI DRFQPLSPLG EGMPPAHYVC DVAGAAILAA EDGIDLAATG YRHDRALGVA YWPQLAHRVA VNGFFTHLIA HARQPNPPGT LTAWWSEART RAAFGDIVRP DAYGRWTSRG SDLEWFLELD WATEPYARLA AKIDKYGRLA SATGITTPVL FWFPTIGRET RARRALADAV AGLDQPHTVP VATTAATLAP PDDQLDPALA RWLPLGASRP GRLTLDQLPR AWPRLPAPAP VSDRPDAISA GSGLRPPAPM PPAKYRG
|
| |