Gene Information |
Locus tag | Francci3_1784 |
Symbol | |
ID | 3904014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2121348 |
End bp | 2122412 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637879122 |
Product | hypothetical protein |
Protein accession | YP_480889 |
Protein GI | 86740489 |
COG category | |
COG ID | |
TIGRFAM ID | |
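The start and end coordinates, gene length, and protein length listed above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete CDS whose final codon is a stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS that ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2121348, 2122412)
print(length_bp)                  # 1065
print(protein_length(length_bp))  # 354
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.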
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00707471 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGAAGGTC ACCCGGTACG CGGTCGAACC GGTCGGGTCA TGCTGACCCG CAGCGTCCTC ATCGGCGCCT GCCTGCTGGG CTTCCTCGCG ACCACAGCAC CCGCCTTCGC CGACAGCTAC GCCACCGCGG ACTGCGCGCA GAACCCCCAC CCCGGCTGCG AGGTAAGCGC CGGCACGAAC GGCACCGCGC CCACACCACC CCGCCGTGAC GTCCGGCCCG GGCCGCCGGG CGGCACCTCG ACCGGCCGCG GCGGCGGGGA GAAGAGCGCA CGTCCGCCCG GTGACCTCTC ACTCGACCCC TCGGAACTCG CGGGCTGCGC CTACACGCGC AGCGACTTCC AGCCCGAGGG CGACCCGATC CGGCCGGTAG CCTTCCGCCC AGCTCCACAG GGCAGCCGAC CGCGCGTCGT CGCCGCCCTG GACCGCCTCG GCGCGCCACA GGTGCAACCC GTCGCCACCG CGGCAGACGG GCAGCCCGGA GCCTGGTACG TATACCAGTG CCAGGGCGAC GGATGGCACG ACGCGCTCTA CCGGCCACCG GTATGGATCG CGGACGGGCA GGCCGGCCCC ACAGCCACCG CACCCGACCC GGCCACCCTC GCCGAGCAGG CCCGCAACCA ACTGCGCCTC CAAGGCCCCG CCATCGCCTT CAGCCCAACC AGACGGCAGC TCGTGCGACT ACCAACCTGG ATGTGGCTCG ACCCCGCCAG CTGGCGACCC GTCTCCGCCA CCGCCGCCGC CGGCGGAGTC TCGGTGCAAG CGGTCGCCAC CCCCGCGCAG GTCGTATGGT CCATGGGCGA CGGCACCGAC GTGCGCTGCA CCGGCCCCGG CACCCCCTAC CAGCCCACCG TCGACCCGAC GGCCGCCTCC CCCGACTGCG GCCACACCTA CACCACCGAC TCCGAAGACG AACCCGGCGG CGTCTTCCCC GTGTCCGCCA CCGTCACCTG GAACGTCACC TGGGCCGGCG GCGGCCAGAA CGGCACCTTC AACGGGCTCA CCACCATCTC CACCGCCCAG GTCAGCGTCA TCTCCGTGCC CGCACTGACC ACCGGCGGAG GCTGA
|
Protein sequence | MEGHPVRGRT GRVMLTRSVL IGACLLGFLA TTAPAFADSY ATADCAQNPH PGCEVSAGTN GTAPTPPRRD VRPGPPGGTS TGRGGGEKSA RPPGDLSLDP SELAGCAYTR SDFQPEGDPI RPVAFRPAPQ GSRPRVVAAL DRLGAPQVQP VATAADGQPG AWYVYQCQGD GWHDALYRPP VWIADGQAGP TATAPDPATL AEQARNQLRL QGPAIAFSPT RRQLVRLPTW MWLDPASWRP VSATAAAGGV SVQAVATPAQ VVWSMGDGTD VRCTGPGTPY QPTVDPTAAS PDCGHTYTTD SEDEPGGVFP VSATVTWNVT WAGGGQNGTF NGLTTISTAQ VSVISVPALT TGGG
|
| |
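The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the protein sequence and GC content could be recomputed from it follows. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons); the helper names are hypothetical, and the sketch does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; these assignments are
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGAAGGTC ACCCGGTACG CGGTCGAACC"))  # MEGHPVRGRT
```

Passing the full Gene sequence field to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content fields.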