Gene Information |
Locus tag | Francci3_4214 |
Symbol | |
ID | 3907179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5030772 |
End bp | 5032019 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881541 |
Product | insertion element hypothetical protein |
Protein accession | YP_483290 |
Protein GI | 86742890 |
COG category | |
COG ID | |
TIGRFAM ID | |
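The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. Both are assumptions about this record's conventions, although they agree with the numbers listed. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5030772
end_bp = 5032019

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop
# codon, which is assumed not to be counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the Gene Length (1248 bp) and Protein Length (415 aa) fields, and the remainder of zero confirms the CDS is a whole number of codons.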
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.897707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGACA CGGGTAGCCA GCTGACGGAC TGGATCTCGT TGGGGGTCCT GACGTCGTTC GTCTCGCGGG ACGCGGTTGA TGGGGCGATC GAGGCGACCG GGAGAGGTGC CCGGCGTTCC GACACGACGA TCCCGCCGCG GGTCGCGGTC TACTTCGTGA TGGCGCTCGC GTTGTTCGCG GACGACGACT ACGAGGCGGT CGCGTGCCGG CTCGCCGCCA CCCTGGACGA TCTCGATGTG GTGGGGCCGC GGTGGGAACC GACCTCGGGC GGGTTGACGA AAGCCCGCCA GCGGCTCGGT TCAGCGCCGC TGGCGGAACT GTTCTGCCAG GTCGCCGGGC CGGTCGCGGA CCTCGACACG GTCGGGGCGT TCCTCGGCCC GTGGCGGCTG ATGAGTATCG ACGGACTGGA GTGGGACGTG CCCGCGTCCA GGGAGAACGT CGCCGCGTTC GGCCTGCCCG CGGGCCGTGA CGGCGCGCCG GGGGCGCTCC CGAAGGTCCG CGCGGTCACC GTGTCCGAGT GCGCCTCGCA CGCGCCGGTG CTGGCCGCGT TCGGCCCGGC CGGTGGGGCG AAGTCCGCCA GCGAGCAGGC CCTGGCCCGC ACCCTGTACC CGCGGCTGGC TGAACGCTGG CTTCTGCTCG CGGACCGCAA CTTCTACTCC TGGACGGACT GGTGCACCGC CGCGGACACC GGCGCGGCGT TGCTGTGGCG GGTCAAGGCC AGCCTGCGGC TACCGCCGCT ACGCGCGTTG TCCGACGGCT CGTACCTGAC GGTCCTAGTC AACCCGAAGA TCGGTGGGAA GGCGCGGGAC GCACTCGTCG CCGCGGCCCG GGCCGGTGAG GTACTCGATC CGGCGAAGGC CCGCTACGCC CGTCTCGTCG AGTACGACGT GCCCGACCGC GACGGCGACG GGAAACACGA GATCATCGGC CTGCTCACCA CGATCTGTGA CCCGCGGGAG GCGACCGCGA CCGCCCTGGC CGGGGCGTAT CGGCACAGAT GGGAACACGA GATCGGCAAC AAGCAGCTCA AGACCTACCT ACGCGGCCCG GGGAAGGTCC TGCGCTCGAA GCACCCCGAC ACCGTCTACC AGGAGATCTA CGGCTACCTG CTCACCCACC ACGCGATCAG TGCGCTGACC TGCCAGGCCG CGACCGCCGC CGGGATCGAC CCCGACCGGA TCAAGTTCAA ACGCGCTGTG AGGATCATCC GGGACCGGGT CGTCACCGAC CCGGCTTTTT CCCCCTGA
|
Protein sequence | MTDTGSQLTD WISLGVLTSF VSRDAVDGAI EATGRGARRS DTTIPPRVAV YFVMALALFA DDDYEAVACR LAATLDDLDV VGPRWEPTSG GLTKARQRLG SAPLAELFCQ VAGPVADLDT VGAFLGPWRL MSIDGLEWDV PASRENVAAF GLPAGRDGAP GALPKVRAVT VSECASHAPV LAAFGPAGGA KSASEQALAR TLYPRLAERW LLLADRNFYS WTDWCTAADT GAALLWRVKA SLRLPPLRAL SDGSYLTVLV NPKIGGKARD ALVAAARAGE VLDPAKARYA RLVEYDVPDR DGDGKHEIIG LLTTICDPRE ATATALAGAY RHRWEHEIGN KQLKTYLRGP GKVLRSKHPD TVYQEIYGYL LTHHAISALT CQAATAAGID PDRIKFKRAV RIIRDRVVTD PAFSP
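The gene sequence starts with GTG while the protein starts with M. This is consistent with translation table 11, which uses the standard amino-acid assignments but accepts alternative start codons such as GTG, read as methionine at the initiation position. The following is a minimal sketch of that translation, applied only to the first 60 bases of the gene sequence above. The helper names `translate_cds` and `gc_fraction` are hypothetical and do not come from the source database.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs in the permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment, reading the first codon as Met (start codon)."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Assumption: the fragment begins at the annotated start codon, so the
        # first codon (GTG here) is translated as methionine.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring the spaces used for grouping."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First six 10-base groups of the gene sequence listed in this record.
prefix = "GTGACGGACA CGGGTAGCCA GCTGACGGAC TGGATCTCGT TGGGGGTCCT GACGTCGTTC"
print(translate_cds(prefix))
```

The translated prefix can be compared against the first 20 residues of the protein sequence above. Applying `gc_fraction` to the full gene sequence is one way to check the GC content field.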