Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2941 |
Symbol | |
ID | 3903756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3474202 |
End bp | 3475503 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637880262 |
Product | transposase, IS4 |
Protein accession | YP_482028 |
Protein GI | 86741628 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.135118 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCGGTCC AGTCTCGCAT GCCCGTCACT CCCACCGATC TCGGACAGGC CGGGTCGGGG CAGCTGGTCC GGATGCGGCG GTCGCTGCGG GTCCTGGGGG CGCACGGCGG TGAGGTGCAG GGGCTCGCCG ACGTACTCGC CGGGGTGCCT GACCCGCGGG ACCCGCGAGG GATACGTCAC CGGCTCCCGG TGATCCTGGG ACTGTCCGCC GCAGCGGTCG CCGCGGGGGA GAAGTCGGTG GAGGAGATCG CGGCCTGGGC TGCGCACGCC CCGACGCAGG TCCTGACCGC TCTCGGGGCG CGGGTCCATC CGGTGACCGG GCAGCCGCAG GCACCGTCGG TGGACACGAT GATCCGGGTC CTGTCCGCGG TGGACAGCTC GGCGCTGGCG AGGGCGGTCG GGATGTTCGC CGCGGCCCGC GCCCGCCAGG CCCGTGGTGG TGGGCGGCGG GTGGTCGCGG TCGACGGGAA GACCCTGCGT GGCGCGGCTG GGCCTGAGGG GCGGGCACCG CACCTGCTCG CGGTCGCCGA ACACGGCACG GGTGTGGTGC TCGCCGAGCA TGAGGTCGGC GCGAAGACGA ACGAGGTCAC CGCGTTCGCA CCGCTGCTCC GCGAACTGCA TTCCCATGAT CCGCTGGATG GGGTGGTGGT GACCGCTGAT GCGTTGCACA CGACCCGCGC CCACGCCGAC CTGATCGTCA CCGAGCTGGG AGCGCACTTC GTGTTCACGG TGAAGGCGAA CACCCCGGCG TTGTCGGTCG ACTGCCACCA GGCGACCGAC TGGACGAAGA TCCCGATCGG GCACAGCGCC GAGGGCAGGG CCCATGGACG GTTCGAACGA CGCACCATCC AGCTGGCCCA GGCCAGCGAG GCGATCCGTG CCCGCTATCC CCATGCCCGC ACCGTGGCGC GGATCCGCCG TCATGTCCGG CGGACCGTGA CCACCGGCAC GGGCCGGGCC CGGGTCACCC GGACGATCCC GAGCACTGTC ACGGTCCACG TCCTGACGAG CCTCACCCTC GACGCGGTCA CACCCGCTGA TCTCGCGGGC TACGCCCGAG GGCATTGGAC GATCGAGAAC AAGGTCCACT GGGTGCGCGA TGTGACGTTC CGTGAGGATG CCTCGCGGGT TCGGACCGGC CCACTGCCCC GCATCATGAC CACGCTCCGT AACCTGATCA TCGGGCTGAT TCGCCTCGCT GGCCATAACC GCATCGCCCC GACCATCCGC AGAATCCGAC ACGACAACGC CCTGCTCCTG GCCATCCTCA CTCTCGACAA CCCCGCTGAC CTGCATCAAT GA
|
Protein sequence | MPVQSRMPVT PTDLGQAGSG QLVRMRRSLR VLGAHGGEVQ GLADVLAGVP DPRDPRGIRH RLPVILGLSA AAVAAGEKSV EEIAAWAAHA PTQVLTALGA RVHPVTGQPQ APSVDTMIRV LSAVDSSALA RAVGMFAAAR ARQARGGGRR VVAVDGKTLR GAAGPEGRAP HLLAVAEHGT GVVLAEHEVG AKTNEVTAFA PLLRELHSHD PLDGVVVTAD ALHTTRAHAD LIVTELGAHF VFTVKANTPA LSVDCHQATD WTKIPIGHSA EGRAHGRFER RTIQLAQASE AIRARYPHAR TVARIRRHVR RTVTTGTGRA RVTRTIPSTV TVHVLTSLTL DAVTPADLAG YARGHWTIEN KVHWVRDVTF REDASRVRTG PLPRIMTTLR NLIIGLIRLA GHNRIAPTIR RIRHDNALLL AILTLDNPAD LHQ
|
| |