Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2257 |
Symbol | |
ID | 3905025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2634954 |
End bp | 2636708 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637879588 |
Product | hypothetical protein |
Protein accession | YP_481354 |
Protein GI | 86740954 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.197929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00613294 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCATCC AGTACGTGCG GGATGCGACG CCCTCCACCG GGGCGGACGA CCATCACGCG GCCGGAGGGA TGTTGTCCCC GACGAACCCG CCGTCCCCCG CCTCAACCCC GGCCCAGCCC CGGCTGCGCG CCCTGTACCT GTCCGAGCTG GAGACGTTCG AGACCTACGA GACGCACAGC GCCACGCTGC ACCTGGCCGC CGATCGCGTG TACAAGCGCA AGAAGCCGGT GAACCTGGGC TTCCTCGACT TCACCGACCG CCGCACCCGG GAGTCGGTCT GCCGGTCGGA GGTCGCGCTC AACCGCCGGC TGGCCCCCGA CGTCTACCTG GGTGTCGCCG ATCTCCTCGA CGACACCGGT GAGGTGATCG ACCACCTGGT CGTGATGCGG CGGATGCCGG CGAGCCGGCG GCTGTCCACC CTCGTCCGCC GGCGCAGCCG GGTCGGCCCG GCACTGCGCA CGGTCGCCCG GGCGCTGGCG GTGTTCCACC AGCGGTGCGA GACCTCACCG GAGATCGCGG TGGCGGGGCA GCGGGCGACC CTGGAGGGGC TGTGGCGGGA GGGCCTGGAA GGCATCTCCC CCTACCGCGG CACCCTGCTG GACGCGGCGG TGGTCGACGA GATCGGCGAA CTGGCGCTGC GCTACCTGGC CGGCCGGGAG ACCCTGCTCG GCGATCGGGT GCGCGCCGGG TGGATCCGCG ACGGGCACGG CGACCTGCTC GCCGACGACA TCTACTGCCT CGGCGACGGA CCCCGTATCC TCGACTGCAT CGAGTTCGAC CCGCGGCTGC GCTTCGGTGA CGTCCTCGGC GACGTCGCGT TCCTGGCGAT GGATCTGGAA CGCCTCGGCG CGCCGGAGGA GGCCGCCGAG TTCCTCGACG CCTACCGGGA GTTCAGCGGC GAGGTGCACC CGCGGTCGTT GCAGCATCTC TACGTCGCCT ACCGGGCGTT CGTCCGGGCG AAGGTGACCT GTATCCGGGG CGGGCAGGGT GATCCCGACG CGGCCGAGGA GGCCCGCCGG CTGCTGGCCG TCGCCCACCG TCATCTGCGG GCTGGCCGGG TCCAGCTCGT CGTGGTCGGC GGGCTGCCCG GGACGGGCAA GACGACCCTG GCCGGCCGGC TGGCCGGGGT CGGTGACGGC TGGGTGCTGC TGCGCTCCGA CGTGATCCGC CAGGAGCTGA CCGGGATGCC CCTGCGTGAG GGCGGGCCGG CCGCGGACAC CACCGCCGGC GGGTATGCCA GTGCCCTGCG CAACGCCAGC GGCACCGCCA CGAGAACCGG GGCCCGCCGC GACGCCGGTA CCGGCGCGGC CGCGACCTCC GACCCCGCGA CCTCCGACCC CGCGGACGGC GACCCCGCGA CCTCCGACCC GCGGTTCGGC ACCGGGCGCT ACGCCCCCGA GATCACCGAC GCGACGTACG CCGAGATGCT GCGCCGCGCC GAGGCGGCTC TCGCCCGCGG GGAACGGGTG GTGCTGGACG CATCCTGGTC GAGCGCGCGT CACCGCCGGG CCGCCGCCGA GCTCGCCGCA AGCGTCTGCG CCGACCTGGT GGAGCTGCAC TGCGTGACGG CACCGGAGGT GGCGGCCGCC CGGATCGGGC GCCGCGCCGC CGCGGGCACC GACCCGTCGG AGGCGACGAT GGCCATCCAC CGGGCGATGG CTGCCCGTGC CGACCCCTGG CCGTCGGCGA CGGTGGTACG CACCGCCGTC CCGGTCGCCG AGGCCCTGCA GACGGTCCTC GCCCACCTCG ACTGA
|
Protein sequence | MTIQYVRDAT PSTGADDHHA AGGMLSPTNP PSPASTPAQP RLRALYLSEL ETFETYETHS ATLHLAADRV YKRKKPVNLG FLDFTDRRTR ESVCRSEVAL NRRLAPDVYL GVADLLDDTG EVIDHLVVMR RMPASRRLST LVRRRSRVGP ALRTVARALA VFHQRCETSP EIAVAGQRAT LEGLWREGLE GISPYRGTLL DAAVVDEIGE LALRYLAGRE TLLGDRVRAG WIRDGHGDLL ADDIYCLGDG PRILDCIEFD PRLRFGDVLG DVAFLAMDLE RLGAPEEAAE FLDAYREFSG EVHPRSLQHL YVAYRAFVRA KVTCIRGGQG DPDAAEEARR LLAVAHRHLR AGRVQLVVVG GLPGTGKTTL AGRLAGVGDG WVLLRSDVIR QELTGMPLRE GGPAADTTAG GYASALRNAS GTATRTGARR DAGTGAAATS DPATSDPADG DPATSDPRFG TGRYAPEITD ATYAEMLRRA EAALARGERV VLDASWSSAR HRRAAAELAA SVCADLVELH CVTAPEVAAA RIGRRAAAGT DPSEATMAIH RAMAARADPW PSATVVRTAV PVAEALQTVL AHLD
|
| |