Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3786 |
Symbol | |
ID | 3906071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4536417 |
End bp | 4537784 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637881113 |
Product | hypothetical protein |
Protein accession | YP_482866 |
Protein GI | 86742466 |
COG category | [S] Function unknown |
COG ID | [COG5282] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03624] putative hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0650027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.741402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGGCA ACTTCCCGTT CGGCTTCACG CCCGCTGGCG GCGGCGACGA GCCCCGGCCC CCCTTCGGTG GCGGCGCCCC GTTCTTTGCC GAGCTCGAAC GGCTGCTGTC CTGGCATGGC GGTCCGGTCA ACTGGGAGCT GGCCCGCCAG GTGGCCGTCA GGTCCCTCGG CGACACCGAC TCCGCGGTGT CCACGGCCCA GCGGGAGCCG GTGAGCGAGG CGATACGGAT CGCCGACGTC TGGCTCGACG CGGTGACCTC GCTGCCGGCC GGAGCCACGG CGGCGCAGGC CTGGTCGCGG CAGGAGTGGA TCGAGAAGAC CCTGCCGGTC TGGCAGACCC TCTGCGACCC GATCGCCACG AAGGTCGCCG AGGCGATGCG CACTGGGATC TCCAGCGGGC TGAACCACCT CGGTGGTGGC GGGATCGAGC TGCCGAGTGA GCTGCGCGGC GCGCTGCCCC CGGGACTCGA CCTCAGCGGG CTGATGGCCA CCGGCGGCCC GATCATGCAG ATGATGGACC GGGTCGGCGG GATGCTGTTC GGCGCCCAGG TGGGGCAGGC GATCGGCACC CTGGCCGCCG AGGTCGTCTC CTCCACTGAG GTCGGGCTGC CACTCGGACC GGTCGGGACC GCCGCCCTGA TCCCGGCGGG TGTCGTCGCC TTCGGTCAGG GCCTGGAGAT CCCCGAGGAC GAGGTCCGCA TCTACCTGGC GCTGCGGGAG GCCGCGTCCA GCCGGCTGTT CGCGCACGTC CCCTGGCTAC GCGCCCACGT CCTGGGCGCG GTGGAGGAGT ACGCCCGCGG CATCGCGGTG GATACCGAGG CCGTCGGACG GGTGATGCGG ATGGTCGACC CGACGGCGTT GATGAACCCT GAACGGCTGT CGGAGGCGCT CGGCGAGGAC GTGTTCAGCG ACGCCACCAC CCCCGAGCAG AAGGCCGCGC TCGCCCGTCT CGAACTGATC CTCGCGCTGA TCGAGGGCTG GGTCGATCAT GTGACGGACG CCGCGGCCAC CGGTCATCTG CCGGCCGCGG GCAAACTGCG CGAGATGGTG CGGCGCCGTC GCGCCGAAGG CGGTCCGGGC GAGCAGACCT TCGCCACGCT GGTCGGTCTC TCCCTGCGGC CACGCAAGCT GCGGGAGGCC GCCGCCCTGT GGGAGGCGGT ACGGGAGGCC CGGGGTCACG AGGGTCGCGA CGCGCTGTGG GCGCATCCCG ATCTGCTGCC CACCGCGGCA GATCTCGCCG CCCCGGAGGC GTTCGTGTCC GGCGCGTCGT CGAGCGCGCT CGACGACCCG ATCGGTGAAA TCGAGAAACT CCGCGGCGCC CCACCCGCCG ATGGCGGATC CGACGGGCCG GACAACCCGG ACAGGTAA
|
Protein sequence | MSGNFPFGFT PAGGGDEPRP PFGGGAPFFA ELERLLSWHG GPVNWELARQ VAVRSLGDTD SAVSTAQREP VSEAIRIADV WLDAVTSLPA GATAAQAWSR QEWIEKTLPV WQTLCDPIAT KVAEAMRTGI SSGLNHLGGG GIELPSELRG ALPPGLDLSG LMATGGPIMQ MMDRVGGMLF GAQVGQAIGT LAAEVVSSTE VGLPLGPVGT AALIPAGVVA FGQGLEIPED EVRIYLALRE AASSRLFAHV PWLRAHVLGA VEEYARGIAV DTEAVGRVMR MVDPTALMNP ERLSEALGED VFSDATTPEQ KAALARLELI LALIEGWVDH VTDAAATGHL PAAGKLREMV RRRRAEGGPG EQTFATLVGL SLRPRKLREA AALWEAVREA RGHEGRDALW AHPDLLPTAA DLAAPEAFVS GASSSALDDP IGEIEKLRGA PPADGGSDGP DNPDR
|
| |