Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0017 |
Symbol | |
ID | 3903592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 22428 |
End bp | 25316 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637877347 |
Product | CRISPR-associated helicase Cas3 family protein protein |
Protein accession | YP_479140 |
Protein GI | 86738740 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCTGT CCATTATATC TGGGATGGGA GTCTTCATGG ATGATGAGTT CTTCGGTATG ATATGGGGGA AGTCGGCGGA GAAGGCGGGG GGATCGATGC ACCTGCTCCT CGGCCATCTT CTGGATACGG CGGCGGTGGG CGAACTTGTC TGGGATCGTT TCCTCGCATC CACCATTCGG GATCGTCTGG ATGATTGCAG CGATGGTCGC GGGCGGTCTC TGTTCGCGCT GTTGTGCGGC CTGCATGATG TCGGTAAGGC CACGCCGGCC TTTCAGATGA AGGACGAGGG TTTGGCGCAG CGCGTTCGGG CTGCCGGGCT TGGCTGGCGA GGGGTGACTC CTCAGCAGGG CCGGCAGTGG CATCACGCGC GGGCGGGGGC TGTCATCGTC CGCAAGTATC TCCCGCAGGT CGGTTGGAGT CGGCCGGGGT GTGACTGGGT GTGGCCGCTG GTCGCGGGCC ATCACGGCCT GATACCGGAC CGTGGCCGTC TTGTGCACAA GCCCGCCGTC CACGGCGCCG GACCGTGGCT CGACGTCCAA CGTGCGTTCG TCGACCGGGT GGCCGGCGAC CTCAACGTTG ACCTGGCTTC CTTTTCGGAG CTGCGAACAC CATCCCGCGG AGGTCAGCTT GCCCTATCCG GGATGATCAT CATGGCAGAC TGGGTCGCGA GCGACAAAGA GCATTTCGGT GGGCTGTCGG ATCTGGCGGA GATCTCGATG AAGGGCTCGC GGGAGCGTGC ACAGCGAGCC TGGGCACAGT TGGGTCTGCG TGGCGGGTGG CGCTCCGACC GGCCCGCACC CGGCAACCAG TCTGACCTGG TGCACCACAG ATTCGGTAAG CCAGCCCGGC CGGCGCAGCG CGCCGCGGTC CAGGCCGTGC GGGAAATGCC CGGCCCCGGG CTACTGATCC TGGAAGCGCC GATGGGTGAG GGGAAGACGG AGGCGGCGCT CGCCGCCGCG GAGGTACTGG CCGGCAAGGT CGGTGCGGAT GGGGTGTTCG TCGGAATGCC GACGCAGGCG ACGAGCGATC CGATGTTCAG CCGGGTCCGC GGCTGGCTCA CCGCGGTGGA TCCCGAGGTT CCGATCGGCC TACTGCACGG CCGGGCTCGT TTCAACAAGG AGTGGGCGGC ACTTCGGTCG CAGGTCCGGT TCGTCGACGT GCATGACGAT CTCGACGAGT ATGGGATGGC CGACGATTTC GGTACGGGTA CGAGCGGCCC GGGTCGTCGC GACGCACCGC TGGCGGGCGC CGCGGCGGCC GCGGAGTGGT TCTTCGGATC GAAACGGGGG TTGCTCGCTG CGGTGACGGT CGGCACGGTG GATCACCTGC TGCAGGCGGC GACCCGGACG AAGCATGTGA TGCTACGGCA CGCCGGCCTG GCCGGCCGGG TCGTCATCCT TGACGAGGTA CACGCCTACG ACGTCTACAT GGCGCAGTTC CTGTTCGAGG CGTTGCGCTG GCTCGCCGAC ACCGGCGTGC CGGTCATCGT TCTGTCCGCG ACGCTGCCGC CGGTACTGCG TGCACAGCTC GCCGGCGCCT ATCTGCAGGG CGCCCTGCAG CGCCCCGATG TCGACCTCGC CGATCTGCCG CGGCCCACCG GCTACCCGAG CACCACCGCG GTGTATATGG CCGACGGGAA GCCCCGGTTT TCCGTGACGT CCCAGTCACC CTGGCGTGAG TCCGTGCCCG TTGCGGTAGA GATCCTGGCA GAGCCCACCG ACTTCGCCCC GGTGAGCATC GCCGCCGCTG TCTCGGCGGA GGTGAGCGAG GGCGGCTGCG CTCTCGTCGT GTGTAACACC GTCCAGCGGG CCCAGGTCGT CTACGCCGAG CTGCGGACGA CGTTCGGTGA TGACGTGGTG CTGCTGCACT CCCGGTTCAC CGCAGCCGAA CGCGCGAGCC GTACGGAAGA CGTCGTGGAC AAGCTCGGCC CCCCCGGACG GGAGAACGCC CAGGACCGGC CCGCGCGGCT CGTCGTCGTG GCCACCCAGG TGGCCGAACA GTCGTTCGAC GTCGATGTGG ACATCCTCGT CACCGATCTC GCCCCCATCG ACCTTCTGCT GCAACGGGTC GGCCGGCTGC ATCGGCATGA CCGGCCGACC GGGCAGCGAC CGGCCCGGCT TCGCGACCCG AAAGTGATCG TCAGCGGTCT GCGGCTGCCC GACGGGGCTG TGCCACTCTG GCCCGCCGGT AGCCGGGCCG TGTACGGGGA TCATCTACTG CTGCGCTCCG CCGCTCTCGT CGCCGAAGCC GCCACGGGTG GTGGATGGTC GATTCCCGCC GACGTGCCCG GCCTCGTCGC TCTCGGCTAC GGTGATGAGC CTCTCGGTCC GGCGCGCTGG GCTGATGCCG CGGCCGAGGC CAGGCAGGAA TGGGAGGCCA AGGAGCGTCG TCGCCGGGTG AACGCCACCG GATTTCTGCT TTCCGGCGAG GACGGTCTGG GCCTGACCAC GCTCGAGGGT CTGCACGACA AGGCGACCGC GCCGCTGGAA ACCGAGGAAC GCCTCACCGC GGTCGTCCGT GACAGCGACG AGTCCGTGGA GGTCGTCCTC GTCCGCCGCG GACCGGCCGG GTATCTCACG CTCGCGGGCC GATCGCTCGG GCACATCGGC GAAGTTGCCG TGTCTGACGA CGCGGTCCTC GAAGAGGTCG TCGGAGCCAC CATCCGCCTG CCCGCGAACA AGGAGATCAC CGCCGCGGCG CATGCCGAAC TGACTCCCCT GGCCGGCTGG GACGGCGACT CCTGGCTCCG CCGCACCCGC GCCCTGATTC TCGACGATCG CATGTCGGCG ATCCTCGGGG GGCGACACCT GACCTATGAC AACAAGCTCG GGCTGAGCCT CGCGCCTCGA CCGGCTGCTG ATACTCGACC GGCTGCTGGT AAAGGGTGA
|
Protein sequence | MGLSIISGMG VFMDDEFFGM IWGKSAEKAG GSMHLLLGHL LDTAAVGELV WDRFLASTIR DRLDDCSDGR GRSLFALLCG LHDVGKATPA FQMKDEGLAQ RVRAAGLGWR GVTPQQGRQW HHARAGAVIV RKYLPQVGWS RPGCDWVWPL VAGHHGLIPD RGRLVHKPAV HGAGPWLDVQ RAFVDRVAGD LNVDLASFSE LRTPSRGGQL ALSGMIIMAD WVASDKEHFG GLSDLAEISM KGSRERAQRA WAQLGLRGGW RSDRPAPGNQ SDLVHHRFGK PARPAQRAAV QAVREMPGPG LLILEAPMGE GKTEAALAAA EVLAGKVGAD GVFVGMPTQA TSDPMFSRVR GWLTAVDPEV PIGLLHGRAR FNKEWAALRS QVRFVDVHDD LDEYGMADDF GTGTSGPGRR DAPLAGAAAA AEWFFGSKRG LLAAVTVGTV DHLLQAATRT KHVMLRHAGL AGRVVILDEV HAYDVYMAQF LFEALRWLAD TGVPVIVLSA TLPPVLRAQL AGAYLQGALQ RPDVDLADLP RPTGYPSTTA VYMADGKPRF SVTSQSPWRE SVPVAVEILA EPTDFAPVSI AAAVSAEVSE GGCALVVCNT VQRAQVVYAE LRTTFGDDVV LLHSRFTAAE RASRTEDVVD KLGPPGRENA QDRPARLVVV ATQVAEQSFD VDVDILVTDL APIDLLLQRV GRLHRHDRPT GQRPARLRDP KVIVSGLRLP DGAVPLWPAG SRAVYGDHLL LRSAALVAEA ATGGGWSIPA DVPGLVALGY GDEPLGPARW ADAAAEARQE WEAKERRRRV NATGFLLSGE DGLGLTTLEG LHDKATAPLE TEERLTAVVR DSDESVEVVL VRRGPAGYLT LAGRSLGHIG EVAVSDDAVL EEVVGATIRL PANKEITAAA HAELTPLAGW DGDSWLRRTR ALILDDRMSA ILGGRHLTYD NKLGLSLAPR PAADTRPAAG KG
|
| |