Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3597 |
Symbol | |
ID | 3904151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4296046 |
End bp | 4299714 |
Gene Length | 3669 bp |
Protein Length | 1222 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880918 |
Product | chromosome segregation protein SMC |
Protein accession | YP_482678 |
Protein GI | 86742278 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR02168] chromosome segregation protein SMC, common bacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATCTCA AGAACCTGAC CCTGCGGGGC TTCAAGTCCT TTGCGAGCTC GACCTCCCTG CATCTCGAGC CGGGAATAAC CTGCGTGGTC GGGCCCAACG GGTCGGGCAA GAGTAACGTC GTTGACGCCA TCGCCTGGGT ACTGGGGGAG CAGGGTGCCA AGGCCCTGCG TGGCGGCACT ATGTCCGATG TCATCTTCGC GGGCACGCCC GCTCGCCCGG CCTTGGGGCG CGCCGAAGTA CTCCTGACGA TCGACAACTC GGACGGCGCG CTCCCCATCG AGTACACCGA GGTGACCATC GGGCGACTGA TGTTCCGCAG CGGCGAGAGC GAATACACGA TCAACGGTAC CGGTTGTCGC CTGCTGGATA TCCAGGAACT GATGAGCGAC TCGGGTATCG GCCGAGAGTT GCACGTGATC GTCGGCCAGG GCCAGCTCGA CGCGGTCCTG CACGCGCGCC CGGAGGATCG CCGTGCGTTC ATCGAGGAGG CCGCCGGCGT CCTCAAACAC CGTAAACGCA AGGAGAAGGC ACTCCGCAAG CTCGAGGCGA TGTCGGCGAA CCTCACCCGC CTCACCGATC TGTCCGCCGA ACTGCGTCGG CAGTTAGGGC CGCTCGGTCG GCAGGCGGAG ATCGCCCGTA AGGCCGGTGT GATTCAGGCT TCGCTACGGG ACGCCCGCCT GCGCCTTCTC GCCGACGATC TCCATCGTGC CCAGGTCGCG ATCACCTCCG ACCTCGCGGA CGAGGAGGCA TTGCGGGCGC GGCTGACGAC CACCGAGGCC GCTCACGCCG CGGCCGCCCG CCGTGAGGAG CAGCTGCAGG CCGACCTGAC GGCGATCATT CCGCGCGCCA CCGGCGCCCA GGAGACCTGG TACGCCCTGG CATCCCTGCG GGAACGGCTG CGGGGCACCC GTTCCCTCGC CGTCGAACGG GGTCGCCTGT TGCGTGCCGC GACCGACGAC GTACGTGGGC GCCGCGACCC CGAGGAACTG GAACAGGAGG CCACGGCGGT CCGGGAGCAG GAAATCGCCC TCACCGAGCG GCTGGAGCGT GATCGGGATC TGCTCGCCGA GGTGGTCACC AGGCGTGCCG ATCTCGAGGC CGCGCTCGCT CAGGAGGAGA AGGAACTGAT CGCCGCGGCA CGCGCGGCGT CTTTTCGCCG TGAGGAGATC GCGCGCCTTG CCGGCCAGGT GGAGGCCGCC AGGTCCCGTG CCACCAGTGC CGAGAACGAG ATAGCACGGA CGACCGAGGC ACTCGACGCC GCGCGGGAGC GGGAGACCGA GACCTCGGCG TCACGTTCGG CCCTCGAGAT CGAACTCTCT CGGATGGAGA GCGCACGGGA GGATCTCGCC GAGCAGCACA GCGCGGCCGT GGCAGTGCAT GCCGCTGCCG CCGAGCGGCT CGAGGTCCTG CGCGGTGAGG AGCGTAGCGC CGAACGTGAC CGCGCATCCT GGGCCGCGCG ATGCGACGCG CTGCACCTCT CCCTCTCGCC GGCCGACGGA GCCGCGGCAC TGTTGAGGGC TGCTACCGGC GCGCAGACCG GGCAGACCGC CTCGAACACG GCGGAGGACT CGGCTCCGGC ACTGGATCCG ACCCTCGAGG TGATCGGCCG GCTGGCCGGC GTCCTGAGCG TGACCGCCGG CGCGGAGGCG GCCATCGCCG CGGCGCTGGG CCCGGCGGCG GACGCCCTGC TCGTCGGCAC GTCGGGGGAT GCCATGGCGG CCTTTGCGTG GCTGCGGGAA ACGGATGCCG GCCGAGCAGC GCTGGTGGCC GCCGTCACCG TCGCGGATGA GCCTGACGCC TGCGCCGGCC CTGACGCCTG CGCCGGCCCT GACGGCAGCG CGGCGATCGT CCCGGGGGCG GCCAGCGCCG ACGGTCCGCT CCCGGCCGGT GCCGTTGCCG CGCTGGACCT GGTGGAGATC GGTGACGATC GGTTCCGTAC CGCCGTGTCG TCCCTGCTCG CCCGGACTGT CGTGGTCGAG GATCTGGCGG CGGCCGAGCG GGCCACGGCG TTGCGCCCCG ACCTGCGGGT GGTCACCCGC GCCGGCGACG TCATCGGTGC CCCCCTCTCT ATCGGTGGCA GCGCGAACCC GCCGTCCGCG ATCGAACTGC AGGCCGCGGC GGACGAGGCC GAGGCCGGCG TGGCGGAGGC GACCCGTCGC GTCGAGTCCG CCCACGAGGC GTTCGAGCCC GCGCGTGCCG AGGTGACCCG GGCCCGTACC GCCATCGACG CGGCACTGGC GGCGCTGCAC GGGACCGATG CCCGGCACCG GGCGCTGTCC GAGCAGATGG CGCGGCTCGA TCGTTCGGGA GGGGCCGCGG CATCCGAGAT CGCCCGGTTG GAGGGCGCCC GATCACGGGC CGAAACAGCC CGTGATCGGG CATACGGAGC GTTGACCGAG CTCGAGGCGT CGCTCGCCGC GACATCGGAG CAGCCGGAGG CGGGGGAGCG GGCTCCGGAT GAACGAGACC GGCTCGTCGC GGCTACCTCC GCCGTGCGCG CCGCCGAGGT CGAGGCCCGG TTGGCCGTCC GTACCAGCGA GGAGCGGGTG CGAGGCCTGC AGGGACGCGC CGACGGACTG ATCCGCGCGG CGGCGAACGA GCGGGCGGCA CGGGCCGCGG CGGCCCGTCG GCACGAGGTC CGCGAGCGGC AGGCGGCCGT CGCGACGGCG ATGGGGGATG CGGCTCAGGT CGCTCTCGAC CGACTGGACC ATTCGCTGGC TCGCGCCGCG GCGGGGCGGG AGGAGGCCGA CGCACTGCGC AAGGCCGCCG AGACGGAGCT GGTCGGGGTG CGTGACCAGG GGCGCGCTCT GGCCACCGAA CTCGCGGCGC TGCGCGACGC CGCCCATCGC GACGAGCTGG CCCGAGCGGA GAAGCGGCTG CGGGTGGAGA CCCTGGAAGC CAAGGCTCTG GAGGAGCACG GTATCGACGC CGATGACCTG GTGGCCGAGT TCGGCCCGGA CACGCTGGTC CCGCCGGATG AGCCCGACGG CACCGCGGCG CCGTTCGATC GAGCCGAGCA GAGCGCCCGG GCCGCCACCG CGGAGAAGCA GCTGGCCCGG CTGGGGCGGG TGAATCCCCT GGCCCTCGAG GAGTTCGCCG CGCTTCAGGA GCGGGCCGCG TTCCTGTCCG CGCAGCTCGA AGACATCAAG AGCACCCGCC GGGATCTTCT GCTGGTCGTC GAGGAGGTTG ATCTGCGGGT GCGTGAGGTC TTCGCCGTCG CGTTCGCGGA CACCGCCCGT GAGTTCGAGA TCGTCTTCTC GACGCTGTTC CCGGGCGGTG AGGGCCGGCT GGTGCTCACC GATCCTGATG ACATGCTCAC CACGGGCATC GAGGTCGAGG CGAGGCCGCC GGGAAAAAAG GTGAAGCGTC TGTCGCTGTT GTCCGGCGGG GAGCGTTCAC TGACCGCACT CGCTCTCCTG CTCGCGATAT TCCGGGCCCG CCCTTCGCCC TTCTACGTGC TCGACGAGGT CGAGGCAGCT CTGGACGACC GCAATCTCGG CCGACTGCTC GAGGCCGTCG AGGGTCTGCG TGAGAAGTCG CAGCTGATCA TCATCACCCA CCAGAAGCGG ACGATGGAGA TCGCTGACGC CCTGTACGGC GTGGCCATGC GGGGCGACGG GGTGACGACA GTGATCAGTC AGCGGCTGCG GGAGCGGGCG GCGCTCTGA
|
Protein sequence | MHLKNLTLRG FKSFASSTSL HLEPGITCVV GPNGSGKSNV VDAIAWVLGE QGAKALRGGT MSDVIFAGTP ARPALGRAEV LLTIDNSDGA LPIEYTEVTI GRLMFRSGES EYTINGTGCR LLDIQELMSD SGIGRELHVI VGQGQLDAVL HARPEDRRAF IEEAAGVLKH RKRKEKALRK LEAMSANLTR LTDLSAELRR QLGPLGRQAE IARKAGVIQA SLRDARLRLL ADDLHRAQVA ITSDLADEEA LRARLTTTEA AHAAAARREE QLQADLTAII PRATGAQETW YALASLRERL RGTRSLAVER GRLLRAATDD VRGRRDPEEL EQEATAVREQ EIALTERLER DRDLLAEVVT RRADLEAALA QEEKELIAAA RAASFRREEI ARLAGQVEAA RSRATSAENE IARTTEALDA ARERETETSA SRSALEIELS RMESAREDLA EQHSAAVAVH AAAAERLEVL RGEERSAERD RASWAARCDA LHLSLSPADG AAALLRAATG AQTGQTASNT AEDSAPALDP TLEVIGRLAG VLSVTAGAEA AIAAALGPAA DALLVGTSGD AMAAFAWLRE TDAGRAALVA AVTVADEPDA CAGPDACAGP DGSAAIVPGA ASADGPLPAG AVAALDLVEI GDDRFRTAVS SLLARTVVVE DLAAAERATA LRPDLRVVTR AGDVIGAPLS IGGSANPPSA IELQAAADEA EAGVAEATRR VESAHEAFEP ARAEVTRART AIDAALAALH GTDARHRALS EQMARLDRSG GAAASEIARL EGARSRAETA RDRAYGALTE LEASLAATSE QPEAGERAPD ERDRLVAATS AVRAAEVEAR LAVRTSEERV RGLQGRADGL IRAAANERAA RAAAARRHEV RERQAAVATA MGDAAQVALD RLDHSLARAA AGREEADALR KAAETELVGV RDQGRALATE LAALRDAAHR DELARAEKRL RVETLEAKAL EEHGIDADDL VAEFGPDTLV PPDEPDGTAA PFDRAEQSAR AATAEKQLAR LGRVNPLALE EFAALQERAA FLSAQLEDIK STRRDLLLVV EEVDLRVREV FAVAFADTAR EFEIVFSTLF PGGEGRLVLT DPDDMLTTGI EVEARPPGKK VKRLSLLSGG ERSLTALALL LAIFRARPSP FYVLDEVEAA LDDRNLGRLL EAVEGLREKS QLIIITHQKR TMEIADALYG VAMRGDGVTT VISQRLRERA AL
|
| |