Gene Francci3_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3597 
Symbol 
ID3904151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4296046 
End bp4299714 
Gene Length3669 bp 
Protein Length1222 aa 
Translation table11 
GC content72% 
IMG OID637880918 
Productchromosome segregation protein SMC 
Protein accessionYP_482678 
Protein GI86742278 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG1196] Chromosome segregation ATPases 
TIGRFAM ID[TIGR02168] chromosome segregation protein SMC, common bacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATCTCA AGAACCTGAC CCTGCGGGGC TTCAAGTCCT TTGCGAGCTC GACCTCCCTG 
CATCTCGAGC CGGGAATAAC CTGCGTGGTC GGGCCCAACG GGTCGGGCAA GAGTAACGTC
GTTGACGCCA TCGCCTGGGT ACTGGGGGAG CAGGGTGCCA AGGCCCTGCG TGGCGGCACT
ATGTCCGATG TCATCTTCGC GGGCACGCCC GCTCGCCCGG CCTTGGGGCG CGCCGAAGTA
CTCCTGACGA TCGACAACTC GGACGGCGCG CTCCCCATCG AGTACACCGA GGTGACCATC
GGGCGACTGA TGTTCCGCAG CGGCGAGAGC GAATACACGA TCAACGGTAC CGGTTGTCGC
CTGCTGGATA TCCAGGAACT GATGAGCGAC TCGGGTATCG GCCGAGAGTT GCACGTGATC
GTCGGCCAGG GCCAGCTCGA CGCGGTCCTG CACGCGCGCC CGGAGGATCG CCGTGCGTTC
ATCGAGGAGG CCGCCGGCGT CCTCAAACAC CGTAAACGCA AGGAGAAGGC ACTCCGCAAG
CTCGAGGCGA TGTCGGCGAA CCTCACCCGC CTCACCGATC TGTCCGCCGA ACTGCGTCGG
CAGTTAGGGC CGCTCGGTCG GCAGGCGGAG ATCGCCCGTA AGGCCGGTGT GATTCAGGCT
TCGCTACGGG ACGCCCGCCT GCGCCTTCTC GCCGACGATC TCCATCGTGC CCAGGTCGCG
ATCACCTCCG ACCTCGCGGA CGAGGAGGCA TTGCGGGCGC GGCTGACGAC CACCGAGGCC
GCTCACGCCG CGGCCGCCCG CCGTGAGGAG CAGCTGCAGG CCGACCTGAC GGCGATCATT
CCGCGCGCCA CCGGCGCCCA GGAGACCTGG TACGCCCTGG CATCCCTGCG GGAACGGCTG
CGGGGCACCC GTTCCCTCGC CGTCGAACGG GGTCGCCTGT TGCGTGCCGC GACCGACGAC
GTACGTGGGC GCCGCGACCC CGAGGAACTG GAACAGGAGG CCACGGCGGT CCGGGAGCAG
GAAATCGCCC TCACCGAGCG GCTGGAGCGT GATCGGGATC TGCTCGCCGA GGTGGTCACC
AGGCGTGCCG ATCTCGAGGC CGCGCTCGCT CAGGAGGAGA AGGAACTGAT CGCCGCGGCA
CGCGCGGCGT CTTTTCGCCG TGAGGAGATC GCGCGCCTTG CCGGCCAGGT GGAGGCCGCC
AGGTCCCGTG CCACCAGTGC CGAGAACGAG ATAGCACGGA CGACCGAGGC ACTCGACGCC
GCGCGGGAGC GGGAGACCGA GACCTCGGCG TCACGTTCGG CCCTCGAGAT CGAACTCTCT
CGGATGGAGA GCGCACGGGA GGATCTCGCC GAGCAGCACA GCGCGGCCGT GGCAGTGCAT
GCCGCTGCCG CCGAGCGGCT CGAGGTCCTG CGCGGTGAGG AGCGTAGCGC CGAACGTGAC
CGCGCATCCT GGGCCGCGCG ATGCGACGCG CTGCACCTCT CCCTCTCGCC GGCCGACGGA
GCCGCGGCAC TGTTGAGGGC TGCTACCGGC GCGCAGACCG GGCAGACCGC CTCGAACACG
GCGGAGGACT CGGCTCCGGC ACTGGATCCG ACCCTCGAGG TGATCGGCCG GCTGGCCGGC
GTCCTGAGCG TGACCGCCGG CGCGGAGGCG GCCATCGCCG CGGCGCTGGG CCCGGCGGCG
GACGCCCTGC TCGTCGGCAC GTCGGGGGAT GCCATGGCGG CCTTTGCGTG GCTGCGGGAA
ACGGATGCCG GCCGAGCAGC GCTGGTGGCC GCCGTCACCG TCGCGGATGA GCCTGACGCC
TGCGCCGGCC CTGACGCCTG CGCCGGCCCT GACGGCAGCG CGGCGATCGT CCCGGGGGCG
GCCAGCGCCG ACGGTCCGCT CCCGGCCGGT GCCGTTGCCG CGCTGGACCT GGTGGAGATC
GGTGACGATC GGTTCCGTAC CGCCGTGTCG TCCCTGCTCG CCCGGACTGT CGTGGTCGAG
GATCTGGCGG CGGCCGAGCG GGCCACGGCG TTGCGCCCCG ACCTGCGGGT GGTCACCCGC
GCCGGCGACG TCATCGGTGC CCCCCTCTCT ATCGGTGGCA GCGCGAACCC GCCGTCCGCG
ATCGAACTGC AGGCCGCGGC GGACGAGGCC GAGGCCGGCG TGGCGGAGGC GACCCGTCGC
GTCGAGTCCG CCCACGAGGC GTTCGAGCCC GCGCGTGCCG AGGTGACCCG GGCCCGTACC
GCCATCGACG CGGCACTGGC GGCGCTGCAC GGGACCGATG CCCGGCACCG GGCGCTGTCC
GAGCAGATGG CGCGGCTCGA TCGTTCGGGA GGGGCCGCGG CATCCGAGAT CGCCCGGTTG
GAGGGCGCCC GATCACGGGC CGAAACAGCC CGTGATCGGG CATACGGAGC GTTGACCGAG
CTCGAGGCGT CGCTCGCCGC GACATCGGAG CAGCCGGAGG CGGGGGAGCG GGCTCCGGAT
GAACGAGACC GGCTCGTCGC GGCTACCTCC GCCGTGCGCG CCGCCGAGGT CGAGGCCCGG
TTGGCCGTCC GTACCAGCGA GGAGCGGGTG CGAGGCCTGC AGGGACGCGC CGACGGACTG
ATCCGCGCGG CGGCGAACGA GCGGGCGGCA CGGGCCGCGG CGGCCCGTCG GCACGAGGTC
CGCGAGCGGC AGGCGGCCGT CGCGACGGCG ATGGGGGATG CGGCTCAGGT CGCTCTCGAC
CGACTGGACC ATTCGCTGGC TCGCGCCGCG GCGGGGCGGG AGGAGGCCGA CGCACTGCGC
AAGGCCGCCG AGACGGAGCT GGTCGGGGTG CGTGACCAGG GGCGCGCTCT GGCCACCGAA
CTCGCGGCGC TGCGCGACGC CGCCCATCGC GACGAGCTGG CCCGAGCGGA GAAGCGGCTG
CGGGTGGAGA CCCTGGAAGC CAAGGCTCTG GAGGAGCACG GTATCGACGC CGATGACCTG
GTGGCCGAGT TCGGCCCGGA CACGCTGGTC CCGCCGGATG AGCCCGACGG CACCGCGGCG
CCGTTCGATC GAGCCGAGCA GAGCGCCCGG GCCGCCACCG CGGAGAAGCA GCTGGCCCGG
CTGGGGCGGG TGAATCCCCT GGCCCTCGAG GAGTTCGCCG CGCTTCAGGA GCGGGCCGCG
TTCCTGTCCG CGCAGCTCGA AGACATCAAG AGCACCCGCC GGGATCTTCT GCTGGTCGTC
GAGGAGGTTG ATCTGCGGGT GCGTGAGGTC TTCGCCGTCG CGTTCGCGGA CACCGCCCGT
GAGTTCGAGA TCGTCTTCTC GACGCTGTTC CCGGGCGGTG AGGGCCGGCT GGTGCTCACC
GATCCTGATG ACATGCTCAC CACGGGCATC GAGGTCGAGG CGAGGCCGCC GGGAAAAAAG
GTGAAGCGTC TGTCGCTGTT GTCCGGCGGG GAGCGTTCAC TGACCGCACT CGCTCTCCTG
CTCGCGATAT TCCGGGCCCG CCCTTCGCCC TTCTACGTGC TCGACGAGGT CGAGGCAGCT
CTGGACGACC GCAATCTCGG CCGACTGCTC GAGGCCGTCG AGGGTCTGCG TGAGAAGTCG
CAGCTGATCA TCATCACCCA CCAGAAGCGG ACGATGGAGA TCGCTGACGC CCTGTACGGC
GTGGCCATGC GGGGCGACGG GGTGACGACA GTGATCAGTC AGCGGCTGCG GGAGCGGGCG
GCGCTCTGA
 
Protein sequence
MHLKNLTLRG FKSFASSTSL HLEPGITCVV GPNGSGKSNV VDAIAWVLGE QGAKALRGGT 
MSDVIFAGTP ARPALGRAEV LLTIDNSDGA LPIEYTEVTI GRLMFRSGES EYTINGTGCR
LLDIQELMSD SGIGRELHVI VGQGQLDAVL HARPEDRRAF IEEAAGVLKH RKRKEKALRK
LEAMSANLTR LTDLSAELRR QLGPLGRQAE IARKAGVIQA SLRDARLRLL ADDLHRAQVA
ITSDLADEEA LRARLTTTEA AHAAAARREE QLQADLTAII PRATGAQETW YALASLRERL
RGTRSLAVER GRLLRAATDD VRGRRDPEEL EQEATAVREQ EIALTERLER DRDLLAEVVT
RRADLEAALA QEEKELIAAA RAASFRREEI ARLAGQVEAA RSRATSAENE IARTTEALDA
ARERETETSA SRSALEIELS RMESAREDLA EQHSAAVAVH AAAAERLEVL RGEERSAERD
RASWAARCDA LHLSLSPADG AAALLRAATG AQTGQTASNT AEDSAPALDP TLEVIGRLAG
VLSVTAGAEA AIAAALGPAA DALLVGTSGD AMAAFAWLRE TDAGRAALVA AVTVADEPDA
CAGPDACAGP DGSAAIVPGA ASADGPLPAG AVAALDLVEI GDDRFRTAVS SLLARTVVVE
DLAAAERATA LRPDLRVVTR AGDVIGAPLS IGGSANPPSA IELQAAADEA EAGVAEATRR
VESAHEAFEP ARAEVTRART AIDAALAALH GTDARHRALS EQMARLDRSG GAAASEIARL
EGARSRAETA RDRAYGALTE LEASLAATSE QPEAGERAPD ERDRLVAATS AVRAAEVEAR
LAVRTSEERV RGLQGRADGL IRAAANERAA RAAAARRHEV RERQAAVATA MGDAAQVALD
RLDHSLARAA AGREEADALR KAAETELVGV RDQGRALATE LAALRDAAHR DELARAEKRL
RVETLEAKAL EEHGIDADDL VAEFGPDTLV PPDEPDGTAA PFDRAEQSAR AATAEKQLAR
LGRVNPLALE EFAALQERAA FLSAQLEDIK STRRDLLLVV EEVDLRVREV FAVAFADTAR
EFEIVFSTLF PGGEGRLVLT DPDDMLTTGI EVEARPPGKK VKRLSLLSGG ERSLTALALL
LAIFRARPSP FYVLDEVEAA LDDRNLGRLL EAVEGLREKS QLIIITHQKR TMEIADALYG
VAMRGDGVTT VISQRLRERA AL