Gene Francci3_2408 details


Gene Information

Locus tag: Francci3_2408
Symbol:
ID: 3906391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2791626
End bp: 2795117
Gene Length: 3492 bp
Protein Length: 1163 aa
Translation table: 11
GC content: 71%
IMG OID: 637879738
Product: SNF2-related
Protein accession: YP_481504
Protein GI: 86741104
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
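
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record lists the gene as unspliced). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 2791626
END_BP = 2795117


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3492, matches "Gene Length"
print(protein_length(length_bp))  # 1163, matches "Protein Length"
```

Under these assumptions both derived values agree with the lengths reported in the table.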


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.644608
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0402994
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCTCCG TGTCGTGGCA GGGACCCGGC ACCGCCGGTC TCCGGACAGC CGTCAGCGAA 
CGTTCGTATG CCAGAGGTGT GCACTACGCC CAGCAGAGCG CGGTCACGAG GATCGACTGG
GACCCCGACG AGAACACCCT CCAGGGCCGG GTGCGTGGAA GCGGCGGCGA GGTCTACGCG
ACGAACGCGC AGTTCTCGGG TGACGGTCCG TCCTCTTGGA CGTTCCAGTA CGGCTCGTGC
ACCTGCCCGG TGGGCGCCGA CTGCAAGCAC GTCGTGGCGC TGGTGCTGAC GGCCACGACA
GGCGCGACGG CCACGGCGAG CGCGGCAACG GGTGCGGCGG CCCCCGGCCC ATCGGCGCGG
CGGTCCGGGT CCTCCCCGCG GTCGGCTCGG CCCCGGCCGG CTCAGCCGCG GCCGTCCCGG
TCGGCTGCCT GGACCGAGGA CCTGGGAGAG CTGCTGGGTT CCTCGCCCGC CGTCGGCGGC
GGCCATGGCG GGGCCGCCGG GACGACGCCG TTGGCCGTCG AGCTGAGCTT CACGGCGGAT
CCGCCGGCCC ACGGCGGCGG GTCCGAGCTG GGCCTGCGGG TCACGGCCCG GCTGGTGCAG
CCGGGCCGGA ACGGCTGGGT CAACGCCATC GGGTGGGACG CCCTCAACGC GGCGTACCAG
ACCCGCGAGT ACCAACAGCC GCAGGTACGG CTGCTGCAGG AGTTCTTCGC CGTGTACCGG
GCCCATGAGG AGCGGTATGG GTACTCGTAC TACTCCTCCT ACTCTTCCGG CGGCGCGAAA
CGACTCGATC TCTGCGCGTT CGAAAGCCGG CAGCTCTGGT CGATGCTCGA CGAGGCCGAG
GCGGTCGGCC TCCGGTTCGT GCATGCCCGC AAGAAGCTCG GCGACGTGAG CCGGTACGGT
TCCGCGGAAC TGTGCCTGAA CGTCACCCAG GACGAGAGCA CGCAGTCGCT GGTCGTGGCT
CCCCTGCTCC ACCTCGGCGG GACGGCGACG GACGCCGCCG TGTTCTCGTT CATCGGCCGC
GACGGCCACG GCGTCGTGTA CGTCGACCGG GCGGAGGTCC TGACCTCCGA CGACCATCGG
GACTGGCACG TCCGGCTCGC GCGGCTCGCC CAGCCGGTGC CGCCACGGCT GCAGCGGATG
GCGGTCGGGA ACCGGAGCCT GCGGGTGCCC CGCAGCGAGG AGTCCCGCTT CCGCGCGGAG
TTCTACCCGC GCCTGCGGCG CCTGGCCCCC GTGGTGTCCA GCGACGGTTC CTTCGAGCCG
CCGGAGGTGC CGCCGCCGGC CCTGGTGCTG CACGCCTCCT ACGGCGATGA CCACGAACTC
GACCTCAGCT GGGGATGGGC CTACGAGGTC GGCGGCGAGC GGATCCACGT GCCGCTGTAC
GCGACGGAGC CGGACGGCAC CGAACCGGAC GGCACGATGC TGCCCGCCGC GACGCCTCCC
GCCGCGGAAC CGGACGGGTT ACGGGATCCG CGCCAGGAGC AGGACGCGTT GCGGGACGTG
CTGGCGGCGC TGGATCTGCC CTCCGGCACG GCCGCCCTGC TCGCCGGTGA CCAGGGTTCC
GGTCTGCGGC CACGGGTGCG GCTGCGCGGC GTCGACACGA TGCGCGCGAC GACGGAGCTG
CTCCCGTTGC TGGCGGACCG GCCCGGGGTG GCCGTCGAGG TCAGCGGTGA CCCCGCGGCC
TACCGGGAGG CCGGCGACTC GCTGCTCATC GGCCTGTCCA CCGACGATGT GGTGGGCGAG
ACCGACTGGT TCGATCTCGG GGTCACCGTC ACCGTGGAGG GCCGGCAGGT GCCGTTCACG
GACGTCTTCC TGGCGCTGAG TCAGGGGCAG TCGCACCTGC TCCTCGCCGA CGGCGCCTAC
TTCAGCCTGC AGAAGCCCGA GCTGCAGTCG TTGCGGCGAC TGATCGAGGA GGCGCGGGCA
CTACAGGACT CACCCGGCGG CCCGCTGCGG ATCAGTCGGT TCCAGGTCGG GCTGTGGGAG
GAGCTCGCCG GGCTCGGTGT GGTCGAGCGC CAGGCGCGGT CGTGGCGCCA GCAGGTGACG
GGGCTGCTGG AGCTGGGTGC CGAGGGCCAG CTCGAACAGG AGCCGCCGGC GACGCTCGCG
GCGACGTTGC GCTCCTACCA GCTCGACGGT TTCCGGTGGC TGGCGTTCCT CTGGAAGTAC
CGGCTGGGCG GGGTGCTCGC CGACGACATG GGCCTGGGAA AGACGCTGCA GACGCTGGCG
CTGGTCTGCC ACGCCCGGCA GGCCGATCCC GAGCTCGCGC CGTTCCTCGT CGTCGCGCCG
ACGAGCGTCG TGTCGAACTG GGCCGCGGAG GCGGCCCGGT TCGCGCCCGG GCTGCGAACG
GTGGCGATCC GCGACACGCA GCGGCGGCGT GGCGAGGCCC TCGACGAGGC GGTGTCGGGC
GCCGACCTGG TGATCACCTC GTACACGCTC TTCCGCCTCG AGCACGAGGA GTACGCGAGC
CTGGCCTGGT CGGGCCTGAT CCTCGACGAG GCCCAGATGA TCAAGAATCA CCAGGCGAAG
GCCTACCGGT GCGCCCGGCT GCTGCCGGCG CCGTTCAAAC TGGCCGTCAC CGGGACCCCG
ATGGAGAACA ACCTGATGGA GCTGTGGTCG CTGCTGTCGG TGGCCGCGCC CGGCCTGTTC
CCGAACCCGA TCCGCTTCCG CGACTACTAC GCCCGGCCGG TCGAGAAGCA GAACGACGTC
GAGCTGCTCG CCCAGCTCCG ACGGCGGATC AGGCCGCTGA TGATGCGCCG CACCAAGGAA
CAGGTGGCGC CCGAGCTCCC GCCAAAGCAG GAACAGGTCC TGGAGGTGGA CCTGCACCCG
CGGCACCGCA GGCTCTACCA GACCTATCTG CAACGGGAAC GGCAGAAGGT ACTGGGCCTG
GTCGACGACA TGAACCGCAA CCGCTTCACG ATCCTGCGCT CGCTGACCCT GCTGCGCCAG
CTCAGCCTGC ACGCCGGCCT GGTGGACGAC CACCACGCAG ACCTGCCGTG CGCCAAGATC
GACGCTCTGT TCGAGCAGCT CACCGACGTC GTCGACAGTG GTCACCGGGC CCTGGTCTTC
AGCCAGTTCA CCGGATTTCT CGGCAAGGTA CGCGAACGGT TGTCCGCCCT CGGTGTGGAG
CACTGCTACC TCGACGGCCG GACCCGCGAC CGGTCCACCG TGCTCGAGCG GTTCAAGACG
GGGTCGGCCC CGGTTTTTCT CGTCAGCCTG AAAGCCGGCG GGTTCGGCCT GAACCTGACC
GAGGCGGACT ACTGTTTCCT GCTCGATCCG TGGTGGAATC CCGCGACCGA GGAACAGGCG
GTGGACCGGA CGCACCGCAT CGGGCAGTCC CGCAACGTCA TGGTCTACCG CCTCGTCGCC
CGGGACACGA TCGAGGAGAA GGTGATGGCG ATGAAGGACC GGAAGGCACG GCTGTTCTCC
AGCGTCATGG ACGACGGTGA CGTGTTCAGC TCGACGCTCG ACGCCGACGA CATCCGCGAG
CTGTTCGCCT GA
 
Protein sequence
MLSVSWQGPG TAGLRTAVSE RSYARGVHYA QQSAVTRIDW DPDENTLQGR VRGSGGEVYA 
TNAQFSGDGP SSWTFQYGSC TCPVGADCKH VVALVLTATT GATATASAAT GAAAPGPSAR
RSGSSPRSAR PRPAQPRPSR SAAWTEDLGE LLGSSPAVGG GHGGAAGTTP LAVELSFTAD
PPAHGGGSEL GLRVTARLVQ PGRNGWVNAI GWDALNAAYQ TREYQQPQVR LLQEFFAVYR
AHEERYGYSY YSSYSSGGAK RLDLCAFESR QLWSMLDEAE AVGLRFVHAR KKLGDVSRYG
SAELCLNVTQ DESTQSLVVA PLLHLGGTAT DAAVFSFIGR DGHGVVYVDR AEVLTSDDHR
DWHVRLARLA QPVPPRLQRM AVGNRSLRVP RSEESRFRAE FYPRLRRLAP VVSSDGSFEP
PEVPPPALVL HASYGDDHEL DLSWGWAYEV GGERIHVPLY ATEPDGTEPD GTMLPAATPP
AAEPDGLRDP RQEQDALRDV LAALDLPSGT AALLAGDQGS GLRPRVRLRG VDTMRATTEL
LPLLADRPGV AVEVSGDPAA YREAGDSLLI GLSTDDVVGE TDWFDLGVTV TVEGRQVPFT
DVFLALSQGQ SHLLLADGAY FSLQKPELQS LRRLIEEARA LQDSPGGPLR ISRFQVGLWE
ELAGLGVVER QARSWRQQVT GLLELGAEGQ LEQEPPATLA ATLRSYQLDG FRWLAFLWKY
RLGGVLADDM GLGKTLQTLA LVCHARQADP ELAPFLVVAP TSVVSNWAAE AARFAPGLRT
VAIRDTQRRR GEALDEAVSG ADLVITSYTL FRLEHEEYAS LAWSGLILDE AQMIKNHQAK
AYRCARLLPA PFKLAVTGTP MENNLMELWS LLSVAAPGLF PNPIRFRDYY ARPVEKQNDV
ELLAQLRRRI RPLMMRRTKE QVAPELPPKQ EQVLEVDLHP RHRRLYQTYL QRERQKVLGL
VDDMNRNRFT ILRSLTLLRQ LSLHAGLVDD HHADLPCAKI DALFEQLTDV VDSGHRALVF
SQFTGFLGKV RERLSALGVE HCYLDGRTRD RSTVLERFKT GSAPVFLVSL KAGGFGLNLT
EADYCFLLDP WWNPATEEQA VDRTHRIGQS RNVMVYRLVA RDTIEEKVMA MKDRKARLFS
SVMDDGDVFS STLDADDIRE LFA
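
Both sequences are printed in whitespace-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation and a check for a start and stop codon. The helper names are hypothetical, and the stop-codon set is the one used by translation table 11. Only the first and last printed lines of the gene sequence are used here as an excerpt; pasting the full sequence in their place would give the whole-gene values.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def join_blocks(text: str) -> str:
    """Strip all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Excerpt: first and last printed lines of the gene sequence in this record.
first_line = "ATGCTCTCCG TGTCGTGGCA GGGACCCGGC ACCGCCGGTC TCCGGACAGC CGTCAGCGAA"
last_line = "CTGTTCGCCT GA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(head.startswith("ATG"))        # True: the gene begins with a start codon
print(ends_with_stop(tail))          # True: the gene ends with TGA
print(f"{gc_fraction(head):.0%}")    # GC fraction of the first 60 bases only
```

The same `join_blocks` helper applies to the protein sequence, since it uses the same block layout.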