Gene Francci3_1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1022 
Symbol 
ID3906264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1216285 
End bp1217610 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content70% 
IMG OID637878355 
ProductXRE family transcriptional regulator 
Protein accessionYP_480134 
Protein GI86739734 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.842716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.936661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGTC GCCGCCCACT ACCCACCGCA CCGACCGGCC TGTGGGACCG CCCCGAGATG 
GCCCAGGCCC TCACCGCACG GGACATGCAG ACCGTGCTGG CGATCTACCG GAAGTGGACC
GGTGCCTCCC AGTCGCAGAT AGCCGCCATG ACCGGCATCC CGCAGCCGTC CATCAGCGTG
ATTGTCCGCG GGAAACGCCA GGTCACCACC ATCGAGAACT TCGAGAAGTT CGCCGACGGA
CTCGGCATCC CCCGAGACCG TCTCGGACTC GCCAGCTCGG AAACCACGGA AACCGCCGGC
AGCGAGACGA GCCCGGACCG GCGCACCGTG ATCGCAGCCG GAGCGCTGTT CGCACTCGAC
GCCGAGCTCG ACGAGGTCAC CCGCCGGATG CAGCAGTTCG CCGCATCCAA CGTCGATGAC
GACGCGCTAC ACCAGCTCGA CACCAGCATC GAAGTCGTGG GCCGCCGCTA CGAGAACAGC
GACGCCGCCA CCGTCTACCC CGTCGCCCTG AAGCAGCGCC GGTGGGTCGC CGACCTGATG
TCCGGACACC AGCACCCCGA CCAGCGCCGC GAGCTGTACG CCATCGGCGG GAAGCTCTCC
GGCCTGCTCG GTTATCTCGC GTTCGACCTC GGCAACGAAC TCGTTGCCCG CGCCTACTGC
AACGAGGCGA TGAGCCTCGC CAAGACCGCC GGACACCGCG ATCTCGCCGC GTGGGTCCGC
GGCACCCAGA GCTTCATCGC CTACTACGGC GGCCGGTACC GCGAAGCCCT GGACCTTGCC
CGCGACGGCC AGCGCTACGC CCGCGGCGGC CCCGCCAGCA TCCGACTCGC CATCAGCGGC
GAAGCCCGCA CCCTGGGCAA GCTCGGCGAC ATCGCCGGAG TCGACGAGGC CGTCGGCCGC
GCCCTGGCCG CCCACGCCCG CATCGAGGAC ACCGACCCCG TCGGCTACTT CCTGTCCTTC
GACCCGTTCA CCGCATCCCG CATCGCCGGC AACGCCGCCT CCGCCTACCT CGCCGCCGGA
GCCCCCGACC GGGCCCGCGA GTTCACAGAC CAGGCCATCC CCATCTTCGC CGCCGCCGAC
TCCACCGCCA GCCACGCCCT CACCCTGGTC GACGCAAGCA TGACCTACCT AACCGGTCCC
AACCCCCAGC CCGACCGCGC CGGAGCACTC GTTGCCGAAG CACTCGACGT CGGCGCCGAT
CTGCGATCCG AAGTGGTCGC CCGCCGGGCC CGGGACTTCC TGCTCACCGC CGCCCAGTGG
CGCACCGTCC CGGAGATCGC CCAGGTCAAC GACGCCGTCA AAGCCTGGAG ACTGCCCACC
GCCTGA
 
Protein sequence
MTRRRPLPTA PTGLWDRPEM AQALTARDMQ TVLAIYRKWT GASQSQIAAM TGIPQPSISV 
IVRGKRQVTT IENFEKFADG LGIPRDRLGL ASSETTETAG SETSPDRRTV IAAGALFALD
AELDEVTRRM QQFAASNVDD DALHQLDTSI EVVGRRYENS DAATVYPVAL KQRRWVADLM
SGHQHPDQRR ELYAIGGKLS GLLGYLAFDL GNELVARAYC NEAMSLAKTA GHRDLAAWVR
GTQSFIAYYG GRYREALDLA RDGQRYARGG PASIRLAISG EARTLGKLGD IAGVDEAVGR
ALAAHARIED TDPVGYFLSF DPFTASRIAG NAASAYLAAG APDRAREFTD QAIPIFAAAD
STASHALTLV DASMTYLTGP NPQPDRAGAL VAEALDVGAD LRSEVVARRA RDFLLTAAQW
RTVPEIAQVN DAVKAWRLPT A