Gene Francci3_3308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3308 
Symbol 
ID3904094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3918960 
End bp3920462 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content68% 
IMG OID637880633 
ProductXRE family transcriptional regulator 
Protein accessionYP_482394 
Protein GI86741994 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.032044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.303399 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGGAGG CTGAGCGGCA GCGAAACGAC ACCTTGGACG AGCTGGTGGC CCTACTCCAA 
CAATTGATCA AGAAATCTGG TACCAACAGG ACGCGGCTGG CCGAGCGGAC AGGATTCCCT
CGCCAGCAAG TTTCCCGGGC AGTCAACGGC CGTGAGGTAC CATCGCCGGA CCTCGCCGAC
GCATTCGACG TCGTGTTCGG TTGCAACGGG AGGATTCGTC ACCTTCGGGA TGAGGCACAC
AGAGAGAAGC GGGCGCGACG TCTCGGCGCC GAACCGCCCC GACGGGACAA GGCGCAGCCG
TCCCCGCAAA GCAGCGTCCC GAAGGAAGGG CAGGAGTTAG GGCTACAGGC CACGCTACCC
ATCACACCGA GCCCGCTGCC TGGACGAGGA CCAAGCAACG GGCCCATAGG CGTAGGTGAC
CCAAGGGAGG CGAGAGCCAC GGACAGACGC GACGCCCTAC GCGCCATGGC GCTGGGGACC
GCGGCACTCG GCCCGGTCGC GGCGGACCTC TCCCGCAGCA TCGCCGGGGC CGACCCGGAC
CCCCTCAGTG TCGACCTAGC CGAAGCGCAT ATCCACCGCA TCGCCGCCGC CTACCGCGTC
ACTCCCCATG GCGAACTCAT GGACGCCCTC GGCCCGGAAT GGCAGAACAT CGAACGCATC
CTCGACCGCC GTGTCTCGCC GCCGGTGCGC GCTCGCCTCA CGTTGATCGC CGGACAGTAC
GCCTTCTACC TCGGTACGCT CGCCTTCGAT CTCGGCGACG ACGACACCGC ACGCAGTCTT
CTCCGGGTCG CCAGCCAACA CGCCGACGAG ACGAAACAAC TCCTGCCTGC CCGCTCTCCA
CGCCGATCCG ACGTCCTGTT GCTCGACGGA TCCGTCGCAG CGATCCGTTC CAGCGTCGCC
TACTTCAACC GCGCGTACAG CGAAGCCGCC GACATCGCCG CCCAGGCACG GGAAGGCGCC
CATCCGTTCG CCCTGCCGAT CCTCGCCGGC TGCGAGGCAC GGGCCGCGGC GCTCGCGCAC
CGACCCGACG ACGCCCGCGC CGCCTTGGCC GACATGCAAG AGCATCTTTG GGACGGCGCA
GTCATGCCCG GCCCGAACCC GGGGGACGCC GCGTTCATAC ACGGCTTCCT CGCCGTCGCG
CTCGCCCACG TCGGCGACGG TGTTCAAGCC GAGCAGCATG CCCGTGTCGG CCTGGATCTG
GAGATCGCGG CCAACCCCGA CCATTATGTG CAGATCGGCG GGAAACACAA CGCCCTCTGC
CGCGCCTACC TCCGCCGCCC CGAACCGGAT CCGGAAGCCG CCGCAGACGC CGCGCGTCAC
GCACTCCTCA CGGTGGACGG ACGACCCAAT CGGACAGTTA TCCAGCAGGC AGGCCAGATG
TGGAGACAGA TGGACGGTAA ATGGCCCGAG CTCCCCACGG TCCGTGACCT CGGCGAGATA
GTACAAACCT CCAGACGAGC CCTCGAATCC GGACCGGGAG ATCCTGCGTC CGCCTGCGCC
TGA
 
Protein sequence
MLEAERQRND TLDELVALLQ QLIKKSGTNR TRLAERTGFP RQQVSRAVNG REVPSPDLAD 
AFDVVFGCNG RIRHLRDEAH REKRARRLGA EPPRRDKAQP SPQSSVPKEG QELGLQATLP
ITPSPLPGRG PSNGPIGVGD PREARATDRR DALRAMALGT AALGPVAADL SRSIAGADPD
PLSVDLAEAH IHRIAAAYRV TPHGELMDAL GPEWQNIERI LDRRVSPPVR ARLTLIAGQY
AFYLGTLAFD LGDDDTARSL LRVASQHADE TKQLLPARSP RRSDVLLLDG SVAAIRSSVA
YFNRAYSEAA DIAAQAREGA HPFALPILAG CEARAAALAH RPDDARAALA DMQEHLWDGA
VMPGPNPGDA AFIHGFLAVA LAHVGDGVQA EQHARVGLDL EIAANPDHYV QIGGKHNALC
RAYLRRPEPD PEAAADAARH ALLTVDGRPN RTVIQQAGQM WRQMDGKWPE LPTVRDLGEI
VQTSRRALES GPGDPASACA