Gene Francci3_1195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1195 
Symbol 
ID3903469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1427235 
End bp1428560 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content61% 
IMG OID637878526 
Productputative transcriptional regulator 
Protein accessionYP_480302 
Protein GI86739902 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAACG AGGGGTTCGC GGCTCACCTA GGCGTAGCCG CTCGGACCGT CGCCAACTGG 
CGGGCACGTC CAGAGGTCGT GCCGCGGCCT GCCGCTCAGG AAATCTTGGA TGCCGCGTTG
GCGCGCGCGC CCTTCAACGT CCGGGAGCAA TTCCGGATGC TGCTGTCCGC CGACGGTCGG
CGGGCAGATT CTCGGGAGAC TTCCAGCGAG AATAAAAATC AGGCAGAGGT TGACGCCAGC
CCTACGGGGC TATGGACTCC GGACGGTACA CTATCGGCAG TAGCCGAAGT CTCGGAGGGA
AGTCCAATGG ACCGAAGGCA ATTTCTTGTT CTTTCGGGTT CCACCCTCAC CTCTCCTGCA
CATGAATGGC TCATTGCGCG GCCATCGAAC GATCTTTCGA GTCAATCAGG GAGATTCGTT
GGAACATCGA TCGTGGACAA CCTGCGCCGT ATCACAGACG AGCTTCGCCG CATGGACGAC
CAGATCGGGA GCGGCCCCCT GGTGCAAGTA GTCCGCAGCC AGGCATCCTA TGTCACCGAC
CTTCTGAAGA ACGGCCGCTA CACCGACTCG GTGAGCCGAG ACCTTTACGG AATGCTTGCC
GAGCTTCTGC GCCTGGCGGG GTGGCTCTCG TTTGACGCGG GGCGCCACGG TCAAGCGCAA
CGCTTTTTCA CCGCAGGGCT GCGCAGCGCC CACACCGCCG GAGACCGCGC GCTCGGCGCG
AACATCCTCG GGTTCATGAG TTGCCAGGCG AAGGACATCG GCCAGTTCAC CGAGTCAGCG
AGATTCGCAG ACAGCGCGAG AACAGGCTAC GCCGGTACCA GCCCGACAGT TTCGGCAATC
CTGAACATGA GGGCCGCCCA GGCGTACGCG AACCTGAAAG ACGCGGTCGA GACGCGCCGG
GCAATCGATG CCGCCTTCGA CGTCTTCGGC GGAAATCCTC CCGGTCACGG AGAACCACCG
TGGTCCTACT GGTTCAATGA GGCTCAGATG AATGAGCAGG TTGGCTACTG CTACATGCGC
CTTGGGGATT GGGAGCGTGC CCGCGACCAC CTGTCCCTGT CTACCGGTGT TACAGGAGGT
CCAGACACTC GGGAAGGGGC TTTGCGTCAA GCCCTGTTGG CTGACACCTA CGCTCAACAG
GGTGATCCGG ACAGTGCATG CGCAATTGGC AACCAGGCGA TTGACGCTCT CACGAATGAG
GTTGATTCAG CGCGCTGCGT CGGGCACGTA AAGCAGGTAA GACAGCATCT TGTACCGTAT
CACAGATTGT CGGTGGTGCA GGAATTTAAC GAGCGAGTAG AGGCCCTCTC CAAATCAATC
ACCTGA
 
Protein sequence
MSNEGFAAHL GVAARTVANW RARPEVVPRP AAQEILDAAL ARAPFNVREQ FRMLLSADGR 
RADSRETSSE NKNQAEVDAS PTGLWTPDGT LSAVAEVSEG SPMDRRQFLV LSGSTLTSPA
HEWLIARPSN DLSSQSGRFV GTSIVDNLRR ITDELRRMDD QIGSGPLVQV VRSQASYVTD
LLKNGRYTDS VSRDLYGMLA ELLRLAGWLS FDAGRHGQAQ RFFTAGLRSA HTAGDRALGA
NILGFMSCQA KDIGQFTESA RFADSARTGY AGTSPTVSAI LNMRAAQAYA NLKDAVETRR
AIDAAFDVFG GNPPGHGEPP WSYWFNEAQM NEQVGYCYMR LGDWERARDH LSLSTGVTGG
PDTREGALRQ ALLADTYAQQ GDPDSACAIG NQAIDALTNE VDSARCVGHV KQVRQHLVPY
HRLSVVQEFN ERVEALSKSI T