Gene Francci3_1257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1257 
Symbol 
ID3906103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1500564 
End bp1501832 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content74% 
IMG OID637878591 
Producthelix-hairpin-helix DNA-binding, class 1 
Protein accessionYP_480364 
Protein GI86739964 
COG category[L] Replication, recombination and repair 
COG ID[COG1555] DNA uptake protein and related DNA-binding proteins 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
[TIGR01259] comEA protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0994426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGAAT CCACCCCTGG TCCCGGGTAC CCGCTGTCGG CCCCGTCGCC GCGTTTCCCC 
GTCCACGCAT TTCCGCCTGA AACGTCCGTT TCGTCGCGAT GGCCATCGCT GGACCTGCTG
CCGTCGGACG ACGCCACCGT GCCGCTGCGT CCGCCGGACC CCCCAGCGGA CGGCTCAGCT
GGTGGATCCC GCCGCGTTCG TGACGCCACG TTCACCGAGG ACGAGCAGGA GATCGGTGGC
GACGAGTCCG ACCTGTTCGG GATTCGCGAG GGCGTGTTCC TTGATGAGGA CCGGGCCGCC
GGCGGATGGG TCGCGGACGA GCTCGCGGCG GAAGGGGCCG ATCTTGGCCG GCTGGTCGCC
GGCCGAACGG ATCGGCGCCG TCCGGTCCAC TCCCGTCTGG TCCACCCCCG TGCGGGGCGC
GAGATGGACG ATGGCGAGGA CGAGGGCGGC CGGGGACCTG ATCGCGGGCC ATCGGCCGGC
CGGCCGGCGC CGGGAGTTCC ACGGGGCGAT CTGGTCGGCG TCATCCGCAG GCGGCTTCCG
GCTACGCTGC ATGGCATCGT GGTGGCGCCG GCCGCCCGGG CCGCCCTCGT TCTCGCGTTG
GTCGCCCTCG CCGCCGCGCT CATGACCGCC TGGTTCTCCT GGCAGCACCG CCCGGTGCCG
CTCGCCTCCT CCGAGGTCGA TACCCCCCGG ACGGGGGAGT CGGCGGCGGC TGGCAACGCC
CGGTCGGACC ACTCGGACCA CGGGGCCACC TCCGCCACGG ATACGAGCGT GACTTCGGCA
CCGACGGCCA GGGCCGCCCG CACCGGGGCG AGCGGCGAGG TGGTGGTCGA CGTCGCCGGC
CGGGTGGCCC GGCCGGGGGT CGTCCGGCTC CCGGCGGGCG CGCGCGTGGT GGATGCGATC
GAACGCGCCG GGGGAGTGCT CCCGGGCACC GACACGACCG GCCTCGCCCT GGCCCGGTTG
CTGGTCGACG GAGAACAGGT CCTCGTCGAC GGCAAGCCCG GCCCGGCGCG GCCCGGAACG
GCGGCCGGGC AACCTGCGGG TACTGGCCTC GGGTCCGCCG GTTCCACCGC GGCGACCGGT
CCGATCGACC TGAATGCCGC GACCGCCGAG GAGCTTGACG GTCTGCCCGG GGTGGGGCCT
GTGCTGGCTC GCCGCATCGT CGAGTGGCGC ACGGCGCACG GGCCCTTCCG GTCGCCCGAG
CAGCTGGCGG AGGTGACCGG AGTCGGCGAC AAGCGGCTGG CCGATCTGCT GCCATTGCTG
AAGGTTTGA
 
Protein sequence
MVESTPGPGY PLSAPSPRFP VHAFPPETSV SSRWPSLDLL PSDDATVPLR PPDPPADGSA 
GGSRRVRDAT FTEDEQEIGG DESDLFGIRE GVFLDEDRAA GGWVADELAA EGADLGRLVA
GRTDRRRPVH SRLVHPRAGR EMDDGEDEGG RGPDRGPSAG RPAPGVPRGD LVGVIRRRLP
ATLHGIVVAP AARAALVLAL VALAAALMTA WFSWQHRPVP LASSEVDTPR TGESAAAGNA
RSDHSDHGAT SATDTSVTSA PTARAARTGA SGEVVVDVAG RVARPGVVRL PAGARVVDAI
ERAGGVLPGT DTTGLALARL LVDGEQVLVD GKPGPARPGT AAGQPAGTGL GSAGSTAATG
PIDLNAATAE ELDGLPGVGP VLARRIVEWR TAHGPFRSPE QLAEVTGVGD KRLADLLPLL
KV