Gene Francci3_4540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4540 
Symbol 
ID3907517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5424683 
End bp5425684 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content71% 
IMG OID637881873 
Productchromosome segregation DNA-binding protein 
Protein accessionYP_483615 
Protein GI86743215 
COG category[K] Transcription 
COG ID[COG1475] Predicted transcriptional regulators 
TIGRFAM ID[TIGR00180] ParB-like partition proteins 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCTC GGCGCAGTGG TCTGGGGCGA GGGCTGGGGG CACTCATCCC GGTTGCTCCT 
CCTCCCCTCG ACGGTCAGGC GGGCACCGCC TCGGCTGACT ACGCTGACTA CGCCGACTAC
GCCAGTGGCT ACGGCGGCCG GGGGACGAAT CGCGGCTCCG GCGACGGCAC CGGGCCGCTG
CCCGTACACG GGGCGACCTT CCGCGAGATC CCCGTCGAGT CGGTCAGCCC GAACCCGCGC
CAGCCCAGGA CTCACTTCGA TGAGGACGCC CTCGAAGAGC TGGCGGCGAG CCTCCGCGAG
GTCGGGCTGC TCCAGCCGAT CGTGGTACGC GAGGTGGCCC CCGAGCGCTA CGAGCTCGTC
ATGGGTGAAC GCCGGTGGCG TGCATCGAAG ATCGCCAAGC TTCCCCGGAT TCCCGCCATC
GTGCGGGAGA CCGCGGACGA CGCCATGCTG CGCGACGCGC TGCTGGAGAA CCTCCACCGC
CAGCAGCTCA ACCCGCTGGA GGAGGCTGCG GCGTACGAAC AGCTGCTCCG CGACTTCGGG
GCGACGCACG AGGAACTGGC CAGTCGGCTC GGGCGGTCCC GGTCGCATGT CACCAACATG
ATCCGGCTGC TCGGGCTCTC GCCCGCGGTA CAAAGGCGGG TGGCGGCCGG CGTGCTGTCG
GCGGGTCATG CCCGCGCGCT GCTCTCGCTA CAGGATCCGG ACGCCCAGGA CCGGCTCGCC
ACCCGCATCG TCGCGGAAGG TCTCTCGGTG CGGGCCGTCG AGGAGATCGT GGCACTGGAC
GACGAGGCCC CGCGTAAGCG GGCGTCGCGG GCACCAGGGG CCGCCTCACC GGCCCTGGTG
AGGCTCGCCG ACCGCCTCTC GGATCGCTTC GAGACCCGCG TGAAGGTGGA CATGGGCCGC
AGTAAGGGGA AGATCACGGT CGAGTTCGCC TCGATCGAGG ATCTGGAGCG GATCGTCGCC
GTGATGTCGC CGTCGGCCTC GGTCGACGGC TTGTTGGACT AG
 
Protein sequence
MSARRSGLGR GLGALIPVAP PPLDGQAGTA SADYADYADY ASGYGGRGTN RGSGDGTGPL 
PVHGATFREI PVESVSPNPR QPRTHFDEDA LEELAASLRE VGLLQPIVVR EVAPERYELV
MGERRWRASK IAKLPRIPAI VRETADDAML RDALLENLHR QQLNPLEEAA AYEQLLRDFG
ATHEELASRL GRSRSHVTNM IRLLGLSPAV QRRVAAGVLS AGHARALLSL QDPDAQDRLA
TRIVAEGLSV RAVEEIVALD DEAPRKRASR APGAASPALV RLADRLSDRF ETRVKVDMGR
SKGKITVEFA SIEDLERIVA VMSPSASVDG LLD