Gene Francci3_4250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4250 
Symbol 
ID3907217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5071936 
End bp5073579 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content74% 
IMG OID637881576 
ProductFHA domain-containing protein 
Protein accessionYP_483325 
Protein GI86742925 
COG category[T] Signal transduction mechanisms 
COG ID[COG1716] FOG: FHA domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.596037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTACG GGGCACCTGT CTCACGACCC GACTACTGCG ATCGGTGCGG AATTGGGATC 
ATGAGTGGTC AGGAGAGGTA CGAATACCGT CCGGAGGACG GCGGGCGAGC TCCCTACGGA
GCAGATGCCC CCGGCCGCTC GGCTGACGGG AACCGCCGGC GTGGGCCAGC CGAGCAGACC
GGCCCGGATG CGGACGGTCC AGATGGCTAC GACCCGTTGG GCATCCGCCG TTCGCCAGCC
GACCTGCCCG GGGTGACGGA CCCGCCGCGC GGCGGCGACC CGCGCGGCGG CGACGCGTAT
GGCGGGTATG GCCGCGACGG CTACCCCGAC CAGTCCCGGG GGGGCCGCGG TTATCAGGAG
GACCCCTACG GCCCGCCCGG TGGCTATGAT CGTGCCCCCT ACCCACGCGG GCGTGACGAG
CGGCCGGCCC CCGACGACCG GATCGACCAC GGGCGCGATC CCTACGGCCG GGACTGGGGC
ACGCCGGAGA ACCGTGGCGG TTACCCCGTC GGCCGCGACA GCCGCGAGGA GCGCGTCCCG
GGCGGCTACC CCGACCTCGA TCCCCGCGAT GATCCCCGCG ATCCCTACGA CCGCGACCCC
TACAGCCGCG GCCATGACGA CCGGGCCGAC CGCGGCGGCT ACCCCACCGG CCGCGACAGC
CGCGAGGAGC GCGTCCCGGG CGGCTACCCC GACCTCGATC CCCGCGATCC CCGCGATCCC
TACGACCGCG ACCCCTACAG CCGCGGCCAT GACGACCGGG CCGACCGCGG CGGCTACCCC
ACCGGCCGCG ACAGCCGGAC GCCGGGTGGA TATCCGGATC CGCGGGGCGA CCTTGATCGC
GAGTCGGACA CGCACCGCCG GAGCCGGCGC GACGCCGAGT ACGGCTACGG TGCGGACCCG
CTGGGCATGG ACGCCTCCGC CCCGCCCGCG CCAGCCGGCG ACCGGGTCAC CCAGAGCCGG
GGCGGCAACC CGCGGCACAC AACCGACCCG CGGCACACGA CCGACCCGCG GCACGCTCGG
CCCGGCGCCG ACCCGCGGCT GAGTGATCCG CGACTGAGTG ATCCGCGGCT GAGTGATCCG
CGGCTGGGCG ATCCGCGACT GGGCGATCCG CGACTGGGCG ATCCGCGACT GGGCGATCCG
GAACTCGGCG GCACCCGCGG CGGCCACGTC GACCCCCGTG ACGCGGTCCT CGACGACCCG
CGGGCCGGCG CCGGGCGTGG TGCCGGATGG GACCGGGACC GCGCGTCGGC GGGCAGCACC
GTCTGGGAGG CCATCGTCGA GGCGGAGCGC GAGTACTACG ACAGCGGCGA CGACCACCGG
GTCCCGTTCC CGACCTTCTA CCCTCGTCGG GTCTTCGCCC TGGCCGGCTC TCAGATGCTC
GTGGGACGGC GCAGCGAGTC CCGTGGCATC CATCCCGACA TCGACCTGTC CGGAGCGCCG
GAGGACCCAG GGATCTCCCG CAGCCACGCC CTGTTCGAAC TGCTGCCCGA CGGCGGTTAC
GCGGTCCGCG ACCCGGGCTC CACCAACGGC ACCCGACTGA ACGACGAACC GGACCCCATC
GAGCCCGGCC AGCCCGTTCC GCTTCGTGAC GGGGACCGGG TCTACCTGGG CGCCTGGACG
CGCATCACAT TGCGCGCCCG CTGA
 
Protein sequence
MPYGAPVSRP DYCDRCGIGI MSGQERYEYR PEDGGRAPYG ADAPGRSADG NRRRGPAEQT 
GPDADGPDGY DPLGIRRSPA DLPGVTDPPR GGDPRGGDAY GGYGRDGYPD QSRGGRGYQE
DPYGPPGGYD RAPYPRGRDE RPAPDDRIDH GRDPYGRDWG TPENRGGYPV GRDSREERVP
GGYPDLDPRD DPRDPYDRDP YSRGHDDRAD RGGYPTGRDS REERVPGGYP DLDPRDPRDP
YDRDPYSRGH DDRADRGGYP TGRDSRTPGG YPDPRGDLDR ESDTHRRSRR DAEYGYGADP
LGMDASAPPA PAGDRVTQSR GGNPRHTTDP RHTTDPRHAR PGADPRLSDP RLSDPRLSDP
RLGDPRLGDP RLGDPRLGDP ELGGTRGGHV DPRDAVLDDP RAGAGRGAGW DRDRASAGST
VWEAIVEAER EYYDSGDDHR VPFPTFYPRR VFALAGSQML VGRRSESRGI HPDIDLSGAP
EDPGISRSHA LFELLPDGGY AVRDPGSTNG TRLNDEPDPI EPGQPVPLRD GDRVYLGAWT
RITLRAR