Gene Franean1_0367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0367 
Symbol 
ID5668791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp435972 
End bp437561 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content79% 
IMG OID641239299 
ProductFHA domain-containing protein 
Protein accessionYP_001504739 
Protein GI158312231 
COG category[T] Signal transduction mechanisms 
COG ID[COG1716] FOG: FHA domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.216513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000756141 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGCGCAC GCTGCCCGGA GGGGCACCTG TCGCACGATC CCGACTACTG CGATCAGTGC 
GGGCTGCGGA TCGTGGGTGG TCAGGAGCGA TACGGCTACC GTCAGGACGA CGCCGGGCGC
GGCGCCGCGT ACAGCGCGGG CGCGCCGACG GGGCGTCCGG CGGACCCGTA CGGGCGTCGG
ACGCCGGAAC GGGACCCCTC GGTGCCCCCG GGGCCTTCGG GCCCCGCCGG AGCGCCGGGA
GGCTATGACC CGCTCGGCAT CCGGCGCGCC CCGGCCGACG CGCCAGGCGT GGCGGATCCG
CCACGTGGCG CCGGCCGCGG CGGCGACCCC TACGGGGGCT ACGGCCGCGA GGGATACGCC
GAGGAGCCGC GCTCCGGACG GTCGTACCCG GCGGCGGACC CATACAACCG CGACGGGTAC
GGGCGCCCCG CCGACGAGCG GGGCCGGCCC GACGAACGGG CCCGGCCCGA CGGCCGCGGT
GAGCGGCCGC GGGAGGGGCT CGACCCCTAC GGCCGTGAGC GCGACGATCG CGGCCCCGGC
GGTCGCGATG ACCGCACCCC GCCCGGATAC GGGCGGGACG ACCGCGGCGG TGCCGGCGGC
TTCGGCGGCG GGCGTGGCGA GCCGGATCCC GCCGGCTACC CTCGCGCCCG CGACGACCGC
GGCGGGTACG GCGCGCCCGA CGGGCGGGGC GGCTACGGCC CCGGCGAGCC GGGCCGGCCG
GCGCCCGAGG CCGACGACCG GCGCGGCTAC CGTGACGACG GCTACGGCCC GAACGGCGGC
GGCTACGGCG CGGACCCGCT CGGCGTGTCG GCCGCCGGCG CGACCCCGGT ACCGGCCCCG
GCCGGCACCC GCGACCGCGG CGCCGACCGC TATGCCCCCG CCCCCTGCCC GAACTGCCGC
TCGCTGAACG AGCCGTCCGC GCGGTTCTGC GAGACCTGCG GGCTCGACTT CAGCACCGGC
CAGCTGCCCG CTCCGGCGGG CGCGGTGCCC GTCGACGACG CGGGCGTGAC CTCCCGGCAG
ACCGGGTCAC GGCGCGGCGC CGAGGCCCGC CACTCCACCG ACCCGCGGAT GGCCGACCCG
CGCCGCGCCG ACCCGAGGCT CGACGATCCC CGGCTCGGCG ACCCGCGCGC GGCCGATCCC
CGGATGGGCG GCGACCCGCG AGCCGGCTCC CGTGGCCGCG AGTGGGACGA GGCCGGTCCC
GGCGGCACGT GGGAGGCCGT CGTCGAGGCG GAACGCGAGT ACTACGACAG CGGTGACGAC
CACCGCGTGC CGTTCCCCGC GTTCTACCCG CGCCGCGTCT TCGTCCTGAC CGAGTCGCAG
ATGCTGATCG GCCGGCGCAG CGAGTCGCGC GGCATCCACC CCGAGATCGA CCTCTCGGGA
GCACCGGAGG ACCCGGGCAT CTCCCGGGCG CACGCGACCC TGGAGCGGAT GCCCGACGGC
ACGTACGCCG TGCGTGACCC CGGCTCGACG AACGGAACCC GCCTCAACGA CGAGCCGGAC
CCGATCGAGC CCGGCCGTCC GATCCCCCTG CGCGACGGCG ACCGCGTCTA CGTGGGTGCC
TGGACGAGGA TCACCGTCCG CGCGCAGTAA
 
Protein sequence
MSARCPEGHL SHDPDYCDQC GLRIVGGQER YGYRQDDAGR GAAYSAGAPT GRPADPYGRR 
TPERDPSVPP GPSGPAGAPG GYDPLGIRRA PADAPGVADP PRGAGRGGDP YGGYGREGYA
EEPRSGRSYP AADPYNRDGY GRPADERGRP DERARPDGRG ERPREGLDPY GRERDDRGPG
GRDDRTPPGY GRDDRGGAGG FGGGRGEPDP AGYPRARDDR GGYGAPDGRG GYGPGEPGRP
APEADDRRGY RDDGYGPNGG GYGADPLGVS AAGATPVPAP AGTRDRGADR YAPAPCPNCR
SLNEPSARFC ETCGLDFSTG QLPAPAGAVP VDDAGVTSRQ TGSRRGAEAR HSTDPRMADP
RRADPRLDDP RLGDPRAADP RMGGDPRAGS RGREWDEAGP GGTWEAVVEA EREYYDSGDD
HRVPFPAFYP RRVFVLTESQ MLIGRRSESR GIHPEIDLSG APEDPGISRA HATLERMPDG
TYAVRDPGST NGTRLNDEPD PIEPGRPIPL RDGDRVYVGA WTRITVRAQ