Gene Franean1_0739 details


Gene Information

Locus tag: Franean1_0739
Symbol:
ID: 5669155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 860622
End bp: 862364
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 73%
IMG OID: 641239666
Product: FHA domain-containing protein
Protein accession: YP_001505103
Protein GI: 158312595
COG category:
COG ID:
TIGRFAM ID:
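The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(860622, 862364)
print(length)                     # 1743
print(protein_length_aa(length))  # 580
```

Both results match the Gene Length and Protein Length fields of this record.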


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.002863
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGGTCGGCC CACACCCGCC CGGCGGCCCG GACGACCCGC TCCACGGCCC CACCACCGTG 
CTCGGCAACT CCGCGGCCCC GCAGGCGCCA CCCGTCACAG CACCGCAGGC GCCTCCCGCC
GCGGCACCAG AGCAATCACC CGCCCCGGCA CGGCAGGCGC GACCTGCCCC CGTCCCTGGC
CCGCAGGCAC CACCCGCACC AGCCCGGGTG GGCGGGGGCG AGACTTATGG CGACCTCACC
CGACGGCTGA TCGAGACGTC GGTCTGGACG GACCGCATGC TCGCCGACGC CGGCATCCCC
GTCCACGACG CACCCCTGGT CACGGTGGGC GGTGGGATCG GCTCGTTCGT CCTGGTCGAC
TACCTGCGCA TCGCGGGCGC GCCGACCTCG GCGATCCGGG TGCTGTCCAA CATCGACACC
CCCTGGCAGA CCTACCGGTA CCTGACCCGG GTCTCGCAGA TCCCCGACCA CGAGCGCATC
CGCTCGGACT CGAGCTCGAC CCCGGACAAC ATCTGGGCCT TCCCGTCCTA CGCGGTGCGG
GAGGCGTTCG CGGCCCGAGG CCCGCGCGGG TTCGTCGAAC CGCTGTGGCG GGTCGCGACC
GAGCCACTGC TGTCGGACTA CTTCACCCCC CGCATCAGGA TGGTCTTCGA CGGCATGGCC
CGGGAGGCCG CCCGGATCTC GTACCCCGAG ATGCTGGTCT CCGGGCAGGT GCGGATGGTC
CGCCGCCGCG CGGACGGGGG GTACCTGACC GTCCTCACCC CCCCGGCCGG ACGGTCCGCG
ACGAAGCGGA TCGCCTACCG CAGCCGGTAC GTCCACCTGG GCGTCGGCTA CCCGGGGCTG
AAGTTCCTCC CGGACCTGCA GGAGTACCGC TCCCGGTATG GGGACGTGCG GCGGGTCGTC
AACGCCTACG AGCCGCACGA GCACGTCTAC GACGAGCTGA TCCGGCACCC GGCGACCGTC
GTGGTGCGCG GGGCGGGCAT CGTGGCCTCC CGCATCCTCG ACCGGCTGAT CACCGACCGC
GACCGGCACG GGGCGAGGAC GCACATCGTG CACCTGTTCC GCACGTATGT GCGCGGCTCG
CACGGCCCCA GCGTGTTCAT GCGCCGCCGG GGCGGCGACG GCTGGGCCTA CCAGGGGTTC
AACTACCCGA AGTCGGTCTG GGGCGGCCAG CTGAAGGCAC GCATGCGCAC GCTGGAGGGC
GACGAGCGGA AGGCGCTCTA CGACACGATC GGCGGGACGA ACACCCCCCG CCGGCGCCGG
TGGCAGGCTC AGCTCGCCCG CGGCCGGCAC GAGGGCTGGT ATGTGACCCG GGTGGGCGAG
GTGGAGCGGC TCACCCCCGG AACGGACGGA ACGGTAGTCA CCCGGGTCCG CACCGCCGAC
GGGATGCTGG AGGTGCCGGC GGCGTACGTC ATCGACGCCA CCGGCCTGGT GGCGGACATC
CGCGAGCACC GGGTGCTGGC AGACCTGCTC GACCACTCCG GAGCCGGGCA CAACCCGCTC
GGCCGGCTGG ACGTGGAGCG GACCTTCGAG GTCCGCGGGA CCCGCAACGG GCCGGGGCGG
CTCTACGCCT CGGGCGCGGC GACCCTCGGC GGCTACTTCC CGGGCGTCGA CACCTTCCTC
GGTCTGCAGA TCGCCGCCCA GGAAATCTGT GACGACCTGG CCGCGGAGGG ATTCGTGCCC
CGGATAGGGG TGGCACGTTC GGTGTCGCAG TGGGTGCGTT GGATGCGCAA CCAACCGGTC
TGA
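A GC content figure such as the one listed for this gene can be recomputed from a nucleotide sequence. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among the unambiguous bases. The record does not say how its value was computed or rounded, and the function name is illustrative.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among A/C/G/T bases; whitespace is ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# Only the first ten bases of the gene sequence above, not the whole gene.
print(gc_percent("TTGGTCGGCC"))  # 70.0
```

Because spaces and line breaks are skipped, the sequence block can be pasted in as displayed to obtain the value for the full gene.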
 
Protein sequence
MVGPHPPGGP DDPLHGPTTV LGNSAAPQAP PVTAPQAPPA AAPEQSPAPA RQARPAPVPG 
PQAPPAPARV GGGETYGDLT RRLIETSVWT DRMLADAGIP VHDAPLVTVG GGIGSFVLVD
YLRIAGAPTS AIRVLSNIDT PWQTYRYLTR VSQIPDHERI RSDSSSTPDN IWAFPSYAVR
EAFAARGPRG FVEPLWRVAT EPLLSDYFTP RIRMVFDGMA REAARISYPE MLVSGQVRMV
RRRADGGYLT VLTPPAGRSA TKRIAYRSRY VHLGVGYPGL KFLPDLQEYR SRYGDVRRVV
NAYEPHEHVY DELIRHPATV VVRGAGIVAS RILDRLITDR DRHGARTHIV HLFRTYVRGS
HGPSVFMRRR GGDGWAYQGF NYPKSVWGGQ LKARMRTLEG DERKALYDTI GGTNTPRRRR
WQAQLARGRH EGWYVTRVGE VERLTPGTDG TVVTRVRTAD GMLEVPAAYV IDATGLVADI
REHRVLADLL DHSGAGHNPL GRLDVERTFE VRGTRNGPGR LYASGAATLG GYFPGVDTFL
GLQIAAQEIC DDLAAEGFVP RIGVARSVSQ WVRWMRNQPV
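The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of several permitted start codons and an initiating codon is read as methionine. The sketch below builds the codon table from the standard NCBI amino acid string for table 11; the function name and the rule of stopping at the first stop codon are choices made for illustration.

```python
BASES = "TCAG"
# NCBI translation table 11 amino acids, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Start codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGTCGGCC CACACCCGCC CGGCGGCCCG"))  # MVGPHPPGGP
```

The output matches the first ten residues of the protein sequence listed above.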