Gene Franean1_0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0172 
Symbol 
ID5668597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp205839 
End bp208469 
Gene Length2631 bp 
Protein Length876 aa 
Translation table11 
GC content77% 
IMG OID641239101 
Producthypothetical protein 
Protein accessionYP_001504545 
Protein GI158312037 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.211375 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCGT CGACAGGCCG GGTCCCGGGC CCGCCCGGAA CGCTCGCCCA CTGGCTGCGC 
GCGCTGCCGG ACGACACCCT GGTCCGGCTG TTCAGACTCC GGCCCGATCT CGCCACCCCG
CCCCCACCAG ACTTCGACAT CCTCGCGGCG CGGCTGGAGA TCCGCGGCAG CGTGAGTCGG
GCGCTGGAGC GGCTGGACAC CTTCACCCTC GAGGTCGTCC AGGCGCTGAC GCTGCTGCCG
AGCCCGGTCT CCGTGGCCGA GCTGACCGCC TTCTGCGGCG GGCTCGACCT GCGTGCCGCC
CTCGAGAGCC TGCGGGAGCG ACTCGTCGTC TGGGGCCCGG ACGAGGCGTT GCGCCTGGTC
GGCGTCGCCT TCGACCTGCT GGGCGAGCGT CCGCTCGGGT TGGGGCGGCC GGTGCGGGCG
TGCCTGGCCG GCTACCGGCA GGCGCAGCTC GCCCGGGTGG CGGCGGCCGT GGGGCTGCCC
GACGCGGTGT CCGACGGGTT GTCCGCCAGC TCGGGCGGCC TCGGCGACTC GGACGACTCC
GATCTCGATC TCGATCTCGA CTCCGAATTC CACCTTGGTG CCGGCAGCCA TCCCGTCCGG
CGCGAGAGCA TGATCGAGAT GGTGTCGGCC GCCTTCGCGG ACCCTCACCG CGTCGCCGCC
CTGCTCGACG GGTGCTCGGC GCGGGCGCGG CGGCTGGCCG AACGGCTCGC TGCCGGCCCG
GCGCTGGGCG CTACCAGCGA CGCCGAACGT CTGCTGAGTG TGTCCTCCGC CCGCAGCCCC
GTCGAGGAGC TGCTGGCCCG CGGGCTGCTC ATCGGCATCG AGCCGGGCAC CGTCGAGCTG
CCCAGGGAGG TCGGCCTGGT GCTGCGCGGC GCCGACCAGG CCGGGCCGCT GCATCCCGAG
CCGCCTGAGG TCACCGGCCG CGAGGTGGGG GCGGGCGCCG TCGACCCGGC CGCCGCGCTG
GCCGCCGACG CGCTGGTCCG CGCGGTCACC ACGCTGCTCA CCGCCTGGGG GAGCACCCCG
GTCACCCCGC TGCGCACCGG CGGGCTCAGC GTGCGCGACC TCAAGAACAG CGCCCGGCTG
ATGGACGTCC CCGAGACCGA GGCGGCCGTC GTCATCGAGG CGGCGGCCGC GGCCGGGCTG
GTCGACCTGA CACCCGGCAC CGACGTCCAG TTCGTGCCGA CCAACGTCTA CGACCGGTGG
TGCACCGAGA CGGTGGCCAT GCGGTGGGCC GTCCTCGCCG AAGGGTGGCT GCGCTCGCCG
TCGGCGGCCT GGCTGGTCGG CGGGCGCGAC GAGCGCGGCC GGCAGATCGC CGCGATGTCC
CTCGACGCCC GCCGTCCCGG CGCCCCGGAC CTGCGCGCCG ATGTGCTGCG CGTCGTCGCC
GCCGCGCCGG AGGGCTTCGC GCCCACCCCC GAGTCGGTGC GCGCCCGCCT GGCCTGGCGT
TCCCCACGCC GGACGGGCCC GCTGCTCGAC GGGATGATCG GCGGGACGCT CACCGAGGCC
GAGGTGGTCG GGTTCACCGG GCGTGGTGCG CCCAGCACCC TCGGGCAGCT CGTCGCGCGC
CGCCTCGCCG CCGCCGAGGC CGACGACCCG GGGCGTGGCT CCGGCAGGTC CCCGGCGGCC
GACGAGGGGC TGTGTCGCCT GCTCGCCGAC GCGATGGCCC CGCTGCTGCC CGAACCCGTC
GAGGAACTGC TCATCCAGAC CGACCTGACG GCCGTCGCCC CCGGCCCGCT CGTGCCGCGG
GTCGCCGCCG AACTGTCCCG GATGGCTGAC ATCGAGTCCG CGGGCGCGGC CACCGTGTAC
CGCTTCACCG AGAGCTCGCT GCGGCGCGCG ATGGACGCCG GCAGCTCCGC CGACGACCTC
CACGACCTGC TGGGCCACCT CGCCCGTGGC GGCGTCCCGC AGTCGCTGAC CTACCTGATC
GACGACACGG CCCGCCGGCA CGGACGGTTG CGGTCCGGGC CGGCCGCCTC CTACCTGCGC
TGCGACGACA CCGCGCTGCT CACCGAGGTC GTCGCGTCCC GGCGCACCCA GGCCCTCGCG
ATGCGCCGGG TGGCCCCGAC GATCGTCATC TCCCCCCTAC CGGTGTCCGA CCTGCTGGAA
GGACTGCGCG CGGCCGGTTT CGCCCCGGTG GCGGAGGCCC CGGACGGGCG CATTGTGCTG
GCCCGCCCGG AGGTGCACCG CACCCCCGCC CGCGCCCGCC CGCCCGCCGC GGAGTCCGTC
CCGACCAGGT CGAACCAGCT GCGCGACGTC GTCCGTCTGG TGCGCCGGGG CGACGACAGC
ACCCGCGCCG CGCGGGCGGC GCAGGACGCC GCGGGGGCGC AGCTCGGGCT CGCACGCTCC
GCCCCGGTGA TCCTGGTGAT GCTGCAGGGC GCGGTCCGGG ACCGCCGGCG CGTCCTGCTC
GGCTATGTCA ACCAGCAGGG GACTCCCAGC GACCGGGTCG TGCGCCCGAC GCTGCTGGAG
GGCGGTTGGC TCACCGCGTG GGACGAGCGC AGCGAGGCTC CCCGCCGCTT CGCCCTGCAC
CGGGTGACCG GGGTGGCCGA CATCGACGAC CCGTTCGGCG GCCCGCCGGT GCCCGACACC
GGCGACTGGG TGGTTCCGCC GGCGGCCGAC GACCTGCCCG GCCCGCGCTG A
 
Protein sequence
MSPSTGRVPG PPGTLAHWLR ALPDDTLVRL FRLRPDLATP PPPDFDILAA RLEIRGSVSR 
ALERLDTFTL EVVQALTLLP SPVSVAELTA FCGGLDLRAA LESLRERLVV WGPDEALRLV
GVAFDLLGER PLGLGRPVRA CLAGYRQAQL ARVAAAVGLP DAVSDGLSAS SGGLGDSDDS
DLDLDLDSEF HLGAGSHPVR RESMIEMVSA AFADPHRVAA LLDGCSARAR RLAERLAAGP
ALGATSDAER LLSVSSARSP VEELLARGLL IGIEPGTVEL PREVGLVLRG ADQAGPLHPE
PPEVTGREVG AGAVDPAAAL AADALVRAVT TLLTAWGSTP VTPLRTGGLS VRDLKNSARL
MDVPETEAAV VIEAAAAAGL VDLTPGTDVQ FVPTNVYDRW CTETVAMRWA VLAEGWLRSP
SAAWLVGGRD ERGRQIAAMS LDARRPGAPD LRADVLRVVA AAPEGFAPTP ESVRARLAWR
SPRRTGPLLD GMIGGTLTEA EVVGFTGRGA PSTLGQLVAR RLAAAEADDP GRGSGRSPAA
DEGLCRLLAD AMAPLLPEPV EELLIQTDLT AVAPGPLVPR VAAELSRMAD IESAGAATVY
RFTESSLRRA MDAGSSADDL HDLLGHLARG GVPQSLTYLI DDTARRHGRL RSGPAASYLR
CDDTALLTEV VASRRTQALA MRRVAPTIVI SPLPVSDLLE GLRAAGFAPV AEAPDGRIVL
ARPEVHRTPA RARPPAAESV PTRSNQLRDV VRLVRRGDDS TRAARAAQDA AGAQLGLARS
APVILVMLQG AVRDRRRVLL GYVNQQGTPS DRVVRPTLLE GGWLTAWDER SEAPRRFALH
RVTGVADIDD PFGGPPVPDT GDWVVPPAAD DLPGPR