Gene Franean1_1198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1198 
Symbol 
ID5669611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1430065 
End bp1431828 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content72% 
IMG OID641240130 
Producthypothetical protein 
Protein accessionYP_001505558 
Protein GI158313050 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.441597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.262975 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGGCT CGGGCGGCGG CGGCTTCGGC GGATTCGGTG GCTTAGGCGG TTCAGGGGGA 
GCCGGAGGTG CGGGCGGTGC CGGCGGCTCG GGGGCCGTCA GCCGGCTACG CCGGTGGAGC
AGGCGGGATG ACGCCGACAG CGCCCCGGTG CTCGACCTCA CCGACGTTGA CGAGGTCACC
ACGATGTTGC TCGCGCCCCC GGCCGAGTAC GCGGGGCTGC GCCGGCGGGT CGTCGACCAG
TTCACGATTG TCAGTGCCAA CCGCTGTCGG CTTACCCGCT CGGTGAACTG GGGCCCGCTC
GACGGGCTGC TGAACAGCAT TGTTCCGGGT ACCGGGCGGC TCGACACCGT CGGCCGGCTG
CCGCCGGAGG TCTGCCTGCT GCTGCCGATC TCCACGTTGC CCAAGCGTGC CCTGGTCGGA
TTCAATCTCG CCGGTCCGAG TGGCTCCGAC GCCCACCTGA TGCCGTACGG GACGTCCGTC
GCGATCCAGG GCAACCTGAT CGCGCTGCTC GCCGACGCCA TCGGCTGCCC GCTGCCCGGC
CCGGCGCGCC GCGTCGTGGA CGCGATCTCC AGGTTCCGCC CGGGGCGGCT GTCCGGCAAC
CTGCCTGGGC TGGTGCCCCG CAAGGGACGT CCGCTCTCGC TCGAGGCCGT CGCGACCTAT
CTTGCGCGTG AGGCGGATCT GTCGGTGCCG GATGTGACAC TGCGGTCGTG GCAGGCGCTG
CTCGAGCCGG CCCAGCTTCT GCTGCGCTCG GCGCTGGCGG AGCCCTTCGA CCCGCTGAGC
AGTGCGGACA CGATGCTGCT CGCCGCCGGC GAACTTTGGC GTGATCCGGA CGTCCCCACA
CCGCTGGAGG TCGCGCACAT CGGTTACTAC CTGCGGGAGT TCACCGCCTG GATCGACGAG
CTGAGCCTGG CGGGCCCGGC CGCGGTGCCG GTGCTGAGCA CGGTGGCGGA GTACGGGCGG
CGCTGGGAGG CGCTCGCCGC CGTCACCCTG GACCCATACC GCCCCTGCTT GATCAAGATG
TCCGAGGAGC GGCGGACGGT GCTGGCTCGG CGTTGCCCCG TCCGTGACGA GTCGCTTACT
CTGCGGCAGC GGTGGCTGGC GCCGGTGGCC CTCGTCGACA TCGACCCGGG CGGCCCGGGC
AGCTACCACG TGAGCGTGGG CACCGACGAC ACCAGTATCG AGCTGAGCAC GCCGATCACG
GTCGACCTGG AGCACCGCCG GATCGCCCGG ACCTACGTCG AGGACGTCCA CCAGAACCGT
GAGGTGTACG CGTTCTACAC CACCGATGCG CGTCGGGCGG CGCGGGCGAA GCTGGTGGTG
GGCCTCAGCG TCTCGCCGGA CGTCGCCCGC GTCACGCTGG CGATCCTGGT CCTGATGTGT
CTCACCGTGG CCTTGTCGGC GTTGCCGTTC GAGCTCGGAG CTGACGCGGT CGCGGTCGTC
GCGGTGCCGT CGTCGTTCGC CGCGACCCTG CTGCTGACCA GGGAGCGGTC GAGCCTCGCG
GCCTGGGTGC TCGGCCCGGC CAAGCAGGCA CTGCTCGCGC TGCTGGTGGC GCTGGCCGTG
CTTTCGGGGC TGCGTGCCCT GGGCTGGCAC ACCCCGCCGG CGGACCCGGG CGGGTCGATC
ATGTCGGTTC CGGGAGTGTC GTCCCAGCTA GGGGCGCTGG CCCCGGCCGT GCCACCGGGG
CGTTCGGTGA GCTTGTCCGC ATTGGTGGCC GCGACGGACC GGCCGGGCAC GTGGCAGGTG
GGAGGAGCTG GTCCGCGCCG GTGA
 
Protein sequence
MAGSGGGGFG GFGGLGGSGG AGGAGGAGGS GAVSRLRRWS RRDDADSAPV LDLTDVDEVT 
TMLLAPPAEY AGLRRRVVDQ FTIVSANRCR LTRSVNWGPL DGLLNSIVPG TGRLDTVGRL
PPEVCLLLPI STLPKRALVG FNLAGPSGSD AHLMPYGTSV AIQGNLIALL ADAIGCPLPG
PARRVVDAIS RFRPGRLSGN LPGLVPRKGR PLSLEAVATY LAREADLSVP DVTLRSWQAL
LEPAQLLLRS ALAEPFDPLS SADTMLLAAG ELWRDPDVPT PLEVAHIGYY LREFTAWIDE
LSLAGPAAVP VLSTVAEYGR RWEALAAVTL DPYRPCLIKM SEERRTVLAR RCPVRDESLT
LRQRWLAPVA LVDIDPGGPG SYHVSVGTDD TSIELSTPIT VDLEHRRIAR TYVEDVHQNR
EVYAFYTTDA RRAARAKLVV GLSVSPDVAR VTLAILVLMC LTVALSALPF ELGADAVAVV
AVPSSFAATL LLTRERSSLA AWVLGPAKQA LLALLVALAV LSGLRALGWH TPPADPGGSI
MSVPGVSSQL GALAPAVPPG RSVSLSALVA ATDRPGTWQV GGAGPRR