Gene Franean1_0222 details

Gene Information

Locus tag: Franean1_0222
Symbol:
ID: 5668647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 270648
End bp: 271739
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 73%
IMG OID: 641239151
Product: hypothetical protein
Protein accession: YP_001504595
Protein GI: 158312087
COG category:
COG ID:
TIGRFAM ID:
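
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the last codon is a stop codon not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(270648, 271739)
print(length, protein_length(length))
```

With the values from this record, the sketch reproduces the listed gene length of 1092 bp and protein length of 363 aa.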


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.338684
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTGG ATGCCCGTCG GGAAGTTCCT ACCGGAACAC GGTCGAGCGA GGTCACGTAC 
GACCATCTGA TCATTCCGCC GCACTGGCGG CGGCCGGCGG GGCCCGGGCC GGGGGAGCCG
AACCCGCTGT TCCCCGCGGG CGAGCAGGCG ACGGCACCGC CGCCCGGCCC CGCCGTGACC
GGCGCGTCTC CGATCAATCC GATGACCGGC CCGATGACCG GTCCGCTGCT GGTCGGCACG
ACTGCCCGCG CCTTCGCCAA GCTGGACGAC ATGGTCCGGC TCGGTGACGA CGACCGCGCG
GTGATCGACG ACCGACGGGC GGAGGCGGAG CGGGCGCTGC GGTCGATCTT CCCGCCGCGC
TGCGCGCTGC CCCTGGTGGG CGTGGCCACG ATCGGGTCCG CCGGGCGCGA CACGATGATC
CGTCCGCTCG ACGAGGTGGA CATCTTCGTG GTCTTCAGCG CGGCGAACAG CGCGTGGAAG
CGCTTCCGGT GGGATTCTCG CGACCTGCTC GTCTGCGTCC GCAACGCCAT CGGTGGCGAC
CGGGTGCAGA CGATCGGCAC CCGCGGCCAG GCGCTGCGCA TCGTCTACGA CGCCGCGCCG
GACGTCCACC TCGTGCCGGC CTTCGACCAC CCCCGCGCCG GCTACGTCAT CCCGGACAGA
GTGGGCGGCT GGCTGCCGAC CCGGCCGGAG CGGCACGCGA GCTGGACGAT GGACCTCGGC
CCGCGGGTCA TCTCGGCGGT CCGGCTGCTC AAGGCGTGGA ACCGGGTGTG CGGCAGCCAC
CTGCGCTCGT TCCACATCGA GGCGCTCGCG GGGCAGGTGC TCGCGGGCCG CGGTCTCAAC
ACGCGCCAGG GCCTCGCCGA GGTGTTCCGG CACATGGACG AGGTCGGCCT CGTGGTCGGC
GATCCGTCCG ACATCCGCGG TGACCTGTCC AGCTACCTCC GCCAGGACGA CCTCGAGGAT
CTTGGCGCCT TCGTCCGCCA GGCACGCACC TACTCGGCCA AGGCGGTCGC GGCCGAGCGC
GCCGGGGACC ACGAGGAGGC CGTCAGCCTG TGGGGCACCG TCTTCGGCCC GGAGTTCCCG
ACCTTCGGGT GA
 
Protein sequence
MSVDARREVP TGTRSSEVTY DHLIIPPHWR RPAGPGPGEP NPLFPAGEQA TAPPPGPAVT 
GASPINPMTG PMTGPLLVGT TARAFAKLDD MVRLGDDDRA VIDDRRAEAE RALRSIFPPR
CALPLVGVAT IGSAGRDTMI RPLDEVDIFV VFSAANSAWK RFRWDSRDLL VCVRNAIGGD
RVQTIGTRGQ ALRIVYDAAP DVHLVPAFDH PRAGYVIPDR VGGWLPTRPE RHASWTMDLG
PRVISAVRLL KAWNRVCGSH LRSFHIEALA GQVLAGRGLN TRQGLAEVFR HMDEVGLVVG
DPSDIRGDLS SYLRQDDLED LGAFVRQART YSAKAVAAER AGDHEEAVSL WGTVFGPEFP
TFG
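
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It is plain Python with no external libraries; the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. The sketch strips the spaces used to group the sequence into blocks of ten, translates until the first stop codon, and computes GC content as a percentage.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks and lines."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on codons containing ambiguous bases such as N.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGAGCGTGG ATGCCCGTCG GGAAGTTCCT ACCGGAACAC GGTCGAGCGA GGTCACGTAC"
print(translate(first_line))
```

Applied to the first line of the gene sequence, the sketch yields the first 20 residues of the protein sequence listed above.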