Gene Franean1_0875 details


Gene Information

Locus tag: Franean1_0875
Symbol:
ID: 5669289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1020729
End bp: 1021727
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 75%
IMG OID: 641239802
Product: hypothetical protein
Protein accession: YP_001505237
Protein GI: 158312729
COG category: [R] General function prediction only
COG ID: [COG1611] Predicted Rossmann fold nucleotide-binding protein
TIGRFAM ID: [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
            [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2
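
The coordinate and length fields above are consistent with each other, which a short check can illustrate. This is a minimal sketch, not part of the record: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1020729
end_bp = 1021727

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 999 332
```

Under those assumptions the computed values match the reported Gene Length (999 bp) and Protein Length (332 aa).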


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.237559
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTCGGC CCGCCCACCC TGATCCGCCG GGCGGCGGAT CGACGTCCCA GCCGATCGTT 
CCGACGGCGG CCGCGCGCCC AGCCGGTGAT GGTCCGGCGG ACGTCCCCCC GCCCGTTCCC
GCCGTCGCGC CCGGGCCCGC GGTTGCTCCC GCGGCCGCGG TGTCCAGGCG TACCCGGCGG
GGTGGCGGGC CGCCGCCCGA GCAGCGCCGC GGCCCGGTCA CGGCGCGGCG CGGGCAGGTG
GAGCACTCCA CCACCGACCA ACGCCTGCTC GACACCCGCA GCCCGGCCTC GTTCGTCCAC
AGTGACCCGT GGCGCGTCCT GCGCATTCAG AGCGAGTTCG TCGAGGGCTT CGGGCTGCTG
GCGGATCTGC CCCCAGCCGT CACGGTCTTC GGGTCGGCCC GGGTCGGCCG GGACGAACCC
GAGTACGAGC TGGGACGCCG GCTCGGCGCC GCGCTGGCCG ACGCCGGCTA CGCGGTGATC
ACCGGCGGCG GGCCGGGCGC GATGGAGGCG GTCAACCGGG GGGCGCAGGA GGCCGGCGGG
CTCTCGGTCG GCCTCGGCAT CGAGCTGCCC TTCGAGCAGG ATCTCAACGA CTGGGTCGAT
CTGGGCGTCA GCTTCCGGTA CTTCTTCGTC CGCAAGACGA TGTTCGTGAA GTACGCCGAG
GCCTTCGTCA TCATGCCGGG CGGGTTCGGC ACCCTCGACG AGCTCTTCGA GGCCCTCACC
CTGCTGCAGA CGGGCAAGGT GACCCGGTTC CCGGTGGTGC TCATGGGCAC GGCCTACTGG
TCGGGCCTGC TGGAGTGGCT GCGCTCGACC GTCCTCGGCT CCGCCCGGAT CAAGCCGGGC
GACCTCGACC TGGTGACCAT GACCGACGAC GTCGACGAGG CCGTGCGCCT GATCCTCGAG
GGGACCGGCC GTGCCGGCCC GCCCGCCGCG GCCACCTCCG GCGACGAGAC CGCCAGCGAG
GTCGGCGGGG CCGCCGCGGC CGGTGGGGCG CCCTCGTGA
 
Protein sequence
MTRPAHPDPP GGGSTSQPIV PTAAARPAGD GPADVPPPVP AVAPGPAVAP AAAVSRRTRR 
GGGPPPEQRR GPVTARRGQV EHSTTDQRLL DTRSPASFVH SDPWRVLRIQ SEFVEGFGLL
ADLPPAVTVF GSARVGRDEP EYELGRRLGA ALADAGYAVI TGGGPGAMEA VNRGAQEAGG
LSVGLGIELP FEQDLNDWVD LGVSFRYFFV RKTMFVKYAE AFVIMPGGFG TLDELFEALT
LLQTGKVTRF PVVLMGTAYW SGLLEWLRST VLGSARIKPG DLDLVTMTDD VDEAVRLILE
GTGRAGPPAA ATSGDETASE VGGAAAAGGA PS
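
The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The sketch below does this for the first row of the gene sequence only (60 bases, 20 codons). It assumes the codon assignments of translation table 11, which are the same as those of the standard table, and it simplifies initiation by reading the first codon (here GTG) as methionine. The helper names `translate_cds` and `gc_fraction` are hypothetical, not from any library.

```python
# First row of the gene sequence and first 20 residues of the protein,
# copied from the record; the column spacing is stripped below.
GENE_PREFIX = (
    "GTGACTCGGC CCGCCCACCC TGATCCGCCG GGCGGCGGAT CGACGTCCCA GCCGATCGTT"
).replace(" ", "")
PROTEIN_PREFIX = "MTRPAHPDPP GGGSTSQPIV".replace(" ", "")

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (assumed shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a coding sequence whose first codon is the start codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # simplification: any start codon is read as Met
    return "".join(residues)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


print(translate_cds(GENE_PREFIX) == PROTEIN_PREFIX)  # prints: True
print(f"GC fraction of the first 60 bases: {gc_fraction(GENE_PREFIX):.2f}")
```

Applying the same functions to the full 999-base sequence would need the remaining rows joined in the same way; the translation would then end in `*` for the terminal TGA stop codon.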