Gene Franean1_0454 details


Gene Information

Locus tag: Franean1_0454
Symbol:
ID: 5668876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 536845
End bp: 537933
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 69%
IMG OID: 641239386
Product: oxidoreductase domain-containing protein
Protein accession: YP_001504824
Protein GI: 158312316
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
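
The Start bp, End bp, Gene Length and Protein Length values in this record are mutually consistent. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene span includes a single terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the span ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(536845, 537933)
print(length)                           # 1089
print(expected_protein_length(length))  # 362
```

Both results match the Gene Length (1089 bp) and Protein Length (362 aa) fields above.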


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTCC GCTGGGCGAT CGCGGGCACC GGCGCCATCG CACACCAGTT CGCCGCCGCG 
CTGGGCCGCT TGCCCGACGC CGAGCTCGTG GCGGTCGGCT CCCGCCGCCA GCAGACGGCT
GACGCGTTCG GCGAACGGTT CGGCATCCCG CGCCGGCGCA GGTACAGCTC ATACGAGCGG
CTCGCCGCCG ACGACGGTGT CGATGTCGTC TACGTGGCCT CCCCGCACTC CCACCATCAC
CGTCACACCC TGCTGTTCCT GAGCGCGGGG CGGGGTGTGC TGTGCGAGAA GCCGTTCGCG
CTTGACGCCG AGCAGGCCGC CGAGATGGTG GTTGCCGCCC AAACCCATGG GCAGTTCCTG
ATGGAAGCCA TGTGGAGCCG CTTCCTGCCT GCCTACGTCA AGATCCGGGA GTTGGTCGCC
GCAGGCGCCA TCGGCACCGT CCTGGCCGTC GAGGGTGACT TCGGATTCCC CCGCCCGGTG
GATCCCGCCA ACCGGGTGTT CGATCTCGCC CAGGGCGGCG GCGCGCTGCT CGACCTGGGC
GTCTATCCGG TGTCGCTGGC CAGCATGCTG CTGGGCGAAC CCGACCGGGT GGTGGCACTC
GGCCAGCTCG GCGAGACCAG CGTCGACGAG CAGGTCGCCG TCCTGATGGG CTACCACACG
GGCGCGGTGG CTGTCGCGAA GGCATCCCTG CGCGCGAGTC TCGCCTGCAC CGGCCGGGTC
TCCGGCACGG AGGGCAGCAT CGAACTCGCC ACGTTCATGC ACTGCCCGGA CAACCTGACC
GTCCGACGGA AGTCCGGCAC CCAACGGCTA TACCTGCCCG CTGACAGTGA CGTCACTGCC
ACCGACGCCA CTGCCACCGC CGGTGATATC GACGGTGCCG ACCGCCGGAA CCATCGGGAC
CGGGCGGCCG GCGGCGGGCT ACATCACCAA ATCCGTCACG TGCACTTCCG GTTACAGGCC
GGCCACCTCG ACAGCGACAT CATGTCCCAG GCCGAATCCG TGTCGGTGAT GCGGACACTC
GATGCAGCCC GGGCCCAGAT CGGTCTGCGC TACCTGGCCA CCCTGATGGA TGGTAGCCGG
CAGAGTTGA
 
Protein sequence
MSFRWAIAGT GAIAHQFAAA LGRLPDAELV AVGSRRQQTA DAFGERFGIP RRRRYSSYER 
LAADDGVDVV YVASPHSHHH RHTLLFLSAG RGVLCEKPFA LDAEQAAEMV VAAQTHGQFL
MEAMWSRFLP AYVKIRELVA AGAIGTVLAV EGDFGFPRPV DPANRVFDLA QGGGALLDLG
VYPVSLASML LGEPDRVVAL GQLGETSVDE QVAVLMGYHT GAVAVAKASL RASLACTGRV
SGTEGSIELA TFMHCPDNLT VRRKSGTQRL YLPADSDVTA TDATATAGDI DGADRRNHRD
RAAGGGLHHQ IRHVHFRLQA GHLDSDIMSQ AESVSVMRTL DAARAQIGLR YLATLMDGSR
QS
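
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the commonly used compact 64-letter encoding (bases ordered T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in which codons may act as start codons. The `translate` helper is a hypothetical name, and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGTCGTTCC GCTGGGCGAT CGCGGGCACC"))  # MSFRWAIAGT
print(translate("CAGAGTTGA"))                         # QS
```

The two fragments translate to the first ten residues (MSFRWAIAGT) and the last two residues (QS) of the protein sequence listed above.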