Gene Franean1_0051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0051 
Symbol 
ID5668477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp63682 
End bp65379 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content68% 
IMG OID641238980 
Producthypothetical protein 
Protein accessionYP_001504425 
Protein GI158311917 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCCGTG TCAGTCAACG TGCTGAGCAG CGGCAGCTTC GTGGTCGGAT GCGCGAGTTC 
GGCATGAGCC ATGATGAGAT CGCTGGCGAG TTCGGGCGGC GGTTTCGTTT CCGGCCTCGG
GCCGCGTTTC GGCACGCGTT CGGGTGGACG CTCGAGGAGG CGGCCGACGC GATCAATGCT
CAGGCGGCGA GCCTCGGGCT CGATCCGGTA GGCCGGGCGA GCATGACGGG GCCGCGGCTG
TCGGAGCTTG AGTCCTGGCC GGTTTCTGGT GCCCGGCGGC CCACGCCGCA GATCCTCGCA
CTGCTCGCGC ACGTGTATGG CACGGACATC CATCGCCTTC TCGATCTTGA AGACCGTGAG
CACCTAGCGC CCGCGGATCT TCTTCTCCTC GACAGCTGTG TGCGTGAGGT CGGTGCTGCG
CGTGGACAGT CAGCGGTCAC GCCGACCAGC GCCACTCGTG GTCGTAACTC CCTTCCACGG
TCCGGGGCGG CGGGTGGTCT GCCGCAGCCT GCGCTTCTGC AGGCATTCGT AGACAATGAG
CCAATATCCA GGCCTCCTGC TGTGCGTTCG CAGGCGCCGG TGCCTGTCGG GGGTGAGGGA
GAGGGGTCAC CGACGAAGCG TCGCGAGGTT CTCGCCGCTG TGGGCGCGTC TGCCGTCGCG
GCCCTGCTCG CCCAGTCGGC AGCGGAGTCG GCGATGTTCG GGCAGCGCTG GGAGGAGTCC
GATCTCGGCT CGACAACGCT GGAGCACCTT GACCTGGAGG TGCAGCGGTT CGGCCTTTCC
TACCTGCATA CTCCGCCCGA GCAGCTGTTC GTCCAGGTCC GGGAGTGCCG GCAGCGGGTC
TCCGTGATGG TGGCGGGACG CCAGACGCTG GGCCAGCGGC AGCACCTGTG CGTTGTCGGT
GGATGGCTGT CGGGGCTAAT GGGGTCTCTG GCGCTCGATC TCGGTGACAG CACCGTGGCT
CATGCCCACT GCCTGACCGC GTGGCAGCTC GCTGTGGAGT CCGGCGATGC CCGACTGGGC
GGGTGGGTCC GAGGAACCCA GGCGATGATC GCTTTCTACA CCGGCGACGC GGCGGACGCG
TTGCGTTACG CGCTGGCTGG TCTTGACGTC GCTCCGCGGA ACTCGTTTGT CCGGACTCGA
CTTCTTGCGC AGCTGGGACG AGCCCATGCG CGCCTGGGAG ACGCCGACGG TGTGAGCGTC
GCGTTCGCCC GTGCGGATGA CTCCTTCGAG TCGGTGACGG AGACGGCGTC GCACAGCATC
TTCTCTTTCG ACTACCCTTA CCTGCCGTTC TACGCCGGCA CGGCCTACAC TGCGCTCGCC
AGGCCTGAGA AGGCGCGGGC TTTCGCCCAG CAGGCGGTGA CGTTGTGCGA CGCGGCGGCG
GTGAACTGGC CGGTCGCGCG GGCCTTGGCC CGGGTGGATC TCGCGCTCGC CGCAGTGCGG
CAGAAGGAAC CGGACAGGGC GTGCGCCGTG ACCGTCGAGG CGCTGGAGAT CTGCGCGACC
GAACGCCCGT TGGATCTGAT CGTCCGCCGC ACCGGTGAGG TCGTGCGGGA GCTGCGTCCC
TACAAGGAGC TGTCGTCCGT CCGAGACCTT CACGAGCGCC AGACGAGCCT GGCCCGCACG
GTCCGAGGCA CACAGCGGAC CCCTTCCGAC CCCGGCGCCA GTTCAGCGGT GCATGATCGA
CAGGCGGACA TCCTCTAG
 
Protein sequence
MARVSQRAEQ RQLRGRMREF GMSHDEIAGE FGRRFRFRPR AAFRHAFGWT LEEAADAINA 
QAASLGLDPV GRASMTGPRL SELESWPVSG ARRPTPQILA LLAHVYGTDI HRLLDLEDRE
HLAPADLLLL DSCVREVGAA RGQSAVTPTS ATRGRNSLPR SGAAGGLPQP ALLQAFVDNE
PISRPPAVRS QAPVPVGGEG EGSPTKRREV LAAVGASAVA ALLAQSAAES AMFGQRWEES
DLGSTTLEHL DLEVQRFGLS YLHTPPEQLF VQVRECRQRV SVMVAGRQTL GQRQHLCVVG
GWLSGLMGSL ALDLGDSTVA HAHCLTAWQL AVESGDARLG GWVRGTQAMI AFYTGDAADA
LRYALAGLDV APRNSFVRTR LLAQLGRAHA RLGDADGVSV AFARADDSFE SVTETASHSI
FSFDYPYLPF YAGTAYTALA RPEKARAFAQ QAVTLCDAAA VNWPVARALA RVDLALAAVR
QKEPDRACAV TVEALEICAT ERPLDLIVRR TGEVVRELRP YKELSSVRDL HERQTSLART
VRGTQRTPSD PGASSAVHDR QADIL