Gene Franean1_0288 details

Gene Information

Locus tag: Franean1_0288
Symbol:
ID: 5668712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 338215
End bp: 339744
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 73%
IMG OID: 641239218
Product: hypothetical protein
Protein accession: YP_001504660
Protein GI: 158312152
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
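
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed 1530 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(338215, 339744)
print(length, protein_length_aa(length))  # prints "1530 509"
```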


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGG GGCTCCTGGT GACGGGCACG ACACAGGCGA TCCGTGCTTC GCACGAGATC 
CTCGAACGGG CGCTGGAGCT GTCCCGCGCG GACGGCGCGA TCGTGATCGC CAGCGAGTCG
AGCACGGTGA ACCTGCGCTG GGCGAACAAC ACGCTCACCA CGAACGGTGC CGCGCGTGAC
CGCTCCGTGA CCATCATCAG CGTGATCGGG CGCTCGTTCG GGGTGCGCTC CACCTCGACG
GTGGACGGGC CGGGCGTCGG GGCCGATCTG GCGGGCCTGG AGGCGCTGGT CCGCGCATCC
GAGGCGGCCG CACGCGAATC AGACGACGCC GAGGACTACA GCGACCTGGT CGTGCCCGCG
GCGGCGGCGG TCGGCGGCCG GTCCTTCACC GACCCGGCCG AGCGGACCAG CAGCGAGGTG
TTCGCCTCCT TCGCCGACGA CCTGGCCGAG GCGTTCGGCG CGGCGCGGGC GGAGGGCCGG
CGCCTGTTCG GTTTCGCCGA GCACGACCGC ACCACGACCT GGCTCGGGAC GTCGACCGGC
CTGCGGCTGC GGCACAGCCA GCCCACCGGC TCGGTGGAGT GGAACGCGAA GAGCGCCGCC
CCCGGTGGCT CCGTGTGGCA CGGGCAGTCC ACCCAGGACT TCACCGATGT GGACGTCGCC
GCGACCGACG CGCGTCTGCG GGCCCGGCTG CGCTGGTGCG ACCGGTCGGT GGAGCTGCCG
GCCGGCCGGT ACGAGACGCT TCTCCCCCCC TCCGCCGTCG CCGACCTCAT GGTCTACCTG
TACTGGTCGG CGGCCGGGCG GGACGCCGCC GAGGGACGCA CGGTGTTCAG TCGCGCCGGC
GGCGGGACGC GCGTGGGCGA GGCGCTCGGC CCCGCCGGCC TGCGCCTGTG GAGCGACCCC
AGCGCGGCGG GCCTGACGAG CGCGCCGTTC GTGACGGCTG GTGCGTCCTC GGCGACCTCG
AGTGTCTTCG ACAACGGGCT GCCGCTGGGA CCGACCGACT GGATCCGGGA CGGTCGTCTC
AACGCGCTTG TCCAGACCCG CTCGTCGGCG CGCGCCGCGA GCCTGGCCGC CCCGGCCGCC
GCGCCGAACA GCACCAGCAA CAGCTCCGGC AACGGCATCA GCGCCGGCGC GGCCGGCCTC
GCCGGTCAGG GCATCGCCGT GACCCCCTTC GTCGACAACC TGCTCCTCGA CGGCGGCGGC
ACCGCCACCC TGGACGAGAT GATCACCTCG ACGCGGCGCG GCCTGCTCCT GACCTGCCTG
TGGTACATCC GGGAGGTCGA CCCGCAGGTC CTGCTACTCA CCGGCCTCAC CCGGGACGGC
GTCTTTCTCA TCGAAAACGG TGAGGTCGTC GGGGCGGTCA ACAACTTCCG GTTCAACGAG
TCGCCGGTGG ACCTGCTGGG CCGCATCGCC GAGATCGGTG CCAGCACGCG GACGATGCCG
CGGGAATGGG CCGACTGGTT CACGCTGGCC AGAATGCCCG CGCTGCGGAT TCCGGACTTC
AACATGAGCT CGGTGAGCCC GGCGAGCTGA
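
The GC content reported in the gene information can be recomputed from a sequence laid out as above. This is a minimal sketch, assuming GC content is simply the share of G and C bases among all bases, with the spaces and line breaks of the 10-base block layout ignored. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the value in the gene information.
first_line = "ATGAGCGAGG GGCTCCTGGT GACGGGCACG ACACAGGCGA TCCGTGCTTC GCACGAGATC"
print(round(gc_percent(first_line), 1))
```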
 
Protein sequence
MSEGLLVTGT TQAIRASHEI LERALELSRA DGAIVIASES STVNLRWANN TLTTNGAARD 
RSVTIISVIG RSFGVRSTST VDGPGVGADL AGLEALVRAS EAAARESDDA EDYSDLVVPA
AAAVGGRSFT DPAERTSSEV FASFADDLAE AFGAARAEGR RLFGFAEHDR TTTWLGTSTG
LRLRHSQPTG SVEWNAKSAA PGGSVWHGQS TQDFTDVDVA ATDARLRARL RWCDRSVELP
AGRYETLLPP SAVADLMVYL YWSAAGRDAA EGRTVFSRAG GGTRVGEALG PAGLRLWSDP
SAAGLTSAPF VTAGASSATS SVFDNGLPLG PTDWIRDGRL NALVQTRSSA RAASLAAPAA
APNSTSNSSG NGISAGAAGL AGQGIAVTPF VDNLLLDGGG TATLDEMITS TRRGLLLTCL
WYIREVDPQV LLLTGLTRDG VFLIENGEVV GAVNNFRFNE SPVDLLGRIA EIGASTRTMP
REWADWFTLA RMPALRIPDF NMSSVSPAS
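
The protein sequence can be reproduced from the gene sequence by codon-wise translation. This is a minimal sketch in plain Python, assuming the listed gene sequence is the coding strand read in frame from its first base. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as starts; this sketch does not treat alternative start codons specially, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop spaces and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate_cds("ATGAGCGAGG GGCTCCTGGT GACGGGCACG"))  # prints "MSEGLLVTGT"
```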