Gene Franean1_6005 details


Gene Information

Locus tag: Franean1_6005
Symbol:
ID: 5674326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7323946
End bp: 7324968
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 77%
IMG OID: 641244853
Product: metalloendopeptidase glycoprotease family
Protein accession: YP_001510255
Protein GI: 158317747
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
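
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a single terminal stop codon (both are assumptions, though they agree with the values listed here). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(7323946, 7324968)
print(length)                     # 1023
print(protein_length_aa(length))  # 340
```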


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0902481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.147252
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGACAC CGGCCCGACC GCTGGTGCTG GGCATCGAGA CCTCGTGCGA CGAGACCGGC 
GTCGGCCTCG TCCGTGGCGG CGAGTTGCTC GCCGACGCGC TCGCCTCCTC GGTCGACGAG
CACGCCCGGT ACGGCGGCGT GGTGCCCGAG ATCGCCGCGC GGGCGCACCT GGAGGCGATG
GTCCCGACGA TCGAGCTGGC GTTGGACCGC GCCGGCCTGC GCCCCCGGGA CGTCGACGCC
GTCGCGGTGA CATCCGGCCC GGGGCTGGCC GGCGCGCTGC TGGTCGGGGT CGCCGCGGCG
AAGGCGTACG CGCTGGCGCT GGGTGTCCCG CTGCACGGGG TGCACCATCT CGCCGCGCAC
GTCGCCGTCG ACACGCTTGA GCACGGCCCG CTGCCGCGCC CGGCGGTGGC GCTGCTGGTC
TCTGGTGGGC ACAGCTCGTT GCTGCTGGTC CCCGATCTCG CGGCCGAGCC GGTCGAGTCG
CTGGGGGCCA CGGTGGACGA CGCCGCGGGG GAGGCCTACG ACAAGGTCGC CCGGTTGCTC
GGCATGCCGT TCCCGGGTGG CCCGCCGATC GACGCGGCGG CCCGCGAGGG CAGCCCGCGC
ATCCCGTTCC CGCGCGCCAA GGCGGGGGAC GGCACGTTCG ACTTCTCCTT CTCCGGGCTC
AAGACCGCGG TCGCCCGCTG GGTGGAGGCC CGGCGGCGGG CCGGCGAGCC CGTGCCGGTC
GCGGATGTCG CGGCGTCGTT CCAGGAGGCC GTGGCGGACG TGCTCACCGC GAAGGCGGTC
GCGGCCTGCC GGGCGCACGG TGTGGACACC CTGGTGGTCG GGGGCGGTGT CGCGGCCAAC
AGCCGGCTGC GCGTGCTCGC CGCGCTGCGC TGCGAGGCGG CGGGCATCAC GCTGCGGATC
CCGCGGCCGG GGCTGTGCAC CGACAACGGC GCGATGGTCG CGGCGCTCGG GTCGCTGCGG
GTCGAGGCCG GCGTCGAGCC CTCGCCGCTG GACCTTCCCG CGTCCTCCAC GCTCGCCCTC
TGA
 
Protein sequence
MRTPARPLVL GIETSCDETG VGLVRGGELL ADALASSVDE HARYGGVVPE IAARAHLEAM 
VPTIELALDR AGLRPRDVDA VAVTSGPGLA GALLVGVAAA KAYALALGVP LHGVHHLAAH
VAVDTLEHGP LPRPAVALLV SGGHSSLLLV PDLAAEPVES LGATVDDAAG EAYDKVARLL
GMPFPGGPPI DAAAREGSPR IPFPRAKAGD GTFDFSFSGL KTAVARWVEA RRRAGEPVPV
ADVAASFQEA VADVLTAKAV AACRAHGVDT LVVGGGVAAN SRLRVLAALR CEAAGITLRI
PRPGLCTDNG AMVAALGSLR VEAGVEPSPL DLPASSTLAL
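
The sequences in this record are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the calculation assumes only unambiguous A/C/G/T bases.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence, copied from this record.
first_row = "ATGAGGACAC CGGCCCGACC GCTGGTGCTG GGCATCGAGA CCTCGTGCGA CGAGACCGGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions would give a value that can be compared with the GC content field of the record.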