Gene Franean1_3364 details

Gene Information

Locus tag: Franean1_3364
Symbol:
ID: 5671735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3987893
End bp: 3989848
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 71%
IMG OID: 641242252
Product: endothelin-converting protein 1
Protein accession: YP_001507672
Protein GI: 158315164
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
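
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3987893
end_bp = 3989848
reported_gene_length_bp = 1956
reported_protein_length_aa = 651

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1956
print(protein_length_aa)   # 651
```

Under these assumptions both computed values agree with the reported gene length and protein length.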


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACATCC TCGACGACGC CCGCGAGGGC ATGGACCTCG ACGTCCGACC GCAGGACGAC 
CTGTTCGGCC ACGTGAACGG CCGGTGGCTC GCCGAGACGG AGATCCCGTC CGACCGGTCG
AGCTGGGGCC CGTTCGTGCA GCTGGCCGAT GACGCCGAGC GGCAGGTCCG CGACATCATC
ACCGACCTCG CCGCGCGGGA CCAGGCCACC CAGGGCGAGG ACGCGCGGAA GATCGGCGAC
CTCTACAACT CCTTCATGGA CACCGAGGCG CTCGAGGCGC TCGGCCTGGG CCCGGTGCGG
CCGCTGCTGG ACGGCGCGCG CGGGCTGAGC GACGTCCGCG GCCTCGCCGC GTTCCTCGGC
GAGCTCGAGC GGATCGGCGG CGCCGGACTG TTCGGCTCCT ACGTCGACAC CGACGACCGC
AACTCGGACC GCTACCTGTT CCACCTGCGC CAGGGCGGCC TCGGCCTGCC GGACGAGTCG
TACTACCACG ACGACAAGTT CGCCGCGACC CGCCAGAAGT ACGTCGACTA CCTGACCCGG
ATGCTCGGGC TGGGCGGGCA CCCCGACCCC GAGGGCGCCG CGCAGCGGAT CCTCGACGTG
GAAACCCTGC TCGCGAAGGG CCACTGGGAG CGGGCCGAGA CCCGCGACGT CCAGAAGACC
TACAACCTGA TGACGGGCGG ACAGCTGGCC GCGCTCTGCC CGGCGTTCGA CTGGGACGCC
TACGTCACCG GTCTCGGTGG GTCGCTGACC GGCCCGCACG CGACGCTGGC GGAGGCGTGC
GTGCGGCAGC CGTCGTTCTT CGAGCACCTG TCGACCGTGC TGACCGACAC GCCGGTCGAC
GTGTGGCGCG ACTGGCTGGT CAGCCGCGTG CTGCGCTCGG CCGCGGCCTA CCTGCCCGAC
GTGTTCACCG AGACCCACTT CGACTTCTAC GGCCGCACGC TCAGCGGCAC GCCCGAGCTG
CGGGCCCGCT GGAAGCGCGC GGTGGCGTTC GTCGAGGGCG CGATCGGGGA GTCTGTCGGC
AGGGAGTACG TCGCCCGGCA CTTCCCGCCG CACGCCAAGG CGCAGATGGA CGACCTCGTC
GCGAACCTGC TCGCGGCCTA CCGCTCGTCG ATCTCCCAGC TGGACTGGAT GACGGAGGAG
ACCAAGCAGC GGGCGTACGA GAAGCTCGAG ACGTTCCGGC CGAAGATCGG CTACCCCGAC
CGGTTCCGGG ACTACTCGGC GCTGCCGGTC CGCCGCGGCG ACCTGATGGG CAACGCCCGC
GCAGCCGCCG CGTTCGAGAC CGACCGGGAG CTGGCCAAGA TCGGCTCGCC GGTGGACCGC
GACGAGTGGT TCATGCTCCC GCAGACCGTC AACGCCTACT ACAACCCGGG CACCAACGAG
ATCTGCTTCC CGGCCGCCAT TCTCCAGAAG CCGTTCTTCA GCCCGGACGG CCACCCGGCC
GAGAACTACG GCGGCATCGG CGCGGTGATC GGCCACGAGG TCGGTCACGG CTTCGACGAC
CAGGGCGCGC AGTACGACGG CGCCGGCAAC CTCAACGACT GGTGGACGCC CGCCGACAAG
GCGGCCTTCG AGGTGAAGTC GAAGACGCTG GTCGAGCAGT ACAACGGGTT CGAGTCGCGC
AACCTGCCGG GCGAGAAGGT GAACGGCGCG CTCACTGTCG GGGAGAACAT CGGCGACCTC
GGCGGCCTGA CCATCGCCCA CCAGGCCTAC GTCATCTCCC AGGACGGCGA GCCGTCGCGG
GAGGACCGAC GTCGGCTGTT CATGAACTGG GCCTACGTGT GGCGCTCCAA GCGCCGGCTC
GAGCTGGAGC GGCAGTACCT GACCACCGAC CCGCACAGCC CGCCGGACCT GCGCGCCAAC
ATCGTGCGCA ACCTCGACGA GTTCCACGAC GTCTTCGGCA CCGAGCCCGG CGACGGGCTG
TGGCTGGACC CGGCCGACCG GGTCCGCATC TGGTAG
 
Protein sequence
MNILDDAREG MDLDVRPQDD LFGHVNGRWL AETEIPSDRS SWGPFVQLAD DAERQVRDII 
TDLAARDQAT QGEDARKIGD LYNSFMDTEA LEALGLGPVR PLLDGARGLS DVRGLAAFLG
ELERIGGAGL FGSYVDTDDR NSDRYLFHLR QGGLGLPDES YYHDDKFAAT RQKYVDYLTR
MLGLGGHPDP EGAAQRILDV ETLLAKGHWE RAETRDVQKT YNLMTGGQLA ALCPAFDWDA
YVTGLGGSLT GPHATLAEAC VRQPSFFEHL STVLTDTPVD VWRDWLVSRV LRSAAAYLPD
VFTETHFDFY GRTLSGTPEL RARWKRAVAF VEGAIGESVG REYVARHFPP HAKAQMDDLV
ANLLAAYRSS ISQLDWMTEE TKQRAYEKLE TFRPKIGYPD RFRDYSALPV RRGDLMGNAR
AAAAFETDRE LAKIGSPVDR DEWFMLPQTV NAYYNPGTNE ICFPAAILQK PFFSPDGHPA
ENYGGIGAVI GHEVGHGFDD QGAQYDGAGN LNDWWTPADK AAFEVKSKTL VEQYNGFESR
NLPGEKVNGA LTVGENIGDL GGLTIAHQAY VISQDGEPSR EDRRRLFMNW AYVWRSKRRL
ELERQYLTTD PHSPPDLRAN IVRNLDEFHD VFGTEPGDGL WLDPADRVRI W
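
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 shares its codon-to-amino-acid assignments with the standard genetic code, and that an alternative start codon such as GTG is read as methionine at the first position. The helper `translate_cds` is hypothetical and written only for this illustration. It is applied to the first row of the gene sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence that begins at its start codon.

    Hypothetical helper for illustration; not part of the source record.
    """
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    # Assumption: the first codon is the initiator (here GTG) and is read as M.
    if protein:
        protein[0] = "M"
    return "".join(protein)


# First row of the gene sequence, copied from this record.
first_row = "GTGAACATCC TCGACGACGC CCGCGAGGGC ATGGACCTCG ACGTCCGACC GCAGGACGAC"
print(translate_cds(first_row))  # MNILDDAREGMDLDVRPQDD
```

The printed 20 residues match the first two groups of the protein sequence, MNILDDAREG MDLDVRPQDD.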