Gene Franean1_5508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5508 
Symbol 
ID5673839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6672688 
End bp6675195 
Gene Length2508 bp 
Protein Length835 aa 
Translation table11 
GC content75% 
IMG OID641244363 
ProductATP-dependent protease La 
Protein accessionYP_001509769 
Protein GI158317261 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.528904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACACA CCAGGCTCCT CCCGCTGCTG CCGCTGGACG ACGCGGTCGT CCTGCCCGGC 
ATGGTGGTTC CGCTCGACCT GTCGGACGCC GGGACCCGCG CCGCCGTGGA CGCGGCCCGC
GGCGGCAGGC CCGACTCCAC ACCCGACGCC AGGGCGCCGG GCATCTCCAG CCGCTCCTTC
CAGCGCACCG CCGAGATCCT GCTCGTCCCC CGGGTGGACG GCGACCTCGC CGACATGGGC
GTCCTCGCCG TCGTCGACCA GATCGGCCGG CTGCCGAACG GCGGGACGGC CGCGCTCGTG
CGGGCCGTGT CCCGCGCCCG GGTCGGCACC GCCCCCACGC CGCCCGGCGC CGCCGACGCC
GGTGTCGTGT GGGCGGAGGC GACCCCTGTC GAGCCGGTCC TCCCGGCCGG CTTCACCGGC
ACGATCAGCG CGACGGCGAC CGCGGGCGCG CCGGGCGCCA CCGGCGACCC CGACGGCCCG
ACCGCCCGGC TGGTCGAGCT CGCCCGCGAG TACCGCACGC TGGTGACCGG GGTGCTGCGC
GCCCGCGGGG TCGGCCAGGT CGCCGACACC GTCGAGGCGA TCGAGAACCC CGACACGCTG
GCCGACACCG CCGGCTACTC GTCCTACCTG TCGACCGCGC AGAAGCTGGA GCTGCTGCGC
ACCGTCGACG TGACCGCCCG GCTGGAGCTG CTCGTCCCGT GGACTCGGGA GCACGCCGCC
GAGGTCGACG TGGCCGAGAC CATCCGGCGC GACGTCCAGG AGGGCGTCGA CCGCCAGCAG
CGGGAGTTCC TGCTGCGCCG TCAGCTGGAG GCCGTCCGCA AGGAACTGGC CGAGCTGGAC
GGGTCACCCG CGTCCTCCGA ACAGGAGGAC TACCGGGCCC GGGTCGAGGC CGCCGACCTG
CCGGAGAAGG TCCGCGCCGC GGCGCTCAAG GAGGTCGACA AGCTGGAGCG GACGGCCGAG
TCCTCCCCCG AGGTCGGTTG GATCCGCACC TGGCTGGACA CCGTCCTGGA GCTGCCGTGG
AACGAGCACG CCGAGGACAC CTACGACATC GCCTCGGCGC GGGCCGTGCT GGACGCCGAC
CACGCCGGGC TCAGCGACGT CAAGGACCGG ATCGTCGAGT ACCTGGCGGT CCGCCGCCGG
CGGGCCGACG CCGGGCTCGG GGTCGTCGGT GGGCGCCGCA GCGGCGCGGT GCTCGCGCTT
GCCGGGCCGC CCGGGGTCGG CAAGACCTCG CTGGGCGAGT CCGTGGCCCG CGCCATGGGC
CGCCGGTTCG CCCGGGTGGC GCTCGGCGGC GTCCGCGACG AGGCGGAGAT CCGCGGCCAC
CGGCGCACCT ACGTCGGTGC GCAGCCCGGT CGCATCGTGC GGGCGATCCG CGAGGCCGGC
TCGATGAACC CGGTGATCCT GCTCGACGAG GTCGACAAGG TCGGCTCCGA CTACCGCGGG
GACCCGACGG CGGCGCTGCT CGAGGTGCTG GACCCGGCGC AGAACCACAC CTTCCGCGAC
CACTACCTGG AGGTCGAGCT CGACCTGTCG GACGTGTTGT TCCTGGCCAC CGCGAACGTG
CTGGAGTCCA TCCCGGCGCC GCTGCTGGAC CGGATGGAGC TGATCCTGCT CGACGGCTAC
ACCGAGGAGG AGAAGGTCAC CATCGCCCGC GACCACCTGC TCCCGCGCCA GCTCGAGCGG
GCCGGGCTGA CCCCCGACGA CGTCACCGTG GACGACGCGG CGCTGCGGCT GCTGGCCGGC
GAGTACACCC GCGAGGCCGG CGTGCGTGAC CTGGAGCGGG CGACCGCCCG GGTGCTGCGC
AAGGTCGTCG CGAAGGTCGC ACTGGACGAG ACCGCCCTGC CGGTCACGAT CGGCGCCGGC
GATCTGGCCG GATACCTCGG ACGGCCGCGG CACACCCCGG AGTCGGCGGA GCGCACCGCG
CTGCCGGGGG TGGCGACCGG GCTCGCGGTC ACAGGGGCCG GCGGGGACGT CCTGTTCGTC
GAGGCGTCAC TGGCCGACCC GGAGACCGGG GCCAGCGGAG TGACGCTGAC CGGGCAGCTC
GGCGACGTCA TGAAGGAGTC GGCGCAGATC GCGCTGTCCT ACCTGCGCTC GCGCGGGGTG
GAGCTCGAAC TGCCGGTCGG CGACCTGCGT GACCGCGGGG TGCACATCCA CGTCCCGGCG
GGCGCGGTGC CCAAGGACGG GCCGAGCGCC GGGGTCACGA TGACCACCGC GCTGGCATCC
CTGCTCTCCG GGCGCCCGGT GCGCGCGGAC GTGGCGATGA CCGGCGAGGT CTCGCTGACC
GGTCGGGTGC TACCCATAGG TGGGGTCAAG CAGAAGTTGC TCGCCGCGCA CCGCGCCGGC
ATCGCAACGG TGCTCCTGCC GGCGCGCAAC GGCCCGGACC TTGACGACGT CCCGGCAGCG
GTCCGCGAGG CGCTGACCGT GCACCTGGTT GCGGACGTCC GCGAAGTGCT TGAGCTGGCG
CTGGAACCGG CCTTCGACAC GACGCACACC CACGCCGTGG CGGCCTGA
 
Protein sequence
MSHTRLLPLL PLDDAVVLPG MVVPLDLSDA GTRAAVDAAR GGRPDSTPDA RAPGISSRSF 
QRTAEILLVP RVDGDLADMG VLAVVDQIGR LPNGGTAALV RAVSRARVGT APTPPGAADA
GVVWAEATPV EPVLPAGFTG TISATATAGA PGATGDPDGP TARLVELARE YRTLVTGVLR
ARGVGQVADT VEAIENPDTL ADTAGYSSYL STAQKLELLR TVDVTARLEL LVPWTREHAA
EVDVAETIRR DVQEGVDRQQ REFLLRRQLE AVRKELAELD GSPASSEQED YRARVEAADL
PEKVRAAALK EVDKLERTAE SSPEVGWIRT WLDTVLELPW NEHAEDTYDI ASARAVLDAD
HAGLSDVKDR IVEYLAVRRR RADAGLGVVG GRRSGAVLAL AGPPGVGKTS LGESVARAMG
RRFARVALGG VRDEAEIRGH RRTYVGAQPG RIVRAIREAG SMNPVILLDE VDKVGSDYRG
DPTAALLEVL DPAQNHTFRD HYLEVELDLS DVLFLATANV LESIPAPLLD RMELILLDGY
TEEEKVTIAR DHLLPRQLER AGLTPDDVTV DDAALRLLAG EYTREAGVRD LERATARVLR
KVVAKVALDE TALPVTIGAG DLAGYLGRPR HTPESAERTA LPGVATGLAV TGAGGDVLFV
EASLADPETG ASGVTLTGQL GDVMKESAQI ALSYLRSRGV ELELPVGDLR DRGVHIHVPA
GAVPKDGPSA GVTMTTALAS LLSGRPVRAD VAMTGEVSLT GRVLPIGGVK QKLLAAHRAG
IATVLLPARN GPDLDDVPAA VREALTVHLV ADVREVLELA LEPAFDTTHT HAVAA