Gene Franean1_0943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0943 
Symbol 
ID5669357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1103070 
End bp1104116 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID641239870 
Productpeptidase S16 lon domain-containing protein 
Protein accessionYP_001505305 
Protein GI158312797 
COG category[T] Signal transduction mechanisms 
COG ID[COG3480] Predicted secreted protein containing a PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.652733 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGCC GCGCGCAGAC ACTCTTCGTT GCGTCGGTGC TGACGCTGAT CCTCGCCGTC 
GCGGGCCTGT GGCTGCCGGT GCCCTTTGTG ACCTTTGCGC CCGGTCCGGT TACCGACACT
CTCGGTAAGG TCGACGGGGC GCCGCTGATC GAGATCGACG GCCGCAAGAC CTACCCGACG
GCCGGCAGCC TCGAGCTGAC CACCGTCGAG GAGACGCCCC GGCTGAACCT GCTCGGCGCC
CTGCAGGACT GGTTCGCCTC CGACCGGGCT GTTGTGCCCA GCGAGCTGGT CCGTCCGCCG
GGCTCCAGCC AGGAGCAGAT CCGGCAGGAG AACACCCAGG CGATGATCGA CTCCCAGGAC
CAGGCGACCG CGGCCGCGTT GAGCGAGCTC GGCATCGCCC CGACCGGGGC AGAGGTCGTC
GTACACGAGG TGCTGTCCGA CTCGCCGGCC CAGGGCCTGC TGCGCGCCGG TGACGTCATC
ACCTCCGTCG GCGGCGTCGC CATCACCGGT CAGGACGGCC TGCGGGAGCA GATCGGCCGG
GTGAAGCCCG GTGAGGTGGT CGAGGTCGCC TACCGGCGGG ACGGCACCGC CGGCACCGGG
CGGATCACCA CCAGGCCGGC GACGGACGAT CCGACGCGGC CGATGATCGG GGTGACGACG
ACGGAGAAAC GGTCGTACCC GTTTACCGTC CGTATTCGGA TTTCCGACAT CGGCGGGCCC
AGCGCCGGCC TGATGTTCGC GCTGGGCATC GTCGACCTGC TCACCCCCGG CGAGCTGACC
GGCGGGAAGA CGGTCGCCGG CACCGGGACC ATCGACGCCG CTGGCGAGGT GGGCCCCATC
GGGGGCATCC AGCAGAAGAT CCTCGGCGCG CAGCGGGCCG GCGCCTCGGT CTTCCTGGTC
CCGAAGGGCA ACTGCGCGGA CGCCGTCCGG ATGAACACCG ACCTGCGGCT TGTCCAGGTG
ACGAACCTGT CGGGAGCCCT GGACGCGCTC AACACCCTGC GGGCCAAGCC CGACGCCACC
AACGTGCCCA CCTGCTCCGC GGCGTAG
 
Protein sequence
MGRRAQTLFV ASVLTLILAV AGLWLPVPFV TFAPGPVTDT LGKVDGAPLI EIDGRKTYPT 
AGSLELTTVE ETPRLNLLGA LQDWFASDRA VVPSELVRPP GSSQEQIRQE NTQAMIDSQD
QATAAALSEL GIAPTGAEVV VHEVLSDSPA QGLLRAGDVI TSVGGVAITG QDGLREQIGR
VKPGEVVEVA YRRDGTAGTG RITTRPATDD PTRPMIGVTT TEKRSYPFTV RIRISDIGGP
SAGLMFALGI VDLLTPGELT GGKTVAGTGT IDAAGEVGPI GGIQQKILGA QRAGASVFLV
PKGNCADAVR MNTDLRLVQV TNLSGALDAL NTLRAKPDAT NVPTCSAA