Gene Franean1_2138 details


Gene Information

Locus tag: Franean1_2138
Symbol:
ID: 5670538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2566594
End bp: 2567565
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 71%
IMG OID: 641241059
Product: sortase family protein
Protein accession: YP_001506480
Protein GI: 158313972
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3764] Sortase (surface protein transpeptidase)
TIGRFAM ID: [TIGR01076] LPXTG-site transpeptidase (sortase) family protein
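
The coordinate and length fields above are mutually consistent, and a short Python sketch can show the arithmetic. It assumes that the start and end coordinates are inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # drop the stop codon
    return gene_len, protein_len


print(cds_lengths(2566594, 2567565))  # (972, 323)
```

With the values from this record, 2567565 - 2566594 + 1 gives 972 bp, and 972 / 3 - 1 gives 323 aa, matching the Gene Length and Protein Length fields.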


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGGCGA CGAATCGCCG GCACCTGGCG GATTCGCTGA CGGATCTGCC CGTCCAGCCC 
GGAGCCGGAC GTCCTCCGGC CCCGGCGTCG CCGACCGCGG ATCCGCACCT GGGGGAGCCG
GCGCATCGGG GCGCGCGGGC GGCACCCGGA GGGCCTGGCC GGAGCCGGGG CGCTCGGCGG
CCCGGGCTGG TTCGCCACAC CCCCTGGCCA CGGCGGGTCC AGCCCGATCG AGTGGACGGC
CGGAGACGCC GGCGTAACCC GGTGGCGGGG CTGGCGGATC GGCCGGTTGG CGGTCGGGTC
TCCCGGGGCC TCGGCGAGGT CATGATCACA GCGGGTCTGG TGGTGGTGCT CTTCCTCGCC
TACCAGCTGT GGATCACCGA CATCTTCGCG GCCAGGACGC AGGACCGGCT CCGAAACGAC
CTGACCACGG CGTGGTCTCG ACAACCGCAT CCTCGGGCTC CCGCCGAGGC TGCGAAACCA
CGACCGGTCG TGCCGCCCGT CGAACTGGGC GAGGGAGTCG CTGTCCTGCG GGTCCCCCGC
TTCGGTGCCG ACTACGCACC CGTGGTTGTG GAAGGTGTGT CGGTGGCGGC GCTACGCCGC
GGACCTGGGC ATTTCCCGGG CACTGCCATG CCGGGCGACG TAGGGAACTT CGTCGTGTCC
GGTCACCGCA CCACGTATGG AAAGCCGTTC AGCCGGCTGG ACGAGCTGAG AGTGGGCGAT
CCGCTCGTGG TGGAGGTGGC AGACCGGTAT TTCACCTACC GGGTCACCGG CTCGGAGGTC
GTGGACCCCC ATCGGCTGGA CGTGACCTAC CCGGTTCCAG GGCACGCCGG AGTCGCTCCC
ACCAGGGCGT TGATGACACT GACCACCTGC CATCCACGAT TCTCGGCGCG GAGTCGACTC
ATCGTCTTTG CCAACCTCGA CGAGACCACG GACAAGTCCG ACGGACCACC TCGCGCGCTC
GCGGACGAAT AG
 
Protein sequence
MPATNRRHLA DSLTDLPVQP GAGRPPAPAS PTADPHLGEP AHRGARAAPG GPGRSRGARR 
PGLVRHTPWP RRVQPDRVDG RRRRRNPVAG LADRPVGGRV SRGLGEVMIT AGLVVVLFLA
YQLWITDIFA ARTQDRLRND LTTAWSRQPH PRAPAEAAKP RPVVPPVELG EGVAVLRVPR
FGADYAPVVV EGVSVAALRR GPGHFPGTAM PGDVGNFVVS GHRTTYGKPF SRLDELRVGD
PLVVEVADRY FTYRVTGSEV VDPHRLDVTY PVPGHAGVAP TRALMTLTTC HPRFSARSRL
IVFANLDETT DKSDGPPRAL ADE
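
The gene sequence begins with GTG while the protein begins with M, which is what one would expect if GTG is read as a start codon under translation table 11. The following Python sketch illustrates that relationship and a simple GC-content calculation. It is a minimal illustration, not the database's own pipeline: the names `translate` and `gc_percent` are hypothetical, the codon assignments are those of the standard code (which table 11 is assumed to share), and the start-codon set is an assumption based on the NCBI listing for table 11.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}
# Assumed start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def _clean(dna: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10)."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading a leading start codon as M and
    stopping at the first stop codon."""
    seq = _clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"                      # initiator codon encodes Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = _clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("GTGCCGGCGA CGAATCGCCG GCACCTGGCG"))  # MPATNRRHLA
print(translate("GCGGACGAAT AG"))                      # ADE
```

The two printed fragments correspond to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.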