Gene Franean1_5005 details

Gene Information

Locus tag: Franean1_5005
Symbol:
ID: 5673344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6002880
End bp: 6003923
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 76%
IMG OID: 641243859
Product: peptidase C60 sortase A and B
Protein accession: YP_001509275
Protein GI: 158316767
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3764] Sortase (surface protein transpeptidase)
TIGRFAM ID:
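
The length fields can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(6002880, 6003923)
print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed 1044 bp and 347 aa.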


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.387743
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGTC GCTGGACGGA GACAGTCGTG AAGCGTGGAG CGACGGAGCG GTCGCCGGTG 
CGTATTACGT TCTCCCGCCT GCTGCTCGTC GCGGGCGTGG TACTGGTCGT CGGGAGCGGG
TTCCGGCTGA CCGAGTCCGA TCCCGCCCCA CCCTCGGACA CCGAGCGGGT CGACGTCGGC
GGCCTCGGTA CCGCGATCGG GCAGGCCGGC GCGCTCGACG GGACGGCCGC TCCCGGCGGC
CCGGCGGCCG CCACGCCGGT GCCGACGCTC CCGCCGGTCG TGGTCACCGA CCCGACGCAC
GTGCGGATCC CGGCGATCGG AGTGGACGCG GCGATCGTCC GGCTCGGGCT GAACGCCGAC
GGCTCGCTGG CCGTACCGAA GAAGTGGCAG GAGGTCGGCT GGTACGACCG GGGCCCGGCG
CCCGGCGCGC TCGGCCCGTC GGTGCTGGTT GGCCACTACG ACTCGACCTC CGGGCCGGCC
GTCTTCTACC GGCTGCGGGC GTTGCGCGCC GGGGACCAGA TCGAGGTCAC CTCGCCGACC
GGGCTGCGCA CGACGTTCAC GGTCGACCGC ACGGAGGACG CGACCAAGAA GGCGTTCCCG
ACCGATCGCG TCTACGGCCC GGTGGGCCGG CCGGAGCTGA GGCTGATCAC CTGCGGCGGG
GCGTTCGACG AGGACACGAA CCACTACCTG AGCAATCTGA TCGTCTACGC GCACGCGAAC
AGCCCGTCCG TGCCGGGCGC CCAGCCGCCC GGCACCGGGG CGCCGCCCGC CGGCGCCCCG
CCCCCGGCCG TGTCTGCGCC GGCCGTGTCT GCGCCGGCCG TGTCCACTCC AGTCGCGCCC
ACTCCAGTCG CGCCGGCGCC CGCCATCCCT GTGCCGGTCG GGGCGGCACC CGCAGCCCCG
GCGTCTGTCG TGCCCGTGCC TGTCGCGCCC GCTACGGCCA TGCCCGCTAC GGCCATGCCC
GTGCCAGGCG TGCCCGTGCC CGTGCCCGTG CCAGGCGCGG GCGCGGCGCC GAACGCCCCC
GCTCCGCCGG CCCCAACCGG CTAA
 
Protein sequence
MDGRWTETVV KRGATERSPV RITFSRLLLV AGVVLVVGSG FRLTESDPAP PSDTERVDVG 
GLGTAIGQAG ALDGTAAPGG PAAATPVPTL PPVVVTDPTH VRIPAIGVDA AIVRLGLNAD
GSLAVPKKWQ EVGWYDRGPA PGALGPSVLV GHYDSTSGPA VFYRLRALRA GDQIEVTSPT
GLRTTFTVDR TEDATKKAFP TDRVYGPVGR PELRLITCGG AFDEDTNHYL SNLIVYAHAN
SPSVPGAQPP GTGAPPAGAP PPAVSAPAVS APAVSTPVAP TPVAPAPAIP VPVGAAPAAP
ASVVPVPVAP ATAMPATAMP VPGVPVPVPV PGAGAAPNAP APPAPTG
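
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names are hypothetical, and only the first 30 bases of the gene are used as a demonstration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
prefix = clean_sequence("ATGGACGGTC GCTGGACGGA GACAGTCGTG")
print(translate(prefix))
```

Translating this 30-base prefix gives the first ten residues of the listed protein sequence, MDGRWTETVV. Passing the full gene sequence to `gc_fraction` would allow the listed GC content to be checked in the same way.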