Gene Franean1_5736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5736 
Symbol 
ID5674062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6970688 
End bp6971668 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content66% 
IMG OID641244589 
ProductNLP/P60 protein 
Protein accessionYP_001509992 
Protein GI158317484 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGGA TACCGCTGCG TGTGTCGGCG TCGCAGCGTC GCGTAGCGAC TCTTGCCCTC 
ATGACGACCG GCGGCTTCGC CGCCGCGTCG ACTGGCTTCG CGCTCGCTAC CACCGGCGAA
GCTGCCTCCG CGGCATCACC GGTGCCCACC ATCCGCACGG CTGGCATCCT CCCTGCGGGC
GCCCAGCCGG CTGCAGCGGA CGGTGAGGAA GCGGCGCTGG CCGCCGCCAT CGCTGATCCG
TCGGAGAACT TCACCGGTCT CGCATTCGCT CCCGACCGGG CGACCGTCGC GCCCAACGGC
GAAGTGATGT TCACCGTCCG TGCGACCAGG GCCGACGGCG CACCACTAAT CGGGTCCGCC
GTCCGCATCG TGTCCGTCAA CGGACCGAAA TGGACCACCA CGGCCACACT CCGAACGGAC
GCCGCAGGCG AGGCGCGGAT ATCTACTCGA CTGCTATCCA CGACTACCCT CACTGCCGTG
TTTGACGGCT CCGGCGCGCT GCGTCCCTCA ATGGCCGGAA CAGCGACGGT CACCGTCCAG
GCTCCACCCG CGGCAGCGAG GTCCGCCCGC ACCGGTGGAG GGCAGCCAGG TGCCGTACCC
ACACCGATCT ACGGCTCGGT CCCGGCAAGC GAGATCGGCG CGAAGGCCGT TTATCTTGCG
TCCCTGCAGA ACGGCAAACC GTACGTCTAC GGCGCCGCAG GACCTTACGC CTTCGACTGC
TCGGGATACG CCCAGTACGT CTACCGGCAG CTTGGACGGA ACCTGCCGCG TACCGCCCAG
CAGCAGTTCC AGGCCACGAT ACGTATACCG AAATCCGGGA AGCAGCCCGG GGACCTCATC
TTCTTCGGCA CTCCATCAAA CATCACCCAC ATGGGCATCT ATGCCGGAAA CGGCTATATG
TGGGCTGCTC CAAGGACCGG AAGCAACGTC AAGCTGCAGC CCATCTACAG CTCCACCTAC
TATGTAGGCA GAGTTCGGTA A
 
Protein sequence
MGGIPLRVSA SQRRVATLAL MTTGGFAAAS TGFALATTGE AASAASPVPT IRTAGILPAG 
AQPAAADGEE AALAAAIADP SENFTGLAFA PDRATVAPNG EVMFTVRATR ADGAPLIGSA
VRIVSVNGPK WTTTATLRTD AAGEARISTR LLSTTTLTAV FDGSGALRPS MAGTATVTVQ
APPAAARSAR TGGGQPGAVP TPIYGSVPAS EIGAKAVYLA SLQNGKPYVY GAAGPYAFDC
SGYAQYVYRQ LGRNLPRTAQ QQFQATIRIP KSGKQPGDLI FFGTPSNITH MGIYAGNGYM
WAAPRTGSNV KLQPIYSSTY YVGRVR