Gene Franean1_6528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6528 
Symbol 
ID5674843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7938319 
End bp7939497 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content68% 
IMG OID641245376 
ProductNLP/P60 protein 
Protein accessionYP_001510771 
Protein GI158318263 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.203343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGGG CAGCCCAGCC CGAACCCGAC AGCTCACCTC GCAGGCGTAC GGGAAGGAAC 
GCAACGTTGT CTGCGCAAAG TGTGCAGCTT GATGCTGAGA GACGCCGGCA GGAAGGGCGC
GGTCGCCACC GTGCGCCGTC CGCCCCCACG GCGTCCAGCC GAGCCAGAGC CAGGGCCCGT
ACCGTCGCGG CCATCACCAC CGGAACCGTC GCGGTCTCCG GGGTGGCGCT CGCCGGGTGC
GCCCAGGATG TCAACTCGGA CGTCGCGCTG GACGACGGCA CGAACACCGC CCCGATGACG
CTCGCGACCC AGATCGGGTC CCGGCTGGCG GTCGACGGAG CCATCCAGGC CGCCACCGCT
ACCGGCGGCG GTTCGGGCAC CGTTCTCGAC GCCACCACGG ACATCTCCGC GCCGACTCTC
TCGTCGAAGA TCGACGTCGG CCTGCGCGTG ACGAACCCCG AGGTCACGGT CAACGCGGAC
GAGCCCGTCA ACATCGGCTT CTCGCTCTAC AACGAGGAGA CCCAGGCCCC GATCCCGGAC
CAGCTCATCA AGGTCCAGGT CAAGCTGCCG ACCGGGTGGG CGACCTTCCT GCACCTGACG
ACGGACGACC GCGGCTTCGC GTCGTACACC GCGAAGGTGC TCACCACCAC GAATGTCACG
GCGATCTTCG ACGGCACTGA CGCCCTCCAG TCGGCGCACT CGGAGAACGA CGCCACGCTG
CACGTCCGCC CGGCTCCGCC GCCGGTGCCC GCACAGGCCT CCCGCAGCGC GGACCGCACG
GGCGTGAACG TCAACACCCC GGTGGTCAGC GTCAACCTGC CGACCAACAC CCTCGGTGAG
AAGGCCGTCT ACCTGGCCTC GCTACAGGCC GGCAAGCCGT ACGTCTACGG CGCCGAGGGT
CCCAACGCCT TCGACTGCTC CGGTCTCGTG CAGTACATCT ATCGGCAGCT CGGCAAGAGC
CTGCCGCGTA CCACCGACCA GCAGTACGCG GCCACCACCC ATATCTCCCA GTACAACAAG
GCGCCCGGCG ACCTGATCTT CTTCGGGAGC CCCGGAAACA TCTACCACAT GGGCATCTAC
GCCGGCGACG GCAAGATGTG GGTCGCGCCC CGCACTGGTG ATGTCGTCAA GCTTCAGACG
ATCTACACGA CCTCGTACTT GGTCGGCCGC GTCACCTGA
 
Protein sequence
MDRAAQPEPD SSPRRRTGRN ATLSAQSVQL DAERRRQEGR GRHRAPSAPT ASSRARARAR 
TVAAITTGTV AVSGVALAGC AQDVNSDVAL DDGTNTAPMT LATQIGSRLA VDGAIQAATA
TGGGSGTVLD ATTDISAPTL SSKIDVGLRV TNPEVTVNAD EPVNIGFSLY NEETQAPIPD
QLIKVQVKLP TGWATFLHLT TDDRGFASYT AKVLTTTNVT AIFDGTDALQ SAHSENDATL
HVRPAPPPVP AQASRSADRT GVNVNTPVVS VNLPTNTLGE KAVYLASLQA GKPYVYGAEG
PNAFDCSGLV QYIYRQLGKS LPRTTDQQYA ATTHISQYNK APGDLIFFGS PGNIYHMGIY
AGDGKMWVAP RTGDVVKLQT IYTTSYLVGR VT