Gene Francci3_0423 details


Gene Information

Locus tag: Francci3_0423
Symbol:
ID: 3903247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 503262
End bp: 504641
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 72%
IMG OID: 637877754
Product: NLP/P60
Protein accession: YP_479539
Protein GI: 86739139
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0791] Cell wall-associated hydrolases (invasion-associated proteins)
TIGRFAM ID:
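
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 503262
END_BP = 504641
GENE_LENGTH_BP = 1380
PROTEIN_LENGTH_AA = 459


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of residues, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 504641 - 503262 + 1 = 1380 bp, and 1380 / 3 = 460 codons, one of which is the stop codon, leaving 459 residues.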


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAGTG TCCCCACGGA GGGCTGGCAT AGCGCCGGCG AACCGCGCCC CGGGCCCCGC 
CGTCCCCGTG ATGTCCCGCC ATCCGGGTTC GGCCGGCGAA CTTCGAAACA CGTCGCAAGG
CAGACTCCCG CCGCCGGCCC GACCGACGAT CCCCAGCCGA CCGGGAACGA TCCGACCGGA
TGGCGTCCGA AAGCCGGAGC GCCCATTGTT CTCGGACTGG TCGGCATTCT CGTCGCCGGA
CCTCCGGTTG TCTGGGGGTT TGCCGGATCC GCCTGGGCGG CGCCGACGAC TCCGGCCTCC
GGGTCGGCCC CGGAGAGCCC GGAGAGCCTG CAGAGCCTGC AGGCGGAGAT CAACGGTACC
CGGGTACGGC TGGACGAATC CACTCGTCAG ACGGCGATTG CCACTGAGGT GTTCAACGCC
GAACGTATTC GGCTGGCCGA GGCCGAGCGG GCCGCGGCTG CCGCGGCCGG GCGGGTCGAC
CGCGCTGACG ACGCTGTACG GCAGGCCTCG GACAAACACC GGGGGTTGGC GGTGTCGGCG
AATCGGGCCG GGGGATTCGG GCAACTGTCG TTGCTGCTGA CCGGCGACCC CCGACAGGTA
CTCGACCGGG CCGGTGCGGT CGATGCGCTG GCCCGCCGGC AACGCGTGGC GGACACTGGC
CTGCGGCTCG CGCGTCGGGA TCTCACCGAG GCACGCCGGA GCGCTGACGT GGCACTTGCG
GGCAAGAGGA AGATCGTCAT GCGGCTCGCC GCACGTAAGC GGTCCATCGA GGCGTCCGCC
GCCGAGCAGC GCAGCCTGCT GCAGCGGCTC GAATCCCGCT ACGCATCGCT GGAGCGGCGG
GCCAGGGAGC GCCAAGCCGC CGCCGCGCGG GCCCGCAGGG CGGCGGCGGC CGCCGCCGCG
GCATCGGCGG CCAGGAAGGC CGCGGCCGAA CGGGTCCGTT ATCGGAAGGA GTCGGCCGCG
GTCGCGGCCG CGGGCCGGGC GTTCGCCGCG GCCCCGACTA CCCCGGCACC CATCCCGCCG
ACGGGCGGCG GTGGCGCGTC GCGCGCGGTG CAGGAGGCAT ACGCCCAGCT GGGCAAGCCC
TACGTGTGGG CTGCGGCGGG GCCGAAGTCC TTCGACTGCT CCGGCCTGAC GCAGTGGGTC
TGGGGGAAGG CCGGGGTCTC GCTGAGCCAC TACACCGGAT CACAATGGAA TGAGGGGCGC
CGCGTGAACC GGGCGGGCCT CATTCCCGGC GATCTCGTCT TCTTCCATGC CGATCTTGAT
CATGTCGGGA TCTACATCGG GGGCGGGAAG ATGATCCACG CTCCGCGGAC CGGGGAGGTG
GTCAAGGTGG AGAAGATCTG GTGGTCGAGC TTCCGGGGGG GTGTGCGGCC GGGAGCGTGA
 
Protein sequence
MSSVPTEGWH SAGEPRPGPR RPRDVPPSGF GRRTSKHVAR QTPAAGPTDD PQPTGNDPTG 
WRPKAGAPIV LGLVGILVAG PPVVWGFAGS AWAAPTTPAS GSAPESPESL QSLQAEINGT
RVRLDESTRQ TAIATEVFNA ERIRLAEAER AAAAAAGRVD RADDAVRQAS DKHRGLAVSA
NRAGGFGQLS LLLTGDPRQV LDRAGAVDAL ARRQRVADTG LRLARRDLTE ARRSADVALA
GKRKIVMRLA ARKRSIEASA AEQRSLLQRL ESRYASLERR ARERQAAAAR ARRAAAAAAA
ASAARKAAAE RVRYRKESAA VAAAGRAFAA APTTPAPIPP TGGGGASRAV QEAYAQLGKP
YVWAAAGPKS FDCSGLTQWV WGKAGVSLSH YTGSQWNEGR RVNRAGLIPG DLVFFHADLD
HVGIYIGGGK MIHAPRTGEV VKVEKIWWSS FRGGVRPGA
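
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with GTG acting as an alternative initiation codon under translation table 11. The sketch below illustrates this on the first 60 bases of the gene sequence. It is an illustration only, not the database's own pipeline: the helper `translate_cds` is hypothetical, the codon assignments are those of the standard genetic code (which table 11 shares), and the set of start codons is taken from the NCBI listing for table 11 as an assumption.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: initiation codons listed by NCBI for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS fragment; a recognized first codon is read as Met."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is translated as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


first_line = "GTGAGCAGTG TCCCCACGGA GGGCTGGCAT AGCGCCGGCG AACCGCGCCC CGGGCCCCGC"
print(translate_cds(first_line))  # MSSVPTEGWHSAGEPRPGPR
```

The printed residues match the first 20 residues of the protein sequence above. Note that the sketch treats the first codon of any input as a potential initiator, so a fragment taken from the middle of the gene should only be passed in if its first codon is not in `START_CODONS`.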