Gene Francci3_4297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4297 
Symbol 
ID3907265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5127815 
End bp5129083 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content72% 
IMG OID637881624 
Producttype II secretion system protein E 
Protein accessionYP_483372 
Protein GI86742972 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0512015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGC TCGGCGAGCG ACGCAGGTCC CCATTCCCCG CCACAGGTCA CACGGCCTCA 
CCGGCTGCCC GAGCGGCCGC GAGGCCCGCG GTCTCCACAC GATCTTCACC GCCGTCCGCA
ATTCCATCGG AACCGCTCGA ACCGTCAGAA CCGCTCGAAC CGTCAGAACC GGTGACCGCA
CCCGCGGAGC CGCCCGGAGC CGGCCCGCTG GATCCCCTCC TCGCTGATCC GCACGTCACC
GATGTCCTGG TCAACGGGCC CGGTGAGGTC TGGGTGGAAC GCCGGGGCCG GCTCATCCGT
ACCTCGGTCG CGTTCGCGGA CGAGGAAGCG GTGCGGCGCC TGGCCGTTCG TCTCGCCGCG
ACGGCCGGGC GGCGCCTGGA CGCCGCGATG CCCTTCGCTG ACGTGCAACT GCGCGACGGA
ACCCGGTTGC ACGCGGTTCT GGCACCGATC GCGGTGCAGG GTACGTGCCT GTCGCTGCGA
CGGACCCGGC GTCGCCCCTT CACCTTCGAC GAACTCGTGC TCGCCGGAAC AATGAGCCCG
GCGGTGGCGG GCGTGCTGCA CGCGGTCCTG AGCGCCCGAC TTGCGATCGT GGTGACCGGG
GGCACCGGCT CGGGCAAGAG CACACTGCTT GCCGCACTAC TGGGAGCGGT GCGCCCCGAC
GAACGGATCG TGCTGGTGGA GGACACCGCC GAGCTCGTCA TGGACCGTCC GGGGATGGTG
CGGCTCGAGG CCCGCCCGCC GAACATCGAG GGCGCTGGCG AGGTGACCCA GCGCGATCTG
GTCCGCCAGG CGCTGCGCAT GCGCCCGGAT CGGCTGGTGC TCGGCGAGGT CCGCGGCCCC
GAGGTGCTCG ATCTCCTCGT CGCCCTCAAC ACCGGTCACG AGGGCGGGCT GAGCACCGTG
CACGCCAACG ACACCAGTGC GCTGCCCACC CGCCTGGAGG CTCTCGCCGC CCTTGCCGGG
CTGTCCCGCC CCGCGGTCCA CAGCCATATC GCGGCAGCGC TCCACACCGC CGTTCATCTT
TGCCGGGAGG CCGACGGCCG CCGTCGGGTC TGCGCGATCG GGGTGTTCCG CCAGACGGAC
AGCGGACTGG TTCAGGTCGT GCCGGCCCTG GTCGTTCCTG CCGCACCTTC CCGGAACGGG
CCAGCCCTGC GCGCCTGCGA CCCGACGCCG GCGGAGGGCC TGCCGATCCT GCGTGACCTG
CTGGACGTGC GCGGTGTCTC GCTTCCAGCC CTGGTTGACG CCGGGTCGCA GGTGCGGAGG
CGAGCATGA
 
Protein sequence
MTGLGERRRS PFPATGHTAS PAARAAARPA VSTRSSPPSA IPSEPLEPSE PLEPSEPVTA 
PAEPPGAGPL DPLLADPHVT DVLVNGPGEV WVERRGRLIR TSVAFADEEA VRRLAVRLAA
TAGRRLDAAM PFADVQLRDG TRLHAVLAPI AVQGTCLSLR RTRRRPFTFD ELVLAGTMSP
AVAGVLHAVL SARLAIVVTG GTGSGKSTLL AALLGAVRPD ERIVLVEDTA ELVMDRPGMV
RLEARPPNIE GAGEVTQRDL VRQALRMRPD RLVLGEVRGP EVLDLLVALN TGHEGGLSTV
HANDTSALPT RLEALAALAG LSRPAVHSHI AAALHTAVHL CREADGRRRV CAIGVFRQTD
SGLVQVVPAL VVPAAPSRNG PALRACDPTP AEGLPILRDL LDVRGVSLPA LVDAGSQVRR
RA