Gene Francci3_3332 details


Gene Information

Locus tag: Francci3_3332
Symbol:
ID: 3904118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3949543
End bp: 3950892
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 72%
IMG OID: 637880657
Product: type II secretion system protein E
Protein accession: YP_482418
Protein GI: 86742018
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
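
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
START_BP = 3949543
END_BP = 3950892
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 449


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the terminating stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: the coordinate span is 1350 bp, and 1350 / 3 = 450 codons, one of which is the stop codon.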


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0077596
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker


Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGAGCCTGC AGTCGAGCAC CTCCGGGTCC GACGGCGGGT CCGATGTCGC CGAGGATGTA 
GCGAGCACGA TCAACACGGC GGTGCAGCGT CAGGTGGCGG CGGCGCGCCG CAGCGGCCGG
CGGTTCACCC CCACGGAACG CGCCGCGCTC GCCGAGGAGC TCCTCGCCCG GGAACTGGCC
GACATCCGGC GCGGCGCGGC CGACGCGCCG CCGCTGGACG CGGCCGGGGA GAACGAGGTG
CGGCTGCTGG TCCGCCAGGC CCAGTCGCAG CTGGGTTCGC TCGGCCCGTT CCTGCTGGCG
GACCGGTTTT CCGATGTGGA GGTCAACGGC GCGGTCAACC TGGTGCTGAC CGAGCGCGGC
AGCGGGCATC GGATCGAGGG ACGGTCTCCG TTCGGTAGCG ACGCCCAGGC GTTTGAGTGG
GTGGCCGAGC ATGCGGCGTC GGTCGGCCGC CGGTTCGACG AGAGCAACCC GTCGGTGCGG
TTCCGGCTGC CGAACGGGGT TCGGGTGCAC GCGGTGTCCC GGGTGACTCG CCTGACCCAT
ATCGACTGCC GGTTGTTCCG GCCCGGCCTG GACACCCTGG ACGGGCTCGC CGACGCGGGG
ATGTTCGGAT CTGACATCAC CGCCCTGCTC GCGGGGACGG CGGCTTTGCG TCAACCGTTT
GGGCTGATCA TCTCGGGTGG GACGGGAGCG GGGAAGACGA CGCTGCTGCG GGCGTGGGTC
AACGCCACAC CCGACGATCC GATCCTCGAC CGGATGGTGA CGGTGGAGGA TGAGCAGGAG
CTGTTCCTGG CCCCGGAGCG GTTCCGCAAC CTGGTGGAGT TCGAGGCCCG CGAGCGCAAC
GTCGACGGCC GCGGCGAGTA TTCGATGGCG CGGTATCTCG CGGAGAACCT GCGCCGTCAG
ACCCCGCACC GGGTCCTGCT CGGGGAACTG CGCCCCGACG GCGGCGTCCT GCCGCTGCTG
CTGGCGCTCG GGCAGGGCAT CGCCCAAGGG GTGGCGACGA CGATCCACGC ACCGAGCGCC
GCCGACGTCG TCGCCCGGCT ACGCACGTAT GCGGCGTTCG ACCCGGGGCG GGTGCCGGAG
GCGGCGGTGT TGGAGACCAT CGCGTCCACC GTCGATCTGA TCGTGCATGT CGCGAACCTG
GACGGCCGGC GGGTGGTCAC GAGCGTGCAT GAGGTCGGGG AGTACCGGGA GGGCCGGGTG
ACCTCGGCGG AGCTGTGGCG CTGGGACGCG AGGATCGAGC GGGCGGTACG CACGGACCTG
GACTTCTCCG ACCAGCTCGC CGCCAAGCTG CGTTCCGCCG GGGTCGGCCC GGCGGTCCTC
ACCCGGCGCC GGACGAGGGC GGCCTGGTGA
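
A GC content figure such as the one in the gene information can be recomputed from a sequence in this layout. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper that ignores the spaces and line breaks separating the 10-base blocks and assumes the sequence contains only A, C, G, and T.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First 10-base block of the gene sequence above.
    print(gc_percent("ATGAGCCTGC"))  # 6 of 10 bases are G or C -> 60.0
```

To check the whole gene, paste the full gene sequence, spaces and line breaks included, into the call.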
 
Protein sequence
MSLQSSTSGS DGGSDVAEDV ASTINTAVQR QVAAARRSGR RFTPTERAAL AEELLARELA 
DIRRGAADAP PLDAAGENEV RLLVRQAQSQ LGSLGPFLLA DRFSDVEVNG AVNLVLTERG
SGHRIEGRSP FGSDAQAFEW VAEHAASVGR RFDESNPSVR FRLPNGVRVH AVSRVTRLTH
IDCRLFRPGL DTLDGLADAG MFGSDITALL AGTAALRQPF GLIISGGTGA GKTTLLRAWV
NATPDDPILD RMVTVEDEQE LFLAPERFRN LVEFEARERN VDGRGEYSMA RYLAENLRRQ
TPHRVLLGEL RPDGGVLPLL LALGQGIAQG VATTIHAPSA ADVVARLRTY AAFDPGRVPE
AAVLETIAST VDLIVHVANL DGRRVVTSVH EVGEYREGRV TSAELWRWDA RIERAVRTDL
DFSDQLAAKL RSAGVGPAVL TRRRTRAAW
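
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is an illustrative example under stated assumptions: `translate` is a hypothetical helper, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons; this gene begins with ATG, so no special handling is included).

```python
# Standard amino acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence.
    print(translate("ATGAGCCTGC AGTCGAGCAC CTCCGGGTCC"))  # MSLQSSTSGS
    # Last nine bases of the gene sequence, including the TGA stop codon.
    print(translate("GCCTGGTGA"))  # AW
```

The two outputs match the first ten and last two residues of the protein sequence listed above.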