Gene Francci3_1736 details

Gene Information

Locus tag: Francci3_1736
Symbol:
ID: 3906802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2070744
End bp: 2072189
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 72%
IMG OID: 637879074
Product: type II secretion system protein E
Protein accession: YP_480841
Protein GI: 86740441
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
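
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is part of the CDS but does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2070744, 2072189)
print(length)                     # 1446
print(protein_length_aa(length))  # 481
```

Both results agree with the Gene Length and Protein Length fields of this record.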


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00343122
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.149878
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCC CGTCCTGGCC CGACCCCCGC CGCCCGGCGT CCGGTAACGG CGCGGGCAGG 
CCCGGTCCGT TTCTTCCCTG GACCGGTGGC GGTGGGGGTC CGTCCGCGCC GACCGATCTT
CCGGGCCGTG CCTCGGGGTC GGCGGCCGAC GCCCTGCGTG TCCGGCTCCG GGACGGGCTG
CGGGCCGCGC TCGCCCGCCG GCTGCGCGCG GACGAGGAGG CAGGCTCCCC GCCGCTGACC
GCGCAGGCGC GGGAGGCGTT CGCGCTGTCC GTGCTTGTGG ATCTGACCGA GGCGCACACC
ACCGCCGAGC TGGCCCGCGG TGCGGGGGTC CTGTCACCCG AGGACGAGCA GCGCGTCCTC
CACGAGGTCC TCGCCGAAGT CCTCGGACTC GGCGGCCTCG AACCGTTGCT CGCCGACCCG
TCGATCGAGA ACATCAACAT CAACGGGGAT CGGGTGTTCA TCCGCCGGGC CGACGGCAGC
CGACACCGGC TACCAGCGAT CACCGACTCG GACGCCCAGC TGGTCGGCCT GATCCGGGAC
CTGGCCGCGC ACGCCGGGGT CGAGGAACGG CGCTGGGACC GCGGCGCCCC CATGGTCAAC
TTCCACCTGG CCGACAAGAG CCGCGTGTTC GCGGTCATGG CGGTCACCCA GCGGCCGTCA
GTGAGTATTC GCCGGCACCG GTTCCGCCAC GTCACCCTCG CGGCGCTGCG CGCGAACGGC
ACGATCGACT ACGGGCTGGA AGCGCTGCTG GCCGCGCTGG TAGCGGCGCG GAAGAACATC
GTGGTCGCCG GGGGCACGGC GATCGGGAAG ACCACGATGC TGCTCGCATT GGCCGACCAG
ATCCCACCGC ACGAGCGGCT GGTGACCGTC GAGGACGTCT ACGAGCTCGG GCTGGACACC
GACGCGCAGG CCCATCCGGA CGTGGTCGCG ATGCAGGTCC GCGAACCCAA CACCGAAGGC
GAAGGCGCGA TCTCCGCGTC CGACCTGGTC CGGGCCGCGC TGCGGATGTC TCCGGACCGG
GTGATCGTCG GGGAGGTCCG CGGGCCTGAG GTCATTCCGA TGCTCAACGC GATGTCCCAG
GGCAACGACG GTTCGATGAC CACGCTGCAC TCCTCGACCA GCCGCGGCGT CTTCACGAGG
TTGGCCAGCT ACGCCGTCCA GGGCCCGGAA CGACTTCCCG TCGAGGCGAC GAACCTGCTG
ATCGCCAGCG CGATCCACGT GGTGGTGCAC CTCGCCGAAC CCCGAGGCGA ACGCGGCCGA
CGGGTCGTGT CCAGCGTCCG GGAGGTCGTC GACGCCGACG GCACCCAGGT GATCACCAAC
GAGCTGTACC GACCGGGACC CGACCGGCGC GCCCTGCCGG CGGCACCGCC GACCGGGGAA
CTGCTCGACG ACCTGATCGA CGTGGGGTTC GACCCGGACG TGCTGTCCCG GGGGTGGTGG
GGATGA
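
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC fraction such as the one reported in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that line rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed above (60 bases).
first_line = "ATGACCGTCC CGTCCTGGCC CGACCCCCGC CGCCCGGCGT CCGGTAACGG CGCGGGCAGG"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Pasting the full 1446 bp sequence into `clean_sequence` would give the gene-wide value to compare against the record.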
 
Protein sequence
MTVPSWPDPR RPASGNGAGR PGPFLPWTGG GGGPSAPTDL PGRASGSAAD ALRVRLRDGL 
RAALARRLRA DEEAGSPPLT AQAREAFALS VLVDLTEAHT TAELARGAGV LSPEDEQRVL
HEVLAEVLGL GGLEPLLADP SIENININGD RVFIRRADGS RHRLPAITDS DAQLVGLIRD
LAAHAGVEER RWDRGAPMVN FHLADKSRVF AVMAVTQRPS VSIRRHRFRH VTLAALRANG
TIDYGLEALL AALVAARKNI VVAGGTAIGK TTMLLALADQ IPPHERLVTV EDVYELGLDT
DAQAHPDVVA MQVREPNTEG EGAISASDLV RAALRMSPDR VIVGEVRGPE VIPMLNAMSQ
GNDGSMTTLH SSTSRGVFTR LASYAVQGPE RLPVEATNLL IASAIHVVVH LAEPRGERGR
RVVSSVREVV DADGTQVITN ELYRPGPDRR ALPAAPPTGE LLDDLIDVGF DPDVLSRGWW
G
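
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may act as start codons), and it does not handle alternative start codons, which is acceptable here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene and its final 6 bases, as printed above.
print(translate("ATGACCGTCC CGTCCTGGCC CGACCCCCGC"))  # MTVPSWPDPR
print(translate("GGATGA"))                            # G
```

The two outputs match the first ten residues and the final residue of the protein sequence listed above.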