Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1736 |
Symbol | |
ID | 3906802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2070744 |
End bp | 2072189 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637879074 |
Product | type II secretion system protein E |
Protein accession | YP_480841 |
Protein GI | 86740441 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00343122 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.149878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCC CGTCCTGGCC CGACCCCCGC CGCCCGGCGT CCGGTAACGG CGCGGGCAGG CCCGGTCCGT TTCTTCCCTG GACCGGTGGC GGTGGGGGTC CGTCCGCGCC GACCGATCTT CCGGGCCGTG CCTCGGGGTC GGCGGCCGAC GCCCTGCGTG TCCGGCTCCG GGACGGGCTG CGGGCCGCGC TCGCCCGCCG GCTGCGCGCG GACGAGGAGG CAGGCTCCCC GCCGCTGACC GCGCAGGCGC GGGAGGCGTT CGCGCTGTCC GTGCTTGTGG ATCTGACCGA GGCGCACACC ACCGCCGAGC TGGCCCGCGG TGCGGGGGTC CTGTCACCCG AGGACGAGCA GCGCGTCCTC CACGAGGTCC TCGCCGAAGT CCTCGGACTC GGCGGCCTCG AACCGTTGCT CGCCGACCCG TCGATCGAGA ACATCAACAT CAACGGGGAT CGGGTGTTCA TCCGCCGGGC CGACGGCAGC CGACACCGGC TACCAGCGAT CACCGACTCG GACGCCCAGC TGGTCGGCCT GATCCGGGAC CTGGCCGCGC ACGCCGGGGT CGAGGAACGG CGCTGGGACC GCGGCGCCCC CATGGTCAAC TTCCACCTGG CCGACAAGAG CCGCGTGTTC GCGGTCATGG CGGTCACCCA GCGGCCGTCA GTGAGTATTC GCCGGCACCG GTTCCGCCAC GTCACCCTCG CGGCGCTGCG CGCGAACGGC ACGATCGACT ACGGGCTGGA AGCGCTGCTG GCCGCGCTGG TAGCGGCGCG GAAGAACATC GTGGTCGCCG GGGGCACGGC GATCGGGAAG ACCACGATGC TGCTCGCATT GGCCGACCAG ATCCCACCGC ACGAGCGGCT GGTGACCGTC GAGGACGTCT ACGAGCTCGG GCTGGACACC GACGCGCAGG CCCATCCGGA CGTGGTCGCG ATGCAGGTCC GCGAACCCAA CACCGAAGGC GAAGGCGCGA TCTCCGCGTC CGACCTGGTC CGGGCCGCGC TGCGGATGTC TCCGGACCGG GTGATCGTCG GGGAGGTCCG CGGGCCTGAG GTCATTCCGA TGCTCAACGC GATGTCCCAG GGCAACGACG GTTCGATGAC CACGCTGCAC TCCTCGACCA GCCGCGGCGT CTTCACGAGG TTGGCCAGCT ACGCCGTCCA GGGCCCGGAA CGACTTCCCG TCGAGGCGAC GAACCTGCTG ATCGCCAGCG CGATCCACGT GGTGGTGCAC CTCGCCGAAC CCCGAGGCGA ACGCGGCCGA CGGGTCGTGT CCAGCGTCCG GGAGGTCGTC GACGCCGACG GCACCCAGGT GATCACCAAC GAGCTGTACC GACCGGGACC CGACCGGCGC GCCCTGCCGG CGGCACCGCC GACCGGGGAA CTGCTCGACG ACCTGATCGA CGTGGGGTTC GACCCGGACG TGCTGTCCCG GGGGTGGTGG GGATGA
|
Protein sequence | MTVPSWPDPR RPASGNGAGR PGPFLPWTGG GGGPSAPTDL PGRASGSAAD ALRVRLRDGL RAALARRLRA DEEAGSPPLT AQAREAFALS VLVDLTEAHT TAELARGAGV LSPEDEQRVL HEVLAEVLGL GGLEPLLADP SIENININGD RVFIRRADGS RHRLPAITDS DAQLVGLIRD LAAHAGVEER RWDRGAPMVN FHLADKSRVF AVMAVTQRPS VSIRRHRFRH VTLAALRANG TIDYGLEALL AALVAARKNI VVAGGTAIGK TTMLLALADQ IPPHERLVTV EDVYELGLDT DAQAHPDVVA MQVREPNTEG EGAISASDLV RAALRMSPDR VIVGEVRGPE VIPMLNAMSQ GNDGSMTTLH SSTSRGVFTR LASYAVQGPE RLPVEATNLL IASAIHVVVH LAEPRGERGR RVVSSVREVV DADGTQVITN ELYRPGPDRR ALPAAPPTGE LLDDLIDVGF DPDVLSRGWW G
|
| |