Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0034 |
Symbol | |
ID | 3903609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 39979 |
End bp | 42114 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637877364 |
Product | Type IV secretory pathway VirB4 components-like |
Protein accession | YP_479157 |
Protein GI | 86738757 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.694512 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCTCAG CACACGACGA CGCCTCGGCG CGCGGGTACC CGGTCCCGAC CGGCTACCCG ACCGGCGGCT ATCCGGGTAA CAACTATCCG GGTAACAGCA ACACGAGCCA TGGCCAGCCA GGCAATGGCC AGCTGAGTAA CGGCTATGCG GGCAACGGCT ATGCGGGCAA CGGCCATGCG GTGGGCGGTT ACGTGCCGGC CGGCTATCCA GCCGGTGAAG CCGTGGCCGG TCACCCCGAG CGCGCCGACA GCAGCCCCGG CGGCCCCGAC GACGAGGGCC CCGACGGCGA CTTCGGCGAC GGCGACTTCG ACGACAGCGA CTTCGACGAC AGCGACTTCG ACGACCCGCC CAGCTACGAC GAGACCGGTT ACGACCCCGA CGGCTACGAG GCGGCGGACT ACGAGGCCAC CTCCCACGAC CCGGGACGCG ATGCCGAGGG CTATGTGGAC CTCGGCGGCG GTCCGCGGCG GCGGCGCGAG GGACGCCTAT CCGCGGAGCC CCCCCGGCGC AGCCGGCGCG CGGAACGGGC CGCGGCGGCC ACCGAGAAGG CTGCGATGCG CCGGGCCCAG ATAGCCCTGC GCCGGGGCGC ACAGCCGCCA CGGCTGTTCG GGCCGTTCGG CGGGCGAACG TTCCACAAGC TGCGGCTTCC CGCGCACATG GAGACGACGG CGCAGATCGC CGGCATCTAC CCGTTCGTCG TCGACTCCGG CCTGGCCGCG CCCGGCATGT ACATCGGTCG GCACGTCTGG TCCGGGAACT CGTTCGACTT CGACGTCTTC GAGCTGTACC GCCAGCAGGT CATCGAGAAT CCGAACTTCG CCGTGTTCGG CGCGGTCGGC TCCCGCAAGT CGGCGCTGCT GAAGACGCTG ATCTCGCGAG GGGCGGCGTT CGGCTACCAG GCCGCCGTAC CCTGTGACCC GAAGGGCGAG TACACGCGGC TGGCTCGGCG CCTCGGCTGT GAGCCCACCT ACATCGGGCC CGGCATGGCC ACCCGGCTGA ACCCGCTCGA CGCCCCACCC CGTCCGATGG GCATCGCCGA CGAGGACTGG GCCCGTGAGG TGAAGCGGGC TCGTTCGGCG CTGCTGTCGT CCCTGATCGA GACGGCGAAG GGGGTGCCGC TCACCCCGGC CGAGCACACC GCGGTCGACC TGGCTCTCGA CGTCGTGACC CGGCAGATCA CCGGGGCGTC CCCGGACCGG TGGGCAACCC CGATCCTGCC GCACGTGCTG GAGGCGATGA CGGATCCAAC CGAGGAGGAC TGTGTCAACC TCCCGATGAC CGCGACCGAG CTGCGCGACG CCTCCCGGGA TGCCACGCTG ACCCTGCGCC GGCTCACACA CGGCGCGCTG GGCGGTCTGT TCGACGGGCC GACCACCTCT CCCCTGGACT TCGACCGTCC GATCGCCGTG CTGAACCTGG AGCGGGTCCA GGGCAGCGAT GAGATGATCG CCCTGATCAT GACCTGCGCG CAGGCGTGGA TGGAAGCGGC ACTCATGCGC CAGGACGGCG TGCAGCGCTA CGTCGTCTAC GACGAGTGCT GGCGGCTGAT GCGGTTCGCC GGCCTCGTCC GCCGGCTGTC CGCCCAGCAG AAGCTCGCCC GACAGTGGGG ATGCGCGAAC GCGATCGTCG CCCACCGCAT CTCCGACCTG CTGTCCGCCT CGCCGGACTC GGTGGAGATC GCCAAAGGCC TGCTCGCCGA AACCGCGATC CGGATCCTCT ACAAGCAGGC GTCCGATCAG ATCGCCGACA CGCAGGCCGC GCTCGGGCTC ACCGACGTCG CCGCCGACCT GCTCCCCAGG CTCGACCCGG GCTACGCCCT GTGGCTGATC GGCGCCCGCG CCTTCTACGT CGAGCACGTC GTCGGCGACC TGGAGATCCC CGTGGTGCTC AACGGCTCGA AGATGCACGG CGAGGTCGAC GACACTAACC TGACCCCCGA CGATCTGGAT CCGGCCGAGC TGGACCCGCC GGATCTCGGT CCCGGCCGGC TGCGCCGGAT GGCCGACGAA CTCCTCGGCG ACCGGGAGCT CGCCGGCCGG GAGCTCGCCG GCGGCCCGGC CGTCCCACGG GCCGCCGACC TCACGGGCGG GTTCGGTTAC CCGCCTGTGG AGCCCAGTCC CAGCTCCACC GGCTGA
|
Protein sequence | MSSAHDDASA RGYPVPTGYP TGGYPGNNYP GNSNTSHGQP GNGQLSNGYA GNGYAGNGHA VGGYVPAGYP AGEAVAGHPE RADSSPGGPD DEGPDGDFGD GDFDDSDFDD SDFDDPPSYD ETGYDPDGYE AADYEATSHD PGRDAEGYVD LGGGPRRRRE GRLSAEPPRR SRRAERAAAA TEKAAMRRAQ IALRRGAQPP RLFGPFGGRT FHKLRLPAHM ETTAQIAGIY PFVVDSGLAA PGMYIGRHVW SGNSFDFDVF ELYRQQVIEN PNFAVFGAVG SRKSALLKTL ISRGAAFGYQ AAVPCDPKGE YTRLARRLGC EPTYIGPGMA TRLNPLDAPP RPMGIADEDW AREVKRARSA LLSSLIETAK GVPLTPAEHT AVDLALDVVT RQITGASPDR WATPILPHVL EAMTDPTEED CVNLPMTATE LRDASRDATL TLRRLTHGAL GGLFDGPTTS PLDFDRPIAV LNLERVQGSD EMIALIMTCA QAWMEAALMR QDGVQRYVVY DECWRLMRFA GLVRRLSAQQ KLARQWGCAN AIVAHRISDL LSASPDSVEI AKGLLAETAI RILYKQASDQ IADTQAALGL TDVAADLLPR LDPGYALWLI GARAFYVEHV VGDLEIPVVL NGSKMHGEVD DTNLTPDDLD PAELDPPDLG PGRLRRMADE LLGDRELAGR ELAGGPAVPR AADLTGGFGY PPVEPSPSST G
|
| |