Gene Francci3_0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0906 
Symbol 
ID3906281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1048870 
End bp1051167 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content70% 
IMG OID637878239 
ProductType IV secretory pathway VirD4 components-like 
Protein accessionYP_480019 
Protein GI86739619 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCCGG CCATCGTCGT GGCGGCCAGC ACCCTCTTGG TGCTCGGGAC CATCATCGGG 
ATGCGGGCGC TCGACGCCCG GCGCTGGCGG ACGTCCCTGG TGGCCTACCG CTTGACCATC
CCGTTCGACC TCAAGCCCGA CGACGTAGCT CAGTGGCTGG CGCTGGTTGC CGCAGCAACC
CAAGCGCCGC GCTGGTCGCT GCTGGTTCTG CCGCCGCTGG GCCTAGAGAT CGTGGCCACC
CAGCAGGGGA TCACGCACTA CGTGCTGCTG CCGAAGGCTG CCGAGGCCCG TCTGCTGAAC
ACGATCCGGG CGGGCTTGCC CAGCGCCAGG CTGGAAGCCG CCCCGGAGTT CCTGGACCGC
CAGCTGCGTT GCCGGATTGC CGGCGAACTG ACCATGACCT CCCGCCGCCG GCCGCTGTCG
CGCGATCGAG CCGAGAGCAT CACGGCCAGC CACCTGTCGT CGCTTCAGCC GCTGCGTTCG
GGCGAGGAAG TACGGGTGCA GTGGGCGCTG ACCAGCGCCG GGAACAGCGC TCCGGTGCAC
AGCGCCTCGG CCCGTGCCGG CGACGGCTGG TGGAGTACCT ACCTCGTGGA GGGCACGGTG
CCGGCCAATG CCGAAGCTGT ACGCGCCCAG CGCACCAAGG AAGCCGATCC GCTGCTGCAC
GCCGTAGCCC GGGTTGGAGT CGTGGCGGCC GATCACCGGC GGGCCAAGAC CTTGCTGGCG
CGTACCTGGT CGACGCTGCA CGGGCCGAAC GCCCCGGGCG TCAGCATGGT GCGCCGCTGG
CTGCCGGCCG ACATCGTGGC CCGGCGGATG GAACGGCGGG CGCTGCCGTT GACCCGCTGG
CCGCTGCTAC TCAGCAGTGT CGAACTACCG AGCCTGGTCG GTTTTCTGCT GGGTTCGGTG
TCGCTGCCGG GCATGCCGCA GGGCGGTAGC CGCCTCCTGG CTCCGTCACC GGGGATGCCC
CGCACCGGCA CGATACTGGC GCAGAGCAAC TACGGCAGCG AGGCCGTACC CCTGGCGATG
CGGACGCCGG ACCGGCTGCG CCACCTCTAC CTCTTGGGGC CGACCGGCGT CGGGAAGTCC
ACCCTGATCG ACAACGTGGC GCTCCAGGAT GCCGCCGCCG GCCTCGCCGT CCTGCTGATC
GACCCCAAGG GCGACCTGGT GGACGACTTC CTGGCCCGCG TGCCGGAGGA ACGGGCGGAC
GACGTCGTAG TGCTCGACCC ATCGGCCACC GCCCGGCCGG TGGGCTTCAA CCTCTTCGGG
GGCCTGCGCA CGGAGCAGGA CAAGGAACTG GCCGTAGACA ACGTGGTGCA CATCATGGCC
GAGCTGTGGC GCAGCTCGTT CGGCCCGCGG ACCACGGACG TCTTGCGCAG CTCCCTGTTG
ACATTGACGC ACACCACGGC CGCGGACGGC TCAGCCTTCA CCCTGGCCGA GGTCCCAGAA
TTGCTAATGA ATCCGACATT TCGCCGCTCC GTGACCAACC AGCCGAGTGT GCCGGCCGGG
GTCCGACCCT TCTGGCACGC CTACGAGGAG ATGAGCGACG TCCAGCGCTT GCAGGTCATC
GGGCCAGCCA TGAACAAGCT GCGGGCCATC CTGACCCGTT CGCCGTTGCG CCTCATGCTG
GGGCAGAGCC AGGGCTTCGA CCTCACCGAA CTGTTCACCA AACGCCGCAT CGTGCTGGCC
CCGCTGTACA AGGGCGTCAT CGGCACCGAC ACGGCGCAGT TGCTCGGCGC GCTGTTGGTG
GCCCTCTTCT GGCAGCGCAC CCTGGCACGG GCCGCCGTGC CCGCCGAGCA GCGGCGGCCG
GTCATGGTCC ACGTAGATGA ATTCCAGGAT GTCCTGCGGC TGCCACTGGA CATCGCCGAC
ATGCTGGCTC AGGCCCGCGG CCTCGGTGTC GGGCTGACGC TGGCGAACCA GCAGCTCGGC
CAGCTCTCGG ACGCCATCAA GTCGGCGGTA CTGGGGACGG TGCGCTCGTC GGTCGTCTTC
CAACTCGACT ACGACGACGC CCAGAAGATG GCCCAACGGT TCGCGCCCCT GACCCGGGAC
GACCTCATGG GCCTGAGCGC GTACGAGGTG GCGCTCCGGC TCAACATCAA CAACACCACG
TATCGGCCGG TGACCGGGAA GACGTTGCCG CTACCGGATG CACTGCGTGA CGGCCGGGAC
TTGGCCGAAG CCAGCCGCCA GCGCTTCGGC ACGGCCCGCG AGGACGTCGA AGCGGCTTTG
CGCGCCCGTG TCGGCCGCCC GGACGAGACC GGCACCGGCG GGACCATCGG CCGCCGTCGG
CGAGGGGGCA CAGCATGA
 
Protein sequence
MIPAIVVAAS TLLVLGTIIG MRALDARRWR TSLVAYRLTI PFDLKPDDVA QWLALVAAAT 
QAPRWSLLVL PPLGLEIVAT QQGITHYVLL PKAAEARLLN TIRAGLPSAR LEAAPEFLDR
QLRCRIAGEL TMTSRRRPLS RDRAESITAS HLSSLQPLRS GEEVRVQWAL TSAGNSAPVH
SASARAGDGW WSTYLVEGTV PANAEAVRAQ RTKEADPLLH AVARVGVVAA DHRRAKTLLA
RTWSTLHGPN APGVSMVRRW LPADIVARRM ERRALPLTRW PLLLSSVELP SLVGFLLGSV
SLPGMPQGGS RLLAPSPGMP RTGTILAQSN YGSEAVPLAM RTPDRLRHLY LLGPTGVGKS
TLIDNVALQD AAAGLAVLLI DPKGDLVDDF LARVPEERAD DVVVLDPSAT ARPVGFNLFG
GLRTEQDKEL AVDNVVHIMA ELWRSSFGPR TTDVLRSSLL TLTHTTAADG SAFTLAEVPE
LLMNPTFRRS VTNQPSVPAG VRPFWHAYEE MSDVQRLQVI GPAMNKLRAI LTRSPLRLML
GQSQGFDLTE LFTKRRIVLA PLYKGVIGTD TAQLLGALLV ALFWQRTLAR AAVPAEQRRP
VMVHVDEFQD VLRLPLDIAD MLAQARGLGV GLTLANQQLG QLSDAIKSAV LGTVRSSVVF
QLDYDDAQKM AQRFAPLTRD DLMGLSAYEV ALRLNINNTT YRPVTGKTLP LPDALRDGRD
LAEASRQRFG TAREDVEAAL RARVGRPDET GTGGTIGRRR RGGTA