Gene Francci3_0034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0034 
Symbol 
ID3903609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp39979 
End bp42114 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content71% 
IMG OID637877364 
ProductType IV secretory pathway VirB4 components-like 
Protein accessionYP_479157 
Protein GI86738757 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.694512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCTCAG CACACGACGA CGCCTCGGCG CGCGGGTACC CGGTCCCGAC CGGCTACCCG 
ACCGGCGGCT ATCCGGGTAA CAACTATCCG GGTAACAGCA ACACGAGCCA TGGCCAGCCA
GGCAATGGCC AGCTGAGTAA CGGCTATGCG GGCAACGGCT ATGCGGGCAA CGGCCATGCG
GTGGGCGGTT ACGTGCCGGC CGGCTATCCA GCCGGTGAAG CCGTGGCCGG TCACCCCGAG
CGCGCCGACA GCAGCCCCGG CGGCCCCGAC GACGAGGGCC CCGACGGCGA CTTCGGCGAC
GGCGACTTCG ACGACAGCGA CTTCGACGAC AGCGACTTCG ACGACCCGCC CAGCTACGAC
GAGACCGGTT ACGACCCCGA CGGCTACGAG GCGGCGGACT ACGAGGCCAC CTCCCACGAC
CCGGGACGCG ATGCCGAGGG CTATGTGGAC CTCGGCGGCG GTCCGCGGCG GCGGCGCGAG
GGACGCCTAT CCGCGGAGCC CCCCCGGCGC AGCCGGCGCG CGGAACGGGC CGCGGCGGCC
ACCGAGAAGG CTGCGATGCG CCGGGCCCAG ATAGCCCTGC GCCGGGGCGC ACAGCCGCCA
CGGCTGTTCG GGCCGTTCGG CGGGCGAACG TTCCACAAGC TGCGGCTTCC CGCGCACATG
GAGACGACGG CGCAGATCGC CGGCATCTAC CCGTTCGTCG TCGACTCCGG CCTGGCCGCG
CCCGGCATGT ACATCGGTCG GCACGTCTGG TCCGGGAACT CGTTCGACTT CGACGTCTTC
GAGCTGTACC GCCAGCAGGT CATCGAGAAT CCGAACTTCG CCGTGTTCGG CGCGGTCGGC
TCCCGCAAGT CGGCGCTGCT GAAGACGCTG ATCTCGCGAG GGGCGGCGTT CGGCTACCAG
GCCGCCGTAC CCTGTGACCC GAAGGGCGAG TACACGCGGC TGGCTCGGCG CCTCGGCTGT
GAGCCCACCT ACATCGGGCC CGGCATGGCC ACCCGGCTGA ACCCGCTCGA CGCCCCACCC
CGTCCGATGG GCATCGCCGA CGAGGACTGG GCCCGTGAGG TGAAGCGGGC TCGTTCGGCG
CTGCTGTCGT CCCTGATCGA GACGGCGAAG GGGGTGCCGC TCACCCCGGC CGAGCACACC
GCGGTCGACC TGGCTCTCGA CGTCGTGACC CGGCAGATCA CCGGGGCGTC CCCGGACCGG
TGGGCAACCC CGATCCTGCC GCACGTGCTG GAGGCGATGA CGGATCCAAC CGAGGAGGAC
TGTGTCAACC TCCCGATGAC CGCGACCGAG CTGCGCGACG CCTCCCGGGA TGCCACGCTG
ACCCTGCGCC GGCTCACACA CGGCGCGCTG GGCGGTCTGT TCGACGGGCC GACCACCTCT
CCCCTGGACT TCGACCGTCC GATCGCCGTG CTGAACCTGG AGCGGGTCCA GGGCAGCGAT
GAGATGATCG CCCTGATCAT GACCTGCGCG CAGGCGTGGA TGGAAGCGGC ACTCATGCGC
CAGGACGGCG TGCAGCGCTA CGTCGTCTAC GACGAGTGCT GGCGGCTGAT GCGGTTCGCC
GGCCTCGTCC GCCGGCTGTC CGCCCAGCAG AAGCTCGCCC GACAGTGGGG ATGCGCGAAC
GCGATCGTCG CCCACCGCAT CTCCGACCTG CTGTCCGCCT CGCCGGACTC GGTGGAGATC
GCCAAAGGCC TGCTCGCCGA AACCGCGATC CGGATCCTCT ACAAGCAGGC GTCCGATCAG
ATCGCCGACA CGCAGGCCGC GCTCGGGCTC ACCGACGTCG CCGCCGACCT GCTCCCCAGG
CTCGACCCGG GCTACGCCCT GTGGCTGATC GGCGCCCGCG CCTTCTACGT CGAGCACGTC
GTCGGCGACC TGGAGATCCC CGTGGTGCTC AACGGCTCGA AGATGCACGG CGAGGTCGAC
GACACTAACC TGACCCCCGA CGATCTGGAT CCGGCCGAGC TGGACCCGCC GGATCTCGGT
CCCGGCCGGC TGCGCCGGAT GGCCGACGAA CTCCTCGGCG ACCGGGAGCT CGCCGGCCGG
GAGCTCGCCG GCGGCCCGGC CGTCCCACGG GCCGCCGACC TCACGGGCGG GTTCGGTTAC
CCGCCTGTGG AGCCCAGTCC CAGCTCCACC GGCTGA
 
Protein sequence
MSSAHDDASA RGYPVPTGYP TGGYPGNNYP GNSNTSHGQP GNGQLSNGYA GNGYAGNGHA 
VGGYVPAGYP AGEAVAGHPE RADSSPGGPD DEGPDGDFGD GDFDDSDFDD SDFDDPPSYD
ETGYDPDGYE AADYEATSHD PGRDAEGYVD LGGGPRRRRE GRLSAEPPRR SRRAERAAAA
TEKAAMRRAQ IALRRGAQPP RLFGPFGGRT FHKLRLPAHM ETTAQIAGIY PFVVDSGLAA
PGMYIGRHVW SGNSFDFDVF ELYRQQVIEN PNFAVFGAVG SRKSALLKTL ISRGAAFGYQ
AAVPCDPKGE YTRLARRLGC EPTYIGPGMA TRLNPLDAPP RPMGIADEDW AREVKRARSA
LLSSLIETAK GVPLTPAEHT AVDLALDVVT RQITGASPDR WATPILPHVL EAMTDPTEED
CVNLPMTATE LRDASRDATL TLRRLTHGAL GGLFDGPTTS PLDFDRPIAV LNLERVQGSD
EMIALIMTCA QAWMEAALMR QDGVQRYVVY DECWRLMRFA GLVRRLSAQQ KLARQWGCAN
AIVAHRISDL LSASPDSVEI AKGLLAETAI RILYKQASDQ IADTQAALGL TDVAADLLPR
LDPGYALWLI GARAFYVEHV VGDLEIPVVL NGSKMHGEVD DTNLTPDDLD PAELDPPDLG
PGRLRRMADE LLGDRELAGR ELAGGPAVPR AADLTGGFGY PPVEPSPSST G