Gene Francci3_3584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3584 
Symbol 
ID3904138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4283869 
End bp4285209 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content72% 
IMG OID637880905 
ProductDNA processing protein DprA, putative 
Protein accessionYP_482665 
Protein GI86742265 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCGA CGACTCCGCC GGATGCCCGG GGGGCGGATC CCGCTTCCGC GCCCGGGAGG 
GTGGCACCGG GTGACAGCGA CTGGTCGGAT CCCGAGCGGT TGGCCCGTGT CGCCCTGGCT
CGCGTCTTCG GCCCGGAACA TCGTCGCGTG GCCGTCGAGG TCAGGCGTCG GGGTGCGTTC
GAGGTGTGGA ATGCGCTCCG GGCGGCGCAT CCGAGTGTCG ATCCGGTTCG GGACCTGGAC
GCGGCATGGC GCGCCGGCGC CCGGCTGGTC TGTCCCCAGG ACGCCGAGTG GCCCCTCGAA
CTGGATGCCC TGGACCGCCT TCGGGACGCG GGGGATGGTT CGATGATCGG CACTCCACTG
GCTCTGTGGG TTCGCGGTCC GCTCAACCTG AGCGAGCTCC CACCCCGGGC GGTCACAGTC
GTGGGCTGCC GGACCGCGAC CAGCTACGGG CTGCATCTCG CCGGAGAGAT CGCGTTTGCG
ATGGCGGAAC AGGGATGGGC CGTGGTGTCG GGAGCCGCGT TCGGCATTGA CGCAGCGGCA
CATCGGGGGG CGTTGGCCGC AGCCGGACCG ACGGTGGCGG TGCTCGCCGG GGGTGTTGAC
GTTCCCTACC CGACCGCCCA TGTGGAACTG CTGGAGGAGA TCGCCCGTAC CGGGGCGGTA
GTCAGCGAGG TGTCGCCGGG CACGCCGCCG TACCGACGCC GATTCCTCAC CCGTAATCGC
ATCATCGCGG CTCTGTCCCG GGGGACGGTC CTGGTCGAGG CGGGCCACCG TAGCGGCGCG
CTGAACACGG TCGCCCACAC CCGTCGGCTC GGTCGTCCCG TCATGGTCGT TCCGGGACCG
GTGACCAGCG CCATGAGCGC AGGCTGTCAC CGGCTGCTCC GGGACTTCCG TGAACAGACG
GTTCTGGTCA CCGGGGCCGA GGACATCAGG GAGGAAATCG CGAGTATCGG ATCGCTCGTA
CAGCGGCCGG CGAGCGGGAA TGGCCCGCGG GACGGATTGT CCGAGGCGGT GCGCGAGCTT
CTCGACGCGA TGCCGGCCCG CGCTGCCGTC GGGGTGTCCG TGCTGGCGCG CCGCACCGGC
CTGCGCCCCG AGGCGGTGCT GGCGATGCTG GGCCCACTCG CGGTGGAGGG GCTGGTCGAG
AACGTGGCGG GCGGTTACCG CCTCACGGAT CTGGGCCGAG CGCCGTCGAA CCCGTCCCAT
CCCGCAACGT CCGGCAGGAG GTCCGGTACC CAACCGGGTG CCGCCCACGG CGGGGCGGAT
CGTTCCCCGA CGCGCAGTAC CGCGGATCCG GGACTCGACG GGGAAAACCC CGACGGCGAA
AACCCCGACG GGGAGACCTG A
 
Protein sequence
MSSTTPPDAR GADPASAPGR VAPGDSDWSD PERLARVALA RVFGPEHRRV AVEVRRRGAF 
EVWNALRAAH PSVDPVRDLD AAWRAGARLV CPQDAEWPLE LDALDRLRDA GDGSMIGTPL
ALWVRGPLNL SELPPRAVTV VGCRTATSYG LHLAGEIAFA MAEQGWAVVS GAAFGIDAAA
HRGALAAAGP TVAVLAGGVD VPYPTAHVEL LEEIARTGAV VSEVSPGTPP YRRRFLTRNR
IIAALSRGTV LVEAGHRSGA LNTVAHTRRL GRPVMVVPGP VTSAMSAGCH RLLRDFREQT
VLVTGAEDIR EEIASIGSLV QRPASGNGPR DGLSEAVREL LDAMPARAAV GVSVLARRTG
LRPEAVLAML GPLAVEGLVE NVAGGYRLTD LGRAPSNPSH PATSGRRSGT QPGAAHGGAD
RSPTRSTADP GLDGENPDGE NPDGET