Gene Francci3_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3809 
Symbol 
ID3905557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4566464 
End bp4568413 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content74% 
IMG OID637881135 
ProductDEAD/DEAH box helicase-like 
Protein accessionYP_482888 
Protein GI86742488 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.130283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.862178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCTC GGCCGCCGTG TAGCAGGCGT CGCCTCGTCA CCTGTGCACC CAGAGTTGCC 
ATCGAGGGCA CCGGGGCACA TGGATTTTCG GGTCGCGGGC GCCGCGCGGC TCACCGCGAG
CGGGTTCAGC GTAGACACCG CTCAGAGAAG ACCAGGAGCG TCCCGCTGTC CGTCACAGCT
GAAGTACCCA CAGATGATCT GACTCCCTAC GAGCCACAAA CCACCCCCTC CACCCCCGGG
TCCCCGGCGG CACCGACCTT CGCGGAGCTC GGCGTGCGCG CCGAGACCGT CTCGGCCCTG
ACCGAAGCGG GCATCGTGCA CGCTTTCCCC ATCCAGGAGT TGACGCTTCC ACTCGCCCTG
GCCCGCAACG ACATCATCGG GCAGGCCCGC ACCGGCACCG GCAAGACCCT CGCGTTCGGC
GTCCCGGTGG TGCAGACGGT GCTGGCGGCC AAGGAGGGTG CCGACGGCCG TCCGCAGGCC
CTCGTCGTGG TGCCCACCCG TGAGCTGTGC GTCCAGGTGA CCGCGGACGT CACCCGCGCC
GGCGCCCGCC GTGGCCTGCG GGTGCTGTCC GTCTACGGCG GGCGTGCCTA CGAGCCGCAG
CTGTCCGCGC TGCGCGCCGG GGTCGACATC GTCGTCGGCA CGCCCGGCCG CCTGCTGGAT
CTCGCCCGCC AGCACGTGCT CGACCTGGCC GGCGTCGGCA CCCTGGTGCT CGACGAGGCC
GACGAGATGC TCGACCTCGG CTTCCTGCCG GACGTCGAGC GCATCATGTC GCAGCTGCCG
ACCGAGCGGC AGACGATGCT GTTCTCCGCG ACCATGCCCG GCCCGGTCAT CTCCCTGGCC
CGGCGGTTCA TGAAACGGCC CGTGCACGTC CGCGCGGAAC AGCCGGATGA GGGGCGCACG
GTCCCGACCA CCCGTCAGCA CGTCTTCCGC GCCCACGCGC TGGACAAGAT GGAGGTGCTG
GCCCGGGTCC TGCAGGCCGG CGGCCGGGGG CTCGCCATGG TGTTCGTGCG GACCAGGCGC
ACCGCGGACA AGGTCGCCGA GGACCTCGCC AAGCGCGGCT TCGCGGCCGC GGCGGTGCAC
GGCGACCTGG GCCAGGGCCA GCGCGAGCAG GCGCTGCGCG CCTTCCGCTC CGGCAAGGTC
GACGTCCTGG TCGCCACCGA CGTGGCCGCC CGGGGCATCG ACATCAACGG TGTCACCCAC
GTGGTCAACT ACCAGTGCCC CGAAGACGAG AACGTCTATC TGCACCGCAT CGGCCGCACC
GGTCGGGCGG GCGAGAGCGG GGTGGCCATC ACCTTCGTCG ACTGGGACGA CCTGCCGCGG
TGGACGCTCG TCAACAAGGC GCTCGCCCTG CCGTTCGATG GCCCGGTGGA AACCTATTCC
ACCTCCCCCC ACCTGTACGA GGCGCTCGGC ATCCCGGCGG GCGCGAAGGG CACCCTGCCG
CACGCGGCGC GGACCCGCGC CGGGCTCGCG GCCGAGGACA TCGAGGATCT CGGGCAGTCC
GGTCGCGGCG GGCGCCGCGG CTCGCGGACC GGGCGTGACC AGGACCGTTC CGAGCCGGCG
GCGGTGCCGA CCCGGACTCG CGCCCGTCGG CGCACCCGCG GCGGTGGTGC GGCGGCCGCG
GGTGCGGGGC TGGCCATCGC CGCGGACCCG GCGGACCCGG CGGACCCGGT CGACGAGGAC
GGCCGGAAGG CCGGCGCACC CGTGGTGGAC GGTGCCGGGC AGACCGGGCT GGTCGAGTTC
ACCGGGACCG CCCCGCTCAC CGACACGGAC ACCGACACCG CCCGCGTCGT CTCCGCCCTG
GCCTCGGAGA CGGGCGTCGA GGCCGAGGAG TCGCCGCGCC GCCGGCGCCG GCGGCGCGGC
AACCGTGGCC GCGGCACGGG CACGATGCGG GAGGCCGGCG ACGGCACCGA GGCCGACGCC
GACGCGCCAC CCCGAGCCGA GTCGGCCTGA
 
Protein sequence
MSPRPPCSRR RLVTCAPRVA IEGTGAHGFS GRGRRAAHRE RVQRRHRSEK TRSVPLSVTA 
EVPTDDLTPY EPQTTPSTPG SPAAPTFAEL GVRAETVSAL TEAGIVHAFP IQELTLPLAL
ARNDIIGQAR TGTGKTLAFG VPVVQTVLAA KEGADGRPQA LVVVPTRELC VQVTADVTRA
GARRGLRVLS VYGGRAYEPQ LSALRAGVDI VVGTPGRLLD LARQHVLDLA GVGTLVLDEA
DEMLDLGFLP DVERIMSQLP TERQTMLFSA TMPGPVISLA RRFMKRPVHV RAEQPDEGRT
VPTTRQHVFR AHALDKMEVL ARVLQAGGRG LAMVFVRTRR TADKVAEDLA KRGFAAAAVH
GDLGQGQREQ ALRAFRSGKV DVLVATDVAA RGIDINGVTH VVNYQCPEDE NVYLHRIGRT
GRAGESGVAI TFVDWDDLPR WTLVNKALAL PFDGPVETYS TSPHLYEALG IPAGAKGTLP
HAARTRAGLA AEDIEDLGQS GRGGRRGSRT GRDQDRSEPA AVPTRTRARR RTRGGGAAAA
GAGLAIAADP ADPADPVDED GRKAGAPVVD GAGQTGLVEF TGTAPLTDTD TDTARVVSAL
ASETGVEAEE SPRRRRRRRG NRGRGTGTMR EAGDGTEADA DAPPRAESA