Gene Franean1_5135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5135 
Symbol 
ID5673469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6151638 
End bp6153506 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content75% 
IMG OID641243985 
Producttype III restriction protein res subunit 
Protein accessionYP_001509399 
Protein GI158316891 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0782203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCCGC GGTGCGCGGC GTCGTCGTCC GGCTGGCGCA CCACCATAGA GTCTGATGTC 
GTGAGAGTCG GACCCCTGCC TCAGAGCTCC AGCGGGCAAG GCCCCAGCGG ACGAGGCCCC
CGCGCGCGCG GCGAGGCCCG GCCCCTGCGG GCATGGCAAC GCGCAGCCCT GGAGACCTAC
CGGTCGCGCA GCGCGTCCGG CGGCCGCGAC TTCCTGGCCG TCGCGACCCC GGGCGCCGGC
AAGACGACGT TCGCGCTGGA GATCGCCGCC GACCTGTTGG CCGCGGGCGA GGTGCGTTCG
GTGACCGTGG TCGCCCCGAC CGAGCACCTC AAGCGGCAGT GGGCCAACGC CGCCTCCGCG
GTCGGGGTCG ACCTGGACCC GACCTTCCGC AACTCGGCCG GCGCCACCGC GTCGGACTAC
ACCGGGGTCG CGGTCACCTA CGCCCAGGTC GCCGCGCACC CCGCCCTGCA CCGCATGCGC
ACGGCCGCGC GCCGCACGCT GGTGATCCTC GACGAGATCC ACCACGCGGG CGACGCCCTG
TCCTGGGGCG AGGCGGTCCG GGAGGCGTTC GAGCCGGCCG CCCGCCGGCT CGCACTCACC
GGAACCCCGT TCCGGTCCGA CGTCAACCCG ATCCCGTTCG TGACCTACCT GCCCGACGCC
GAGGGCGTGA CGCGCAGCGT CGCGGACTCC TCCTACGGTT ACGCCGAGGC GCTGCGTGAC
GGTGTGGTCC GTCCCGTGCT CTTCCTCGCC TACTCGGGTG AGATGTCCTG GCGCACCAGC
GCCGGCGCGG AGCTCAGCGC CCGGCTCGGT GAGCCGCTGA ACAGCGAGCA GACCGCGGCA
GCGTGGCGCA CGGCCCTCGA CCCCCGCGGA GACTGGATGC CCGCGGTCCT GGCCGCCGCC
GACACCCGGC TCTCCCAGGT GCGCCGGGGC GGGATGCCGG ATGCCGGAGG CCTGGTCATC
GCGACCGACC ACACCAACGC CCGCGCCTAC GCCGGCCTGC TGCGGCGGAT CACCGGCGCG
TCTCCCGTGA TCGTCCTCTC CGACGACCCG ACCGCCAGCA CGAAGATCGC CACGTTCCGT
GAGTCGACGG ACCGGTGGAT GGTCGCCGTC CGCATGGTCA GTGAGGGCGT GGACGTCCCC
CGCCTGGCGG TCGGCGTGTA CGCCACCTCG GCCTCGACGC CGCTGTACTT CGCCCAGGCC
GTCGGCCGGT TCGTCCGTGG CCGCGGACGC TCGGAGACGG CGTCGGTGTT CCTGCCCAGC
GTCCCGTCGC TGCTGGCGCT GGCCGGCGAG ATGGAGGTCC AGCGCGACCA CGCGCTCGAC
AAGCCGCAGC GCGAGCCCGA CGCGTTCGAC GACGACGCGC TGCGCGAGGC CAACCGCCGC
CGTGACACCC CCGACAAGCC CGACACCCTG TTCACCGCGC TCGGCTCCTC CGCCCAACTC
GACCGGGTGA TCTTCGACGG CGGCGAGTTC GGCACGCCGG CCGCCTCCGG CTCCCTCGAG
GAGGAGGACT TCCTGGGTCT GCCGGGCCTG CTCGAGCCCG ACCAGGTCGC GACCCTGCTG
CGCCAGCGCC AGGCGGCGCA GCAGGCCGCC GCGGCGAAGG CGCAGTCCGC GGCCGGCGAA
CCCGTGGTGC CGGCCGCCCG GCAGGGGGAG GCGGGCACCG ACCCGGGCGA CCGGCCCGTC
CACGAGCAGA TCGGTGACCT GCGGCGCGAG CTGAACAAGC TTGTCGCCGC GCACTACCAT
CGCACCGGAA AGCCGCACGG GATGATCCAC GCCGAGCTGC GCCGCTCCTG CGGCGGCCCG
CCGAGCGCCC AGGCCAGCAC GGCCCAGCTC CAGGCCCGGA TCGACACGAT GCGCCGCTGG
GCCGGCTGA
 
Protein sequence
MAPRCAASSS GWRTTIESDV VRVGPLPQSS SGQGPSGRGP RARGEARPLR AWQRAALETY 
RSRSASGGRD FLAVATPGAG KTTFALEIAA DLLAAGEVRS VTVVAPTEHL KRQWANAASA
VGVDLDPTFR NSAGATASDY TGVAVTYAQV AAHPALHRMR TAARRTLVIL DEIHHAGDAL
SWGEAVREAF EPAARRLALT GTPFRSDVNP IPFVTYLPDA EGVTRSVADS SYGYAEALRD
GVVRPVLFLA YSGEMSWRTS AGAELSARLG EPLNSEQTAA AWRTALDPRG DWMPAVLAAA
DTRLSQVRRG GMPDAGGLVI ATDHTNARAY AGLLRRITGA SPVIVLSDDP TASTKIATFR
ESTDRWMVAV RMVSEGVDVP RLAVGVYATS ASTPLYFAQA VGRFVRGRGR SETASVFLPS
VPSLLALAGE MEVQRDHALD KPQREPDAFD DDALREANRR RDTPDKPDTL FTALGSSAQL
DRVIFDGGEF GTPAASGSLE EEDFLGLPGL LEPDQVATLL RQRQAAQQAA AAKAQSAAGE
PVVPAARQGE AGTDPGDRPV HEQIGDLRRE LNKLVAAHYH RTGKPHGMIH AELRRSCGGP
PSAQASTAQL QARIDTMRRW AG