Gene Franean1_4741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4741 
Symbol 
ID5673083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5661882 
End bp5663357 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content75% 
IMG OID641243598 
ProductDEAD/DEAH box helicase domain-containing protein 
Protein accessionYP_001509014 
Protein GI158316506 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCTTCGT TCACCGATCT TGGCGTGCCC GCGCGCCTGA CCGATGTTCT CACCCGTCTC 
GCCATCGAGA CCCCCACCCC CATCCAGCGG GCGACGCTGC CGGACGCGCT CGCCGGACGT
GACGTCCTCG GCCGCGGCCG CACGGGGTCC GGCAAGACGC TCGCGTTCCT CCTGCCCGTG
GTCGCGCGGC TGTCCGGCGG GCCCCGGGCC CAGGCCGGGG CTCCGCGCGC GCTGGTTCTG
GCGCCCACCC GCGAGCTCGC CGCGCAGATC GACACGGCGC TGCGACCGCT GGCCGCCGCG
GCCGGCCTCA CCTCGTGCAC CGTGTTCGGC GGCGTCGGTC AGGAACCGCA GGTGGCCGCG
ATCCGGCGGG GTGTCGACAT CGTCGTGGCC TGCCCCGGCC GGCTGGAGGA TCTCCTCGCG
CAGGGGCGCT GCTCGCTGGC CCAGGTGAAG ATCGTCGTGT TGGACGAGGC CGACCACATG
GCCGATCTGG GGTTCCTGCC CGCGGTGCGC CGGCTGATCG GCCAGACCGC GGCCGGTGGG
CAGCGCATGC TGTTCTCGGC GACCCTGGAC AAGGCCATCG ACACCCTGGT GCGCCAGTAC
CTCAGCAGGC CCGCGGTACA CGAGGCCGAC TCCGCCCAGT CCCCGGTGGG CACGATGGCG
CACCATCTGC TGCACGTCGA GCGGGACAGC CGGCTTCCCG TGCTCGTCGA CCTGTCCAGC
GCCCCGGGCC GCGCGGTCGT GTTCACCCGC ACGAAGCACG GCGCCAAGGC GCTGGCCCGC
CAGCTCAACC GCGCCGGCGT GCGCGCGGTC GAGCTGCACG GCAACCTCAG CCAGAACGCC
CGTACCCGCA ACCTCGACGA CTTCCACACC GGCCGGGCCT CGGCGCTGGT CGCCACGGAC
ATCGCCGCCC GCGGCATCCA TGTCGACAAC GTGGCACTCG TGATCCACGC CGACCCACCG
GCCGACCACA AGGCATACCT GCACCGTTCC GGGCGCACCG CGCGTGCCGG CAACAGTGGC
ACCGTCATCA CCCTGGCGAC CACGGAGCAG ATCCGTGAGG TCACGCAACT GGCCCGCGCC
GCCGGCATCA GGCCGACGAC CACCCGGGTC GCCTCCGCCG CTCACCCGAT ACTCGCCACG
CTCGCCCCCG GCGCCCGGTC GCTCAGCAGC CCCGTGGTCG CGGCACAGGC GGAGACCAGG
AGCACTCCAG CCCAGCGCCG GCCGGTCGGT GAAGACCGCG GCACCGAGCC GACGCCTCGA
ACGGGCAGAC GGCGTGGACG CCGCCCCGCC GGCAACCACG GCGGCCGCGC CGCCGACCAC
GGCCGCCGCG CCGGGCGGTC CACGGCGTCC CGATCGCCCG CGGTCACGGC TCGTACCAGC
TCACCCGAGC ACGAGGCCGG CCCGTCACCC GCGAAGCGGT CCAGCTCGCG ACCGACCCAC
AGCGCCGCGG CGTTCAGCGC ACGCCGCGCC CGCTGA
 
Protein sequence
MSSFTDLGVP ARLTDVLTRL AIETPTPIQR ATLPDALAGR DVLGRGRTGS GKTLAFLLPV 
VARLSGGPRA QAGAPRALVL APTRELAAQI DTALRPLAAA AGLTSCTVFG GVGQEPQVAA
IRRGVDIVVA CPGRLEDLLA QGRCSLAQVK IVVLDEADHM ADLGFLPAVR RLIGQTAAGG
QRMLFSATLD KAIDTLVRQY LSRPAVHEAD SAQSPVGTMA HHLLHVERDS RLPVLVDLSS
APGRAVVFTR TKHGAKALAR QLNRAGVRAV ELHGNLSQNA RTRNLDDFHT GRASALVATD
IAARGIHVDN VALVIHADPP ADHKAYLHRS GRTARAGNSG TVITLATTEQ IREVTQLARA
AGIRPTTTRV ASAAHPILAT LAPGARSLSS PVVAAQAETR STPAQRRPVG EDRGTEPTPR
TGRRRGRRPA GNHGGRAADH GRRAGRSTAS RSPAVTARTS SPEHEAGPSP AKRSSSRPTH
SAAAFSARRA R