Gene Franean1_4882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4882 
Symbol 
ID5673222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5856509 
End bp5858314 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content65% 
IMG OID641243737 
ProductATPase central domain-containing protein 
Protein accessionYP_001509153 
Protein GI158316645 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1222] ATP-dependent 26S proteasome regulatory subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000159946 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.231188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGGTC CCCGTTCGGG CTCTGGCTCC GGTGGGAGCA CGGGTCGTCC TGGTGACGCC 
GATTCTCAAC GGTCGGCGTA CGAGAAGGAA GTACACGAAC TCACGACTCA GGTCACCTTC
CTGGAGGAAG AAGTGGCCAT GCTGCGGCGG AGGCTGTCTG AATCGCCCCG ACAGGTACGT
GTCCTGGAGG AGCGACTAGC CCAGGTACAG GTGGAGCTAC AGACCGCCAC TGGGCAAAAC
GACAAGCTCG TCGCCACCCT TCGGGAGGCA CGTGACCAGA TCATCTCGTT GAAGGAGGAG
GTCGACCGGC TCGCGCAACC GCCGAGCGGG TACGGCGTCT TCATCCGTGG CTACGACGAC
GGCACGGTCG ACGTGTTCAC GCAGGGCAGA AAGCTCCGCG TGACGGTGTC GCCGAACGTC
GAAGCCGACG TCCTGCAGCC CGGTCAGGAG GTCATGCTCA ACGAGGCGCT CAACGTCGTG
GAGGTTCGCG CGTTCGAGCG GCAGGGCGAG ATCGTCCTGC TCAAGGAGGT CCTCGAGAGC
GGTGACCGGG CGCTGGTGAT CGGTCACACC GACGAGGAAC GGGTCGTCAT GCTGGCCCAG
CCACTCCTCG ACGGCCCGAT CCGAGCCGGC GACTCGCTCC TCATCGAGCC ACGGTCGGGG
TACGCCTTCG AGCGGATCCC CAAGTCCGAG GTCGAGGAGC TGGTCCTCGA AGAGGTCCCA
GACATCGGCT ACGAGCAGAT CGGCGGCCTG AAGGGCCAGA TCGAGTCGAT CCGCGACGCG
GTCGAGCTGC CGTTCCTCTA CAAGGAACTG TTCCTGGAGC ACAAGCTCAA GCCACCGAAG
GGCGTGCTGC TCTACGGCCC GCCCGGCTGT GGCAAGACGC TGATCGCCAA GGCCGTGGCG
AACTCCCTGG CCAAGAAGGT CGAGGCACAG ACAGGCCAGG GCTCCGGCCG GGCCTTCTTC
CTCAACATCA AGGGCCCGGA GCTGCTCAAC AAGTACGTAG GCGAGACCGA GCGGCAGATC
CGGCTGGTGT TCCAGCGGGC GCGCGAGAAG GCGTCCGAGG GCATGCCGGT GATCGTGTTC
TTCGACGAGA TGGACTCGAT CTTCCGGACC CGTGGCTCGG GTGTCTCCTC GGACGTGGAG
AACACGATCG TCCCGCAGCT GCTCAGCGAG ATCGACGGCG TTGAGCAGCT CGAGAACGTC
ATCGTGATCG GCGCGTCCAA CCGAGAGGAC ATGATCGACC CGGCGATCCT GCGGCCGGGC
CGGCTCGACG TGAAGATCAA GGTCGAGCGT CCGGACGCCG AAGCGGCCAA GGACATCTTC
GCCAAGTACG TCCTGCCCGA GCTCCCGCTG CACGCCGACG ACCTCGCCGA GCACGGAGGT
AACCGGGAGG CGACCTGCCA GGGCATGATC CAGCGGGTCG TCGAGCGGAT GTACGCCGAG
AGCGAGGAGA ACCGCTTCCT CGAGGTCACC TACGCCAACG GTGACAAGGA GGTCCTGTAC
TTCAAGGACT TCAACTCGGG CGCGATGATC GAGAACATCG TGGCCCGGGC GAAGAAGATG
GCGGTGAAGG ACCTCATCGA GAGCGGAGTC CGCGGCCTGC GCATGCAGCA CCTGCTGTCG
GCGTGCCTGG ACGAGTTCAA GGAGAACGAG GACCTGCCGA ACACCACGAA CCCGGACGAC
TGGGCCCGGA TCTCCGGCAA GAAGGGTGAG CGGATCGTCT ACATCCGCAC ACTCGTCACC
GGAACCAAGG GCACCGAGGC CGGGCGGTCG ATCGACACCA TCGCGAACAC CGGCCAGTAC
CTCTAG
 
Protein sequence
MSGPRSGSGS GGSTGRPGDA DSQRSAYEKE VHELTTQVTF LEEEVAMLRR RLSESPRQVR 
VLEERLAQVQ VELQTATGQN DKLVATLREA RDQIISLKEE VDRLAQPPSG YGVFIRGYDD
GTVDVFTQGR KLRVTVSPNV EADVLQPGQE VMLNEALNVV EVRAFERQGE IVLLKEVLES
GDRALVIGHT DEERVVMLAQ PLLDGPIRAG DSLLIEPRSG YAFERIPKSE VEELVLEEVP
DIGYEQIGGL KGQIESIRDA VELPFLYKEL FLEHKLKPPK GVLLYGPPGC GKTLIAKAVA
NSLAKKVEAQ TGQGSGRAFF LNIKGPELLN KYVGETERQI RLVFQRAREK ASEGMPVIVF
FDEMDSIFRT RGSGVSSDVE NTIVPQLLSE IDGVEQLENV IVIGASNRED MIDPAILRPG
RLDVKIKVER PDAEAAKDIF AKYVLPELPL HADDLAEHGG NREATCQGMI QRVVERMYAE
SEENRFLEVT YANGDKEVLY FKDFNSGAMI ENIVARAKKM AVKDLIESGV RGLRMQHLLS
ACLDEFKENE DLPNTTNPDD WARISGKKGE RIVYIRTLVT GTKGTEAGRS IDTIANTGQY
L