Gene Franean1_3811 details

Gene Information

Locus tag: Franean1_3811
Symbol:
ID: 5672175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4525771
End bp: 4528137
Gene length: 2367 bp
Protein length: 788 aa
Translation table: 11
GC content: 75%
IMG OID: 641242690
Product: ATPase central domain-containing protein
Protein accession: YP_001508110
Protein GI: 158315602
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID:
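
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. This is only a sketch: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS length counts one untranslated stop codon. Neither convention is stated on this page, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 4525771
end_bp = 4528137
gene_length_bp = 2367
protein_length_aa = 788

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 2367 789 788
```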


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.308842
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGCCT CTCGGGACGC CTCGCTCGTC GATCTGCTCG GCCGGCTCGC CGTCGTAGAG 
GAGCGGGTAC GGGCGGCCGT CGCCGCCCGG CGCGCCGTGG ACAGCGAGCC CGACGACGCC
TTCCGCGGCC TCTACCTGAG CGAGGAGCAG GTAAACGCGC TTCTCGGCCG GGCGGCATCG
GGCGGCGCCT GGTCCCTCGG CACACCCACC GTCCCGGCCG GGACGCCGCC TCCGGCGGAT
CCGGGCTCGG TGGGTCCGTC CGAGCTGTCC GTCGACGTGG ATCGCGCCGC TGCCGTGGCC
CTGTCCAGGG GGGAGTCGCT GCGGCTGCGC GACCTCGCCC GGCGGTGCGG CCTCACCGGG
CTGGACGTCG ACATCCTGCT GGTCGCGCTG GCGCCCGACC TGGACGCGCG GTTCGAGAAG
CTGTACGGCT ACCTGCAGGA CGACGTGACC CGCCGGCGTG CCAGTCCGGG GCTGGCCCTC
GAGCTGTGCG GGCGTTCCCC ACTGGACGCG GATGCCCGGG CACGGTGCAC GCCGGACGGT
CCCCTGGTCG GCGCCGGCCT GCTGATCGTG GAGGACGCCG AGCGGCCGTT CCTGAGCAGG
TCGCTGAGGG TCCCCGACCG GGTGGCCGGC TACCTGCTCG GCGACGACCG GCCGGGCGCG
GCACTCCGCG CGGTGCTCGC CGACGCGCCG GACGTCGGCG GCCCGCTGGT AGCGCGGCTC
GCCCGTGCGT TCCGGTCCGG GCAGCCGTCC GGGTCGTCCG GGTCGTCCGG GTCGTCCGGG
TCGGCCGAGT GTGCCGATCC CGCCGGCTCG GCGGTGAGCC TGGTCTACCT GCGTGCTGCA
CCCGGTGCCG ATGGGCTGGC CGTGGCGGCG TCCGCCTGTC GGCAGGCGGG CCGGCCCTTC
GTCGGGCTCG ACCTGGCCGC CGCCGCGCGG CGGCGGACCG GCGCGGGTGG AGCGGCGCCG
GGAGACGGAG CGGGCTCCGG CGCCGACCAT GCCGACCATG CCGGCCCCGC CCTCGACGTA
AGCGGTGGTG TGTCGCTGGT CGTGCGGGAG GGGATCCGGG AGGCGAGGCT TACCGGCGCG
GTGCTCATCA TCGGTCCGGT GGAGGCGCTC GACGGGGATC CCCTCGCCGC CGTCGTCGCC
GCTTCATCGG ACGTCGGGCC GGTGCCGCCG GTGCAGACCG TGGTGCACGG CCGGCGGGCC
TGGGAGCCGG GGACATCGGG GGACCCTCCA CTGGTCGTCG ATGTGGCGCC GCTCGGCGCG
GCCGAGCTGG CGACGGTCTG GGCGGCGGAA CTCGGAGCGG AGATGCCGGC CGAGCTGGCG
GCGTTCCGGC TTACCCCGAG GCAGGTGCGG CGGGCGGTCG CCACCGCCCG CGCGTCGATG
GTCGCCGACG GCGCTGCGCC CATGCCGGCG GCGCGGCCAC GGCTGGACCC TGTCTCGTTC
CCGGCCGGTA CGGCGCCGGG GCCGGCTTCG GGTGGTACGG CACCGGAGCC CGATCCGATC
CGGTTGGCGT CGGCGGCGCG GCAGCAGAAC GCGACCGGGC TGGAGCGGCT GGCCCGCCGC
ATCACCCCGG CGGTGGGCTG GGACGACCTG GTGCTGCCGC CGCGGACCCT CACCACCCTG
CGACATCTTG CCGGGCGGGC GGCCCACCGT GGCCAGGTGC TGGACGACTG GGGCCTGCGG
CGCGGCGGAG GCCGGGGCGA GGCGATCATC GCCCTTTTCG TCGGCGAGTC GGGTACCGGC
AAGACGATGG CCGCCGAGGT GGTCGCCGGC GCGCTCGGCG TCGACCTGTA CGTCATCGAC
TTGTCGACGG TCGTCGACAA GTACATCGGG GAGACCGAGA AGAACCTGGA GCGCATCTTC
ACCGGCGCGG AGGGGCTGAA CGCGGTGCTG TTCTTCGACG AGGCCGACGC GCTGTTCGGC
AAGCGGTCGG AGGTCGGCGA CGCGCGGGAC CGGTACGCGA ACGTGGAGGT CGCCTACCTG
CTGCAGCGGA TCGAGAGCTT CGACGGGGTG GCGATCCTGG CGACGAACCT GGCGGCGAAC
CTTGACGACG CGTTCCGCCG CCGGCTGTCG GTGGTCGCCG AGTTCGCCCG TCCGGACGTC
GAGGCACGGC TGGCGTTGTG GCGGCGGATG CTGCGCGGGG TCCCGCTGGC CGCCGACGTC
GACCTCGAGT TCTGCGCGAA GGCGTTCGAG CTCGCCGGCG GCGACATCCG CAACGCCGCC
GTCACGGCCG CCTACCTGGG CGCGGCCAAC GGCCAGGTGG TGGACATACG GGAGTTGGTG
ACCGCGATCG GCCTCGAATA TCGCAAGCTC GGGCGCCTCT GCCTGGCCGA CGAGTTCGGC
CCCTACTTCG ACCTGCTGCC GCGCTGA
 
Protein sequence
MPASRDASLV DLLGRLAVVE ERVRAAVAAR RAVDSEPDDA FRGLYLSEEQ VNALLGRAAS 
GGAWSLGTPT VPAGTPPPAD PGSVGPSELS VDVDRAAAVA LSRGESLRLR DLARRCGLTG
LDVDILLVAL APDLDARFEK LYGYLQDDVT RRRASPGLAL ELCGRSPLDA DARARCTPDG
PLVGAGLLIV EDAERPFLSR SLRVPDRVAG YLLGDDRPGA ALRAVLADAP DVGGPLVARL
ARAFRSGQPS GSSGSSGSSG SAECADPAGS AVSLVYLRAA PGADGLAVAA SACRQAGRPF
VGLDLAAAAR RRTGAGGAAP GDGAGSGADH ADHAGPALDV SGGVSLVVRE GIREARLTGA
VLIIGPVEAL DGDPLAAVVA ASSDVGPVPP VQTVVHGRRA WEPGTSGDPP LVVDVAPLGA
AELATVWAAE LGAEMPAELA AFRLTPRQVR RAVATARASM VADGAAPMPA ARPRLDPVSF
PAGTAPGPAS GGTAPEPDPI RLASAARQQN ATGLERLARR ITPAVGWDDL VLPPRTLTTL
RHLAGRAAHR GQVLDDWGLR RGGGRGEAII ALFVGESGTG KTMAAEVVAG ALGVDLYVID
LSTVVDKYIG ETEKNLERIF TGAEGLNAVL FFDEADALFG KRSEVGDARD RYANVEVAYL
LQRIESFDGV AILATNLAAN LDDAFRRRLS VVAEFARPDV EARLALWRRM LRGVPLAADV
DLEFCAKAFE LAGGDIRNAA VTAAYLGAAN GQVVDIRELV TAIGLEYRKL GRLCLADEFG
PYFDLLPR
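
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is an accepted alternative start codon and an initiating codon is read as methionine. The sketch below illustrates this on the first 60 bases of the gene sequence. The helper `translate` and the constant names are hypothetical, written for this example rather than taken from any library, and the codon table is the standard amino acid assignment, which table 11 shares.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
head = "GTGCCCGCCT CTCGGGACGC CTCGCTCGTC GATCTGCTCG GCCGGCTCGC CGTCGTAGAG"
print(translate(head))  # MPASRDASLVDLLGRLAVVE
```

The result matches the first 20 residues of the protein sequence listed above.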