Gene Franean1_5437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5437 
Symbol 
ID5673768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6575755 
End bp6579186 
Gene Length3432 bp 
Protein Length1143 aa 
Translation table11 
GC content75% 
IMG OID641244292 
ProductATPase-like protein 
Protein accessionYP_001509698 
Protein GI158317190 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.384057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0313008 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGACG GTTTCCGCTT CCACAGGCTG CTTTCCGCCC CCCGCCCGCC GGACCCGTCG 
CAGCGGCCCG ACCCGGCGCC AGCCCAGCTG TTCGCCGCGC TCATCGCGGC GCACGCCGAC
CTGACGCAGG ACGCAGACGT CGACGCCGCT CCCCACCCGC GTGCCCCGGA CCGGGTTCCC
GCGGTGGCGG CCGGCACGGC TGCGGCCAGG GCGCCGGGTA CGGCTGCGGT CGCGGTCGCC
TGGATGCGGG TGCCCGGCGA CCCACATCTG CGGATTCTGG TTGGCGGGAA CCCGTACTTC
CCGGCGGCGG GCACGGTTGC GTCCGCCGCC GTGGAAGGAG TGACGGTGCC GGTGCTGTAC
CCGCCGGGCA GCACCGGCGT TCCGGTCGAC ACGGCCGACC TCGTGGCGGC ACTGGCCGGG
TTCCCGTGCT GGCTGCGGTG CGAAGGGACG GCCGACGCAC TCTGGGGCAC AGCCGCCGGC
AGCGGTGCCC ACAACGGCAA CGGCAGCGGC AGCGGAGCCC GGCACGGCGA CGGGCCGGCC
CGGCACGGCT CCTTCGACGA CTACGCCGCC CACCTGCCCG GCGCCTTCGC CTGGCTGGTA
ATCGGGCAGC CGGTGCCCAC CGGGGTCGTC GACACCGAAC GCGGCGACCT GCTACGGATG
ATCCCCTTCC TGCGCCAGCG GGAGCGAGAC GAACACGCCC GTCTCGAGCT GGAACGAGCC
GAGGCCCGCT ACCGCGAGCT GACCCGTTCG ACAGCGACCG GAATGTGGGA CGTGCGGGTG
CTCGTCGGTG CGCCGACGAG CACAGCGGCG CGACTGACCG CCGCGCTGCT GTGCGGCGCC
GGAGACCTCG ACGATCTTCC CTATCTCCTC ACCCCGGCGC GGGGCGGGCC GACCTCGTTG
GCCGAGGCCC TTGGCCCGAC CACGATGCCC CGGGCCGCCC AGGCGCCGCA GGCCACCCAG
GCGCCGCAGG CCATCGGCGC CGGCCCGGGC GGCACGTGGA CGGCTGGCCC GTCAGGTCCC
GCCGGGCGTC CCGGGTTCGG GCCCGGCCCC GGAGGACCCG GCCCCGGCCC CGGCCCCGGC
CCCGGCCGTG CCGGACCGGG AGGCATCGTC ACGGGGCCGG CGGGCAATCC CGGGACCGGG
CGTCCCGGCC CCGCCGTCGG CGGTCCGGCG ACAGCCGGCG TCGACGCGAG CTTTCCGTTC
CGTGCGGGCG GGGAGCTCCT CGCCGCGCTC GCGCGCCCGC CCAGGCGCGA ACTCCCGGGG
ATCACACTGG TGACCCCGCA CACCTTCGAC GTCACCCCTG AGGGAGTCAT CCAGGCAAGC
CGGCCGATGG CACCGGCGGC GACCTTGAAC GGGACGGCGC ACAGGCCCGC CGGCGAGGCG
GCGAGCCTCC CCGGGACGGC GACCTGCCTC CCTGGGACGG CGACCGTGCC CGACGGCGGG
CTCGCGCTCG GTCAGGTGCT CGACGCGGCC TGGTCGCCGG CGGGCGTCCT GCCCGTGCCG
CGCTCGACGC TGAACCGGCA CGGCTTCGTC AGCGGCGCGA CCGGCTCCGG CAAGTCGCAG
ACGGTCCGAT CACTGCTCGA GTCGCTGGCG CGGGCGGCGG ACCCGGTGCC GTGGCTGGTG
CTGGAACCAG CGAAGGCCGA GTACGCCCGG ATGGCCGGTC GGCTCGCGGG CCACAGCGAA
GTCATCGTGA TCAAACCGGG GCGCCTGGAC GCGCCACCGG CGGCCCTCAA CCCGCTGGAA
CCGGAACCGG GGTTCCCGCT CCAGAGCCAC ATCGACCTCG TCCGCGCACT GTTCCTGGCC
GCGTTCGAGG CGCACGAGCC GTTCCCCCAG GTACTCGCCC GGGCCCTGAC CGTCTGCTAC
ACCGACGCCG GCTGGGATCT GGTGGCCGGC CAGATGCGGC CCGAGCACCG GCCGCGGTTC
CGCGGCGACG AACCGGCCGT ACCCGTGCGG CCCCGCTATC CGACGCTCGG CGACCTGCAG
CGCACCGCCG CCCGGGTCGT CGAGAACATC GGCTACGGCG CCGAGGTGAC CGCGGACGTG
CGCGGTTTCG TCGACGTGCG GATCGGCTCG CTGCGGGAGG GGACGCCCGG ACGGTTCTTC
GAAGGCGGTC ACCCCCTCGA CGTCGGCGCG CTCCTGACCC GCAACGTGGT CTTCGAGCTC
GAGGACATCA CCAACGACCA GGACAAGGCC TTCCTCCTCG GCGCGGTGCT GATCCGCATC
GTCGAGCACC TGCGGGTCAA GTACGGCAGC GCGGGGACGG GCGAGCTGCG GCATGTCCTC
GTCGTCGAGG AGGCCCACCG GCTGCTCAGG AACGTCGAGA GCGGCCCGGC CGCAGCCGCC
GTCGAGCTGT TCGCGTCGCT GCTCGCCGAG ATCCGCGCCT ACGGCGAAGG CGTCCTCGTC
GTCGAACAGA TCCCGTCGAA GATCATTCCT GACGTGCTGA AGAACACCGC GCTGAAGGTG
ATGCACCGGC TGCCCTCGGC GGAGGACAGA GCGGCCGTCG GCGCCACGAT GAACCTGAGG
GAGGAGCAGT CCGAGCTGGT CGTCGCGCTG CCGCCGGGCG TGGCGGCGGT AGCGGTCGAC
GGGATGGACC GTCCGGTGCT CGTCCGGATG ACGCCGGGCT CGGATCGGGA GTCGATGGGC
GAAGCCAGCT ACGACGGCGC GCCACTCGGC GGACGACGGA GCATCCTGTG CGGCGCGGAC
TGCCGGCGTG ACGGCGCCTG CACGCTTCGC CAGATCAACG ACGCGGCTCA CCTGGCTTCC
GACCCCCGCC TGGTCGTCTG GGTCGAGACG GTGGCGGCGC ACCAGGTCAT CGGCTTGGCC
CATCTGCTCG GCCAGGGGCC ACCGAAACCG GTGGCGCCGC TGTCCGGTGA GCTGCTCGCC
CTGCCGACCC GGACCAGGGA CTGCCTGCTC GCGCAGGCGG TGGACCGGGC GGTCGCCGCA
CGGGCGACCC TGCTGCGCGG CCACGTCGAT CCGGACGACT TCGCCGGCCG GCTCGCGGGC
ACACTCGCCG GACTGCTCGC CGGCGCCGCG CAGGACGACG CTGACACCCG TCGCTGGACC
GCCGGCTCGT ACCGCTACCA GGACGTCCGC CGGACCCTGA ACGAGGCCCA GACCGGCCCC
AACGCCGGCC GCCCCCACCC GGAGACGCCC GACTGGCGCC GCCGTGGGCT GTCACTGGTG
AGCCAGACGG TGCCCGCGCA GTTCGCGGAG CTCCGCGACA GCGCCCACTA CGCCCCCGGC
GCGGACGCCG TGAGCCTCGG TGACCTGGCG ACCAGTGGCC TCGCCGTCGC CGTGAAGACG
CTTACCGGCG GGACGTCCAA GGAACACCTG CGCGCGGCGC TGCGCCTGGC GTGCGGCGAC
GCCGACCTGG AGGCGCTGCA CTTCCAGGTC GCCGACCTGC TGGAGAAGTA CGTCGTGGAC
GGAGGGCGCT GA
 
Protein sequence
MLDGFRFHRL LSAPRPPDPS QRPDPAPAQL FAALIAAHAD LTQDADVDAA PHPRAPDRVP 
AVAAGTAAAR APGTAAVAVA WMRVPGDPHL RILVGGNPYF PAAGTVASAA VEGVTVPVLY
PPGSTGVPVD TADLVAALAG FPCWLRCEGT ADALWGTAAG SGAHNGNGSG SGARHGDGPA
RHGSFDDYAA HLPGAFAWLV IGQPVPTGVV DTERGDLLRM IPFLRQRERD EHARLELERA
EARYRELTRS TATGMWDVRV LVGAPTSTAA RLTAALLCGA GDLDDLPYLL TPARGGPTSL
AEALGPTTMP RAAQAPQATQ APQAIGAGPG GTWTAGPSGP AGRPGFGPGP GGPGPGPGPG
PGRAGPGGIV TGPAGNPGTG RPGPAVGGPA TAGVDASFPF RAGGELLAAL ARPPRRELPG
ITLVTPHTFD VTPEGVIQAS RPMAPAATLN GTAHRPAGEA ASLPGTATCL PGTATVPDGG
LALGQVLDAA WSPAGVLPVP RSTLNRHGFV SGATGSGKSQ TVRSLLESLA RAADPVPWLV
LEPAKAEYAR MAGRLAGHSE VIVIKPGRLD APPAALNPLE PEPGFPLQSH IDLVRALFLA
AFEAHEPFPQ VLARALTVCY TDAGWDLVAG QMRPEHRPRF RGDEPAVPVR PRYPTLGDLQ
RTAARVVENI GYGAEVTADV RGFVDVRIGS LREGTPGRFF EGGHPLDVGA LLTRNVVFEL
EDITNDQDKA FLLGAVLIRI VEHLRVKYGS AGTGELRHVL VVEEAHRLLR NVESGPAAAA
VELFASLLAE IRAYGEGVLV VEQIPSKIIP DVLKNTALKV MHRLPSAEDR AAVGATMNLR
EEQSELVVAL PPGVAAVAVD GMDRPVLVRM TPGSDRESMG EASYDGAPLG GRRSILCGAD
CRRDGACTLR QINDAAHLAS DPRLVVWVET VAAHQVIGLA HLLGQGPPKP VAPLSGELLA
LPTRTRDCLL AQAVDRAVAA RATLLRGHVD PDDFAGRLAG TLAGLLAGAA QDDADTRRWT
AGSYRYQDVR RTLNEAQTGP NAGRPHPETP DWRRRGLSLV SQTVPAQFAE LRDSAHYAPG
ADAVSLGDLA TSGLAVAVKT LTGGTSKEHL RAALRLACGD ADLEALHFQV ADLLEKYVVD
GGR