Gene Franean1_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1024 
Symbol 
ID5669438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1204613 
End bp1206052 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content68% 
IMG OID641239953 
ProductF0F1 ATP synthase subunit beta 
Protein accessionYP_001505386 
Protein GI158312878 
COG category[C] Energy production and conversion 
COG ID[COG0055] F0F1-type ATP synthase, beta subunit 
TIGRFAM ID[TIGR01039] ATP synthase, F1 beta subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCA CCACCAGCTC GCCGACCACG GCGGAGGGCC GGACTCCGGG GATCGGCCGA 
GTCGCCCGAG TCATCGGACC GGTCGTCGAC GTCGAGTTCG CCCCCGACGA GCTTCCCGAG
ATCTACACCG CGCTGCACGT CGACCGCACG ATCGACGGCG AGACCGCGGT CCTGACCCTT
GAGGTCGCGC AGCACATCGG CGACAACACC ATCCGCGCCA TCTCCATGCA GCAGACCGAC
GGCCTCGTGC GCGGGGCTCC GGTCCGCGAC ACCGGCGCGC CGATCTCCGT CCCGGTCGGG
AACGCCACCA AGGGCCACGT GTTCAACGTG CTCGGCAACC CGCTCGACGT GGACAAGGTC
GACGCCGAGA CCTACTGGCC GATCCACCGC TCGGCGCCGG CCTTCGACCA GCTCGAGTCG
AAGACGGAGA TGTTCACCAC CGGCATCAAG GTCATCGACC TGCTCGCCCC GTACGTGCGA
GGCGGCAAGA TCGGTCTGAT GGGCGGCGCC GGCGTCGGCA AGACCGTCAT CATCCAGGAG
ATGATCCGCC GGGTCGCCAA GGAGTTCGGT GGCGTGTCGG TGTTCGCCGG CGTCGGCGAG
CGCACCCGCG AGGGCAACGA CCTGTTCCTG GAGATGACCG AGGCCGGCGT CATCGAGGAC
ACCGCGCTCG TCTTCGGCCA GATGGACGAG CCGCCCGGCA CCCGGCTCCG GGTCGCCCTC
GGCGCGCTCA CCATGGCCGA GTACTTCCGG GATGTGCAGA AGCAGGACGT GCTCCTGTTC
ATCGACAACA TCTTCCGGTT CACCCAGGCC GGCTCCGAGG TGTCGACGCT GCTCGGCCGG
ATGCCCAGCG CCGTCGGCTA CCAGCCGACG CTGGCTGACG AGATGGGCGC CCTGCAGGAG
CGGATCACCT CGACCCGCGG TCACTCGATC ACCTCGCTGC AGGCGATCTA CGTCCCCGCG
GACGACCTGA CCGACCCGGC CCCGGCGACG ACGTTCACCC ACCTCGACGC CAACACGGTG
CTCGACCGGG CGATCTCCGA CCTCGGCATC TACCCGGCCG TGAGCCCGCT GGACTCGAAC
TCCCGGATCC TTGACGCCCG GTACATCGGG CAGGAGCACT ACGACACCGC CCGCGAGGTG
CAGCGGATCC TGCAGCGCTA CAAGGACCTG CAGGACATCA TCGCCATCCT CGGCATCGAC
GAGCTCTCCG AAGAGGACAA GATCCTCGTC AACCGGGCCC GCCGGATCCA GCGGTTCCTG
TCCCAGCCGT TCTTCGTCGC CGAGCAGTTC ACTGGCATCC CCGGCAAGTT CGTCCCGCTC
GACGAGACGA TCGACTCGTT CCGCCGGCTC ACCCAGGGTG ACTACGACCA CCTGCCCGAG
CAGGCGTTCT TCATGTGCGG CGGGATCGAG GACGCCGAGA AGAACGCGGA GAACCTGTAA
 
Protein sequence
MTVTTSSPTT AEGRTPGIGR VARVIGPVVD VEFAPDELPE IYTALHVDRT IDGETAVLTL 
EVAQHIGDNT IRAISMQQTD GLVRGAPVRD TGAPISVPVG NATKGHVFNV LGNPLDVDKV
DAETYWPIHR SAPAFDQLES KTEMFTTGIK VIDLLAPYVR GGKIGLMGGA GVGKTVIIQE
MIRRVAKEFG GVSVFAGVGE RTREGNDLFL EMTEAGVIED TALVFGQMDE PPGTRLRVAL
GALTMAEYFR DVQKQDVLLF IDNIFRFTQA GSEVSTLLGR MPSAVGYQPT LADEMGALQE
RITSTRGHSI TSLQAIYVPA DDLTDPAPAT TFTHLDANTV LDRAISDLGI YPAVSPLDSN
SRILDARYIG QEHYDTAREV QRILQRYKDL QDIIAILGID ELSEEDKILV NRARRIQRFL
SQPFFVAEQF TGIPGKFVPL DETIDSFRRL TQGDYDHLPE QAFFMCGGIE DAEKNAENL