Gene Francci3_3709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3709 
Symbol 
ID3903810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4442416 
End bp4444074 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content68% 
IMG OID637881035 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_482790 
Protein GI86742390 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAGC TCTCCATCCG GCCGGAGGAG ATCCGCGACG CCCTGCGGGA GTACGTCGAC 
TCTTTCCAGG CCACCTCAGC CGGCCGTGAG GAGGTCGGCC GGGTCGTCGT CACCGGTGAC
GGCATCGCCC GGGTCGAGGG TCTCCCGCAC ACGATGACCA ACGAGCTGCT GGAGTTCCAC
GGCGGTGTGC TGGGTCTCGC GCTGAACCTC GAGGTCGGCG AGATCGGAAC CGTCATCCTG
GGTGAATCCG AGAACATCGA GGAGGGGCAG GAGGTCCGCC GCACCGGCCA GATCCTCTCC
GCTCCCGTCG GTGACGGCTT CCTCGGCCGG GTCGTCGACC CGCTCGGGCG GCCGATCGAC
GGCCTCGGGG AGATCGTGGC CGAGGGATCC CGGGAGCTGG AGCTGCAGGC GCCGACGGTC
GTCCAGCGCC AGCCGGTGAA GGAACCCCTG CAGACCGGTA TCAAGGCGAT CGACGCGATG
ACGGCCATCG GCCGTGGTCA GCGCCAGCTG ATCATCGGCG ACCGGCAGAC CGGCAAGACG
ACGGTGGCCA TCGACGCGAT CATCAACCAG CGGGACAACT GGACCTCTGG TGATCCGTCG
AAGCAGGTCA AGTGCGTCTA CGTGGCGATC GGCCAGAAGA AGTCGACAAT CCGCGAGGTG
GTGAACTCTC TGGAGGAGGC CGGCGCGCTG GCCTACACCA CGATCGTGGC GGCGCCGGCT
GACGAGCCGG CGGGCTTCAA GTACATCGCG CCCTACACCG GCTCGGCGAT CGCCCAGCAC
TGGATGTACA ACGGCCAGCA CGCGCTGATC GTCTTTGACG ACCTGACCAA GCAGGCCGAG
GCCTACCGCG CGATCTCGCT GCTGCTGCGC CGTCCGCCGG GCCGCGAGGC CTACCCGGGT
GACGTCTTCT ACCTGCACTC CCGCCTGCTG GAGCGCTGCG CCAAGCTCTC CGACGAGCTC
GGCGGCGGCT CGCTGACCGG ACTGCCGATC ATCGAGACGA AGGCCAACGA CATCTCGGCC
TACATCCCGA CGAACGTCAT CTCGATCACC GACGGGCAGG TCTTCCTGGA GTCCGACCTG
TTCAACCAGG GTGTCCGCCC GGCCATCAAC GTCGGCACCT CGGTGTCCCG GGTCGGCGGT
AGCGCGCAGG TTAAGGCGAT GAAGTCGGTC GCCGGCCGCC TGCGGCTGGA CCTCGCCCAG
TACCGCGAGC TCGAGGCGTT CTCGGCCTTC GGTTCCGACC TGGACAAGGC CTCGCGGGAC
CAGCTCGCGC GCGGCGCCCG CCTGGTCGAG CTGCTCAAGC AGCCGCAGGG CCAGCCCTTC
CCGGTCGAGC GCCAGGTCGT GTCGATCTGG GCCGGAACCA CCGGCAAGCT CGACGATGTG
CCCGTGGCGG ACATCCGTCG CTTCGAGTCG GAGTTCCTGG ACTTCGTCGG CCGGTCTTAC
CCGGGGGTGT ACGACGCGAT CGTGACCACC GGCAAGCTCA GCGACGACAC GATCGCCATG
CTGGAGTCGG CCGTCGCGGA GTTCAAGAAG CAGTTCACGC TGTCCGACGG CAAGCCGTTG
GTCAACGAGC CGGCGCCGAG CCCGCTCGAC CCGGGGCTGG TGCGGCAGGA GTCGATCCCG
GTGCACCGGC CCGCGGCGCG CAAAGACGAC GAGGGCTGA
 
Protein sequence
MTELSIRPEE IRDALREYVD SFQATSAGRE EVGRVVVTGD GIARVEGLPH TMTNELLEFH 
GGVLGLALNL EVGEIGTVIL GESENIEEGQ EVRRTGQILS APVGDGFLGR VVDPLGRPID
GLGEIVAEGS RELELQAPTV VQRQPVKEPL QTGIKAIDAM TAIGRGQRQL IIGDRQTGKT
TVAIDAIINQ RDNWTSGDPS KQVKCVYVAI GQKKSTIREV VNSLEEAGAL AYTTIVAAPA
DEPAGFKYIA PYTGSAIAQH WMYNGQHALI VFDDLTKQAE AYRAISLLLR RPPGREAYPG
DVFYLHSRLL ERCAKLSDEL GGGSLTGLPI IETKANDISA YIPTNVISIT DGQVFLESDL
FNQGVRPAIN VGTSVSRVGG SAQVKAMKSV AGRLRLDLAQ YRELEAFSAF GSDLDKASRD
QLARGARLVE LLKQPQGQPF PVERQVVSIW AGTTGKLDDV PVADIRRFES EFLDFVGRSY
PGVYDAIVTT GKLSDDTIAM LESAVAEFKK QFTLSDGKPL VNEPAPSPLD PGLVRQESIP
VHRPAARKDD EG