Gene Sare_4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4014 
Symbol 
ID5707436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4566489 
End bp4568141 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content68% 
IMG OID641273439 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001538795 
Protein GI159039542 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000699734 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCGAGC TGACCATCTC GACGGAGGAG ATCCGCGGCG CGCTGGAGCG CTACGTCTCC 
TCCTACACCG CCGACGTCTC CCGCGAGGAG GTCGGCACCG TCGCTGACGC CGGTGACGGC
ATCGCCCACG TCGAGGGCCT GCCCTCGACC ATGACCAACG AGCTGCTCGA GTTCGAGGAC
GGCACGCTCG GCGTGGCGCT GAACCTCGAC GTTCGGGAGA TCGGTGTCGT CGTCCTCGGT
GACTTCGGCG GTATCGAGGA GGGGCAGCGG GTCAAGCGCA CCGGCCGGGT GCTTTCCGCA
CCGGTCGGCG ACGCCTTCCT CGGCCGCGTG GTCAACGCGC TCGGCCACCC GATCGACGGC
CTCGGCGACA TCGCGAACGA GGGCTTCCGG GAGCTGGAGC TCCAGGCTCC GAACGTGATG
GCCCGCAAGT CGGTTGACGA GCCGCTGCAG ACCGGCATCA AGGCAGTTGA CGCGATGACC
CCGATCGGTC GGGGCCAGCG GCAGTTGATC ATCGGTGACC GGAAGACCGG CAAGACCACC
GTCGCCCTGG ACACCATCCT CAACCAGCGG GACAACTGGC GCTCCGGCGA CCCGAAGAAG
CAGGTCCGCT GCATCTACGT CGCCGTCGGC CAGAAGGCTT CCACGATCGC CTCGATCAAG
GGCGTGCTCG AGGAGGCCGG CGCGATGGAA TACACCACCA TCGTGGCGTC CCCGGCATCC
GACCCGGCCG GCTTCAAGTA CCTCGCCCCG TACACCGGCT CGACGATCGG GCAGCACTGG
ATGTACGGCG GCAAGCATGT TCTCGTCGTC TTCGACGACC TGAGCAAGCA GGCCGAGGCG
TACCGGGCCG TCTCGCTGTT GCTGCGCCGT CCGCCGGGCC GTGAGGCGTA CCCAGGCGAC
GTCTTCTACC TGCACTCCCG CCTGCTGGAG CGCTGCGCGA AGCTCTCCGA CGAGATGGGC
GGCGGCTCGA TGACCGGTCT GCCGATCATC GAGACCAAGG CGAACGACAT CTCGGCGTTC
ATCCCGACCA ACGTCATCTC GATCACCGAC GGTCAGATCT TCCTGGAGAC CGACCTGTTC
AACCAGGGCG TCCGGCCGGC CATCAACGTC GGCACCTCGG TCTCCCGGGT AGGCGGCGCC
GCGCAGGTGA AGCCGATGAA GAAGGTCGCT GGTTCGCTGC GGCTGAACCT GGCCCAGTAC
CGTGAGCTGG AGGCGTTCGC CGCCTTCGCC TCGGACCTGG ACAAGGCGTC CCGGGCCCAG
CTGGAGCGGG GTTCCCGCCT GGTCGAGCTG CTCAAGCAGC CGAACTACTC ACCGTTCCCG
GTGGAGGAGC AGGTCGTCTC GGTCTGGGCC GGTACCGAGG GCAGGCTGGA TGACATCCCG
GTCGGCGAGA TCCGTCGCTT CGAGTCCGAG TTCCTGCAGT ACCTGCGGCA CAAGCACGAG
GGGGTCCTGG CGGGGATCGC GGCCGGCACG TGGGGAGACG AGATCATCGC TTCCCTCGAC
GCGGCGATCA GCGACTTCAA GAACCTCTTC CTGGGCAAGG AGGACGAGCA GCGGGTCAAC
GAGCCGCCTG CGAAGCCGCT GGCCGGTGAG GAGAACCGCG AGACGGTGAC CCGGTTCCGC
GACGGCACGA CCGACCGTCC GGCCGAGAGC TGA
 
Protein sequence
MAELTISTEE IRGALERYVS SYTADVSREE VGTVADAGDG IAHVEGLPST MTNELLEFED 
GTLGVALNLD VREIGVVVLG DFGGIEEGQR VKRTGRVLSA PVGDAFLGRV VNALGHPIDG
LGDIANEGFR ELELQAPNVM ARKSVDEPLQ TGIKAVDAMT PIGRGQRQLI IGDRKTGKTT
VALDTILNQR DNWRSGDPKK QVRCIYVAVG QKASTIASIK GVLEEAGAME YTTIVASPAS
DPAGFKYLAP YTGSTIGQHW MYGGKHVLVV FDDLSKQAEA YRAVSLLLRR PPGREAYPGD
VFYLHSRLLE RCAKLSDEMG GGSMTGLPII ETKANDISAF IPTNVISITD GQIFLETDLF
NQGVRPAINV GTSVSRVGGA AQVKPMKKVA GSLRLNLAQY RELEAFAAFA SDLDKASRAQ
LERGSRLVEL LKQPNYSPFP VEEQVVSVWA GTEGRLDDIP VGEIRRFESE FLQYLRHKHE
GVLAGIAAGT WGDEIIASLD AAISDFKNLF LGKEDEQRVN EPPAKPLAGE ENRETVTRFR
DGTTDRPAES