Gene Sare_3524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3524 
Symbol 
ID5704652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4065440 
End bp4067050 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content65% 
IMG OID641272951 
Productcytochrome b/b6 domain-containing protein 
Protein accessionYP_001538317 
Protein GI159039064 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00291671 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGCGGC GAAAGTTTGA GATGGCAGCC GTCCCGGGCA ACGTGGCCCG AGGGGTGGAC 
GACCGCTTCC AGGTGGCTAC CCCGCTTCGG CGGCTGCTGA ACAAGGTCTT CCCGGACCAC
TGGTCCTTCC TGCTGGGTGA GATCGCGCTT TTCTCGTTCA TCATCCTGCT TCTCACCGGG
GTCTTCCTGA CCTTCTTCTT CGAGCCGGCG ATGACCGAGG TGATCTACAA CGGCAGTTAC
GCCCCGCTGC GGGGCACGCC GATGTCCGCC GCATACGCCT CCAGTCTGGA CATCTCGTTC
GACATTCGAG GTGGCCTGAT CATGCGGCAG ATGCACCACT GGTCGGCCCT GCTGTTCATG
GCCGCGATCG TGGTGCACAT GATGCGGGTC TTCTTCACCG GTGCGTTCCG TAAGCCGCGG
GAGGCCAACT GGATCATCGG CTCGCTGCTG TTCTGGGTCG GCTTCCTGGC CGGTTTCACC
GGCTACTCCC TGCCGGACGA CGGACTCTCC GGCACCGGGC TGCGGATCGC CTCCGGGATC
ATGCTGTCGA TCCCGGTGAT CGGCTCCTGG CTGACCTCGT CGATCTTCAA CGGCGAGTTC
CCGGGCACGA TCATCATCAG CCGGTTCTAC ATCGCGCACG TGCTGCTCAT CCCCGGCCTG
CTGCTCGCTC TGATCGGTGC CCACCTGGGG CTGGTCTTCA AGCAGAAGCA CACCCAGTGG
CCCGGCCCCG GCCGGACCAA CGACAACGTG GTGGGCGAGC GGATGTTCCC GCGGTACGCG
TTGAAGCAGG GCGGCTTCTT CATGGTCGTC TTCGGTGTGA TCGCGCTGAT GGGTGGCCTG
CTCCAGATCA ACCCGATCTG GCTGTTCGGG CCGTACGAGG CGTGGGTGGT CTCGGCTGCC
AGCCAGCCCG ACTGGTACGT CATGTTCCTC GACGGCTCCA CCCGGCTGAT GCCCGCCTGG
GAGATATACA TACCGATCGG CGACGGGTAC GTCATCCCGC CGCTGTTCTG GCCAACGGTC
GTCCTGCCCG GGCTACTGGT GGGAATGTCG GTCCTCTATC CGTTCATCGA GGCCCGACGG
CTCAAGGACC ACCGCAGCCA CAACCTGCTC CAGCGGCCGC GGGACGTTCC GGCGCGTACC
GGGCTGGGTG CCATGGCGGT CACCTTCTAC CTGGTGTTGG CCCTGTCCGG GGCGAACGAC
GTCATCGCCG ACAAGTTCAA CATCAGCCTG AACGCGATGA CCTGGGGCGG CCGGATCGGT
CTGCTGCTTC TCCCACCGCT GGCGTACTAC GTCACCTACC GGATCTGCCT GGGCCTCCAG
CAGCACGATC GGGAGGTTCT GGCCCACGGT GTCGAGACCG GCATCATCCG GCGTCTGCCG
GATGGCCGAT TCGTCGAGGT CCACCAGCCG CTCCACGCCG AAGACGGCGA ACTGGAGTAC
GTGGGCTGGG TGGTGCCGAA GAAGATGAAT CGACTCGGCG CACTCGGCCC GGCGATCCGC
GGCTTCTTCT ACCCGATCGA GAAGCCAGCC GAGGCACCGG TGTCGCCGGG GCACCCGCCG
GTCGAGTCGG ACGCCGAGCA GTCGGAGATC ACCAGCGGCG AACGCCGTTG A
 
Protein sequence
MKRRKFEMAA VPGNVARGVD DRFQVATPLR RLLNKVFPDH WSFLLGEIAL FSFIILLLTG 
VFLTFFFEPA MTEVIYNGSY APLRGTPMSA AYASSLDISF DIRGGLIMRQ MHHWSALLFM
AAIVVHMMRV FFTGAFRKPR EANWIIGSLL FWVGFLAGFT GYSLPDDGLS GTGLRIASGI
MLSIPVIGSW LTSSIFNGEF PGTIIISRFY IAHVLLIPGL LLALIGAHLG LVFKQKHTQW
PGPGRTNDNV VGERMFPRYA LKQGGFFMVV FGVIALMGGL LQINPIWLFG PYEAWVVSAA
SQPDWYVMFL DGSTRLMPAW EIYIPIGDGY VIPPLFWPTV VLPGLLVGMS VLYPFIEARR
LKDHRSHNLL QRPRDVPART GLGAMAVTFY LVLALSGAND VIADKFNISL NAMTWGGRIG
LLLLPPLAYY VTYRICLGLQ QHDREVLAHG VETGIIRRLP DGRFVEVHQP LHAEDGELEY
VGWVVPKKMN RLGALGPAIR GFFYPIEKPA EAPVSPGHPP VESDAEQSEI TSGERR