Gene Sare_3139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3139 
Symbol 
ID5706349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3572128 
End bp3573858 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content68% 
IMG OID641272571 
Productnitrate reductase, beta subunit 
Protein accessionYP_001537938 
Protein GI159038685 
COG category[C] Energy production and conversion 
COG ID[COG1140] Nitrate reductase beta subunit 
TIGRFAM ID[TIGR01660] nitrate reductase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.250265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00103191 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGGTGA TGGCGCAGAT GGCAATGGTG ATGAACCTCG ACAAGTGCAT CGGCTGTCAC 
ACCTGCTCGG TGACCTGCAA GCAGGCGTGG ACCAACCGAT CCGGGGTCGA GTACGTCTGG
TTCAACAACG TGGAGACCCG CCCCGGTCAG GGCTACCCCC GTACCTACGA GGACCAGCAG
CGGTGGCAGG GCGGGTGGGT GCGCACCCGG TCCGGGCGGC TCAAGCCCCG CTCGGGCGGA
CGGCTGAAGA AGATGTTCAC CGTTTTCGCC AACCCGAAAC TGCCCTCCAT GCGGGACTAC
TACGAGCCCT GGACGTACGA CTACGAGCAC CTGATCAGCG CGCCGTCCGG CGACGACATC
CCGGTCGCCC GCCCGAAGTC CCTGATCACC GGCCAGGACA CGAAGATCAC CTGGAGTGCG
AACTGGGACG ACTCCCTGGC CGGGGGTAAC GAGGTCACGG CGGGTGATCC GGTGTTGGCA
AAGGTGTCCG AGCAGGTCCG GCAGGAGTAC GCGAAGACCT TCATGTTCTT CCTGCCCCGC
ATCTGCGAAC ACTGCCTCAA TCCGTCCTGC GCCGCGTCCT GCCCCTCGGG CGCGATCTAC
AAGCGCAGCG AGGACGGCAT CGTGCTGGTC GATCAGGACC GCTGCCGGGG CTGGCGGATG
TGCATCACCG GATGCCCATA CAAGAAGGTG TACTTCAACC ACCGCACCGG CAAGGCGGAG
AAGTGCACGT TCTGCTTTCC ACGTATCGAG ATCGGCCAGC CGACCATCTG CTCCGAAACG
TGCGTCGGCC GACTGCGGTA CCTCGGCCTC ATGCTCTACG ACGGCGACAC GGTGGCCGAC
GCCGCCGCCA CCGAAGCCGA ACAGGACCTC TACGCGGCGC AGCGCTCGGT GTTCCTTGAC
CCCCACGACC CCGCCGTCGT GGCCGCCGCG CGGGCGGGCG GTATCCCCGA CGACTGGATC
GACGCCGCGC AACAGTCCCC GATCTGGGAC CTGATCATGA AGTATGAGGT GGCGCTGCCG
TTGCACCCGG AATATCGGAC CATGCCCATG GTCTGGTACA TCCCGCCCCT GTCCCCCGTG
GTGGACGTGC TGCGCGACAC CGGTCACGAC GGCGAGCAGG CCGGCAACCT CTTCGGCGCG
ATCGACGCCC TCCGTATCCC CGTCGACTAC CTCGCGGAAC TGTTCACCGC GGGCGACCCA
CAACCCGTGC GGGCGGTACT CGACCGGCTC GCCGCCATGC GTGCCTACCA GCGCCGCATC
AATCTTGGCG AGGCACCGGA CGAGACCATT CCCGCCGCGG TCGGCATGAC CAGCGACGAC
ATGGACGACA TGTACCGTCT CCTGGCTGTC GCCAAATACG AGCAGCGCTA CGTCATCCCC
GCCGCCCACG CCGAAGACGC CCACCGCCTC GAAAAGATCG CCACCGAGTG CGCCCTGGAC
TACGAAGGCG GCCCCGGCAT GGGCGGCGGT GGACCCTACG GGCAGGGCCC CTTCGGGGAG
TCCTCCGGCA CGCCCGTACC GATCCAGGTG GAGACCTTCG ACGCGCAGCG CAACCGGCAG
CGAGCCGACC TCTTCATCGA CCAGGGCGAC GCGGCACAGC GGGCCCGGCT GCTCGACGTG
GACAGCGAGG GCGACCGGAC CAACCTGGCC CACCCGAGGA AGGACCCGAC CGGCGAGGGC
GGCGACTCCA CCGGCCCTGG CGTCATCGAC ATCAACCGGG ACCAGCCGTG A
 
Protein sequence
MRVMAQMAMV MNLDKCIGCH TCSVTCKQAW TNRSGVEYVW FNNVETRPGQ GYPRTYEDQQ 
RWQGGWVRTR SGRLKPRSGG RLKKMFTVFA NPKLPSMRDY YEPWTYDYEH LISAPSGDDI
PVARPKSLIT GQDTKITWSA NWDDSLAGGN EVTAGDPVLA KVSEQVRQEY AKTFMFFLPR
ICEHCLNPSC AASCPSGAIY KRSEDGIVLV DQDRCRGWRM CITGCPYKKV YFNHRTGKAE
KCTFCFPRIE IGQPTICSET CVGRLRYLGL MLYDGDTVAD AAATEAEQDL YAAQRSVFLD
PHDPAVVAAA RAGGIPDDWI DAAQQSPIWD LIMKYEVALP LHPEYRTMPM VWYIPPLSPV
VDVLRDTGHD GEQAGNLFGA IDALRIPVDY LAELFTAGDP QPVRAVLDRL AAMRAYQRRI
NLGEAPDETI PAAVGMTSDD MDDMYRLLAV AKYEQRYVIP AAHAEDAHRL EKIATECALD
YEGGPGMGGG GPYGQGPFGE SSGTPVPIQV ETFDAQRNRQ RADLFIDQGD AAQRARLLDV
DSEGDRTNLA HPRKDPTGEG GDSTGPGVID INRDQP