Gene Sare_4051 details


Gene Information

Locus tag: Sare_4051
Symbol:
ID: 5706314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4608404
End bp: 4609774
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 71%
IMG OID: 641273477
Product: pyridine nucleotide-disulphide oxidoreductase dimerisation region
Protein accession: YP_001538832
Protein GI: 159039579
COG category: [R] General function prediction only
COG ID: [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases
TIGRFAM ID:
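
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention, but the listed values agree under it). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4608404
end_bp = 4609774
gene_length_bp = 1371
protein_length_aa = 456

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop codon,
# which is not counted in the protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, span_bp % 3, coding_codons)  # prints: 1371 0 456
```

Under this assumption the span equals the listed gene length, it divides evenly into codons, and the codon count minus the stop codon equals the listed protein length.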


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.738212
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0461677
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGAAC GGTTGATCGT CATCGGCGGT GACGCCGCCG GGATGGCGGC AGCATCGCAG 
GCCCGCCGCC GCCGCAGCCC GGACGACCTG AAGATCGTCG CCTTCGAACG AGGCCGATTC
ACGTCGTACT CGGCGTGCGG AATCCCGTAC TGGATCAGCG GCCTGGTGCC GCAACGCGAC
CAGCTCATCG CCCGGGACCC GGCGACCTTC CGGGACCGGT TCGCCATCGA CGTCCGGCTG
CGCCACGAGG TCACCAGGAT CGACCTCGAC CGACGCGAGG TGATCGCCCG AGACCTGACC
GGCGGCGACG AGGTCCGCGA GCGATTCGAC ACCCTGGTGT ACGCGACCGG AGCACACCCG
GTCCGGCCGG CCTGGGCGCG CAGCGACGTG CTCGGCGTGT TCGGCGTGCA GACCCTCGAG
GACGGCGCTG CCCTCCGGGA CTGGCTGGAC GGGGATACCC GGCCCCGCCA GGCGGTCGTG
GTCGGTGGGG GCTACATCGG GGTCGAGATG GCCGAAGCGC TGATCCAGCG CGGGCTGGAG
GTCACCCTGG TCGAGAAGGC CCAGCAACCG ATGTCGACGG TGGACCCGGA CATGGCCGAG
CTGGTCAACG ACGCCATGCG GGGGGTAGGG GTGCGAATCC GCACCGGCCT GGCGGTGACC
GGCCTCCAGG AACGGGACGG ACGAGTATCC GCGGTGCTCA CCTCCGATGG GCCGATCCCC
GCCGACCTGG TCGTGCTCGG CCTCGGCGTC CGCCCGAACG TTGAGCTCGC CGAGGCCGCT
GGGCTCCCGG TCGGACCGAG CGGGGCGTTG CGGGTGGACC GGCGCATGCG GGTGCCGGGA
GCGGACGACG TGTGGGCCGC CGGTGACTGC GTGGAGTGCC TGCACCGAGT CAGTGGAATG
CCCGTACACA TACCGCTGGG AACGCACGCC AACAAGCAGG GCCGGGTCGC CGGGATCAAC
ATCGGCGGCG GCTACGCGAC TTTTCCCGGG GTGATCGGCA CCGCCGTGAT CAAGGTGTGT
GACCTGGAGG TCGGCCGCAC CGGGCTACGG GAGCAGGACG CCGCGGCGGC GGGTTTCGAG
TTCGTCTCGA TCATCACCGA GTCCACCAAC CGGGCCGGGT ACTTCCCCGG CTCACGTCCG
ATGACGGTCA AACTGATCGC CGAACGGCCC ACCGGGCGGC TGCTCGGCGC GCAGATCGTC
GGCTGGTCGG AGGCCGCGAA GCGGATCGAC GCCCTGGCCG TGGCGCTGTG GAACGGCATG
ACGGTGGACG ATATGACCGC CCTCGACCTC GGGTACGCCC CGCCGTACTC CCCGGTCTGG
GACCCAGTGT TGATCGCTGC CCGCAAGGCG GTCGACGCCC TCGGCCGGTG A
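
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it, such as the GC content reported in the record. The following is a minimal sketch of that calculation; `gc_fraction` is a hypothetical helper name, and only the first displayed line is used here as sample input.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First displayed line of the gene sequence, spacing kept as shown.
first_line = "GTGGCAGAAC GGTTGATCGT CATCGGCGGT GACGCCGCCG GGATGGCGGC AGCATCGCAG"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1371 bp sequence instead of the single line gives the value to compare against the GC content field.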
 
Protein sequence
MAERLIVIGG DAAGMAAASQ ARRRRSPDDL KIVAFERGRF TSYSACGIPY WISGLVPQRD 
QLIARDPATF RDRFAIDVRL RHEVTRIDLD RREVIARDLT GGDEVRERFD TLVYATGAHP
VRPAWARSDV LGVFGVQTLE DGAALRDWLD GDTRPRQAVV VGGGYIGVEM AEALIQRGLE
VTLVEKAQQP MSTVDPDMAE LVNDAMRGVG VRIRTGLAVT GLQERDGRVS AVLTSDGPIP
ADLVVLGLGV RPNVELAEAA GLPVGPSGAL RVDRRMRVPG ADDVWAAGDC VECLHRVSGM
PVHIPLGTHA NKQGRVAGIN IGGGYATFPG VIGTAVIKVC DLEVGRTGLR EQDAAAAGFE
FVSIITESTN RAGYFPGSRP MTVKLIAERP TGRLLGAQIV GWSEAAKRID ALAVALWNGM
TVDDMTALDL GYAPPYSPVW DPVLIAARKA VDALGR
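
The protein sequence can be reproduced from the gene sequence using the translation table named in the record (table 11). The following is a minimal sketch, assuming the input is a complete CDS that begins at its start codon and is given on the coding strand. Table 11 assigns the same amino acids to internal codons as the standard code and differs in which codons may initiate translation, so the sketch uses the standard assignments and reads the first codon as methionine. `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, ignoring spaces and line breaks in the input."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)]
    # Assumption: the first codon is a valid initiator (here GTG), read as Met.
    if protein:
        protein[0] = "M"
    # Drop the terminal stop codon if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First three displayed blocks of the gene sequence.
print(translate_cds("GTGGCAGAAC GGTTGATCGT CATCGGCGGT"))  # prints: MAERLIVIGG
```

The output matches the first ten residues of the protein sequence above. The first codon of the gene is GTG, which would be read as valine at an internal position.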