Gene Sare_4133 details

Gene Information

Locus tag: Sare_4133
Symbol:
ID: 5705578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4696820
End bp: 4697671
Gene Length: 852 bp
Protein Length: 283 aa
Translation table: 11
GC content: 71%
IMG OID: 641273561
Product: hypothetical protein
Protein accession: YP_001538914
Protein GI: 159039661
COG category: [R] General function prediction only
COG ID: [COG1611] Predicted Rossmann fold nucleotide-binding protein
TIGRFAM ID: [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
            [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2
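
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4696820
end_bp = 4697671

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 852
print(protein_length)  # 283
```

Both results agree with the Gene Length (852 bp) and Protein Length (283 aa) fields.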


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.242281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.352812
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGA GCAACGGGCA GGAGGCGGAC GGTGTCCTCC GGCGGGATCG ACACCGGGGC 
GCCGGGCTAC TGCGTCGGTC GACGGTCGGC GGGAGTACCG CCGACCAGCG GCTGCTCGAC
TCCCCCCAGC GCAGCGACTG GAAGACCCGG GACGCCTGGC GGGCGCTGCG GATCCTCTCC
GAGTTCGTCG AGGGCTTCGA CACCCTGTCC GACCTGCCGT CAGCGGTCAG CGTCTTCGGT
TCGGCGCGGA GCCGACCGGA CAGCCCGGAG TGCCGGATGG CCGAGGCGCT GGGCGGTGCA
CTGGCCCGTG CCGGATACGC GGTCATCACC GGCGGGGGCC CGGGGGTGAT GGCGGCGGCG
AACCGGGGAA CCAGGGAAGC CGGCGGGCTC TCCGTCGGCC TGGGCATCGA GCTCCCCTTC
GAGCAGGGCA TCAACGACTG GGTCGATCTG GCGATCGAGT TCCGGTACTT CTTCGCGCGA
AAGACCATGT TCGTCAAGTA CGCCCAGGCG TTCGTGGTGC TCCCCGGGGG CTTCGGCACG
ATGGACGAGC TGTTCGAGGC CCTCACCCTG GTGCAGACCG GCAAGGTGAC CCGGTTCCCG
GTGGTGCTGA TGGGTGTCGA CTACTGGCGC GGCCTACTCG ACTGGCTGCG GGACACGATG
GTGGCCGACG GCAAGATCGG GGCGATCGAT CTCGACCTGA TCTGCCTCAC CGACGACGTG
AACACGGCGG TGCGGCACAT CGTCGAGGCC GAGGCGCTGC TCTCCGCCGA CCAGGAGGCC
GTCCGTGAGG AGGCGGTCGC TGTCGCCGCC GCCGAACGGC GGGCCGCCGC CGACGAGGGG
GGTCGGGGCT GA
 
Protein sequence
MSESNGQEAD GVLRRDRHRG AGLLRRSTVG GSTADQRLLD SPQRSDWKTR DAWRALRILS 
EFVEGFDTLS DLPSAVSVFG SARSRPDSPE CRMAEALGGA LARAGYAVIT GGGPGVMAAA
NRGTREAGGL SVGLGIELPF EQGINDWVDL AIEFRYFFAR KTMFVKYAQA FVVLPGGFGT
MDELFEALTL VQTGKVTRFP VVLMGVDYWR GLLDWLRDTM VADGKIGAID LDLICLTDDV
NTAVRHIVEA EALLSADQEA VREEAVAVAA AERRAAADEG GRG
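
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a hedged sketch, not the tool that produced this record: the helper names `clean`, `gc_content`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
# Codon table built in the conventional TCAG order. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed above (60 bases).
first_row = "ATGAGCGAGA GCAACGGGCA GGAGGCGGAC GGTGTCCTCC GGCGGGATCG ACACCGGGGC"
print(translate(first_row))  # MSESNGQEADGVLRRDRHRG
print(f"{gc_content(first_row):.0%}")
```

The translated residues match the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions allows the complete translation and the reported GC content to be checked in the same way.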