Gene Sare_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3887 
Symbol 
ID5706384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4428476 
End bp4429705 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content71% 
IMG OID641273312 
Producthypothetical protein 
Protein accessionYP_001538669 
Protein GI159039416 
COG category[R] General function prediction only 
COG ID[COG4552] Predicted acetyltransferase involved in intracellular survival and related acetyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.305747 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTCG TACCGAAATT CGGCGTCGCC CGTACCGAGG ATCTCGATGA GTTGGGTGAG 
CTGCTCGGCA ACCTGTTCCA TGGTGGGTTG GACGCCGAGT CGTGGGCCGT CGAACGGAGC
GTTCTGGAGC CAGAACGCAC TCTGATCGTG CGTGACGGCG CCGAGTTGGT GGCGACCGCC
ACCGCCTTCA CCCGCGAGCT GACCGTGCCG GGGGCACGCC TGCCCGCAGC GCACGTCAGC
ATGGTCGGAG TGGCGCCGAC ACACCGCCGC CGTGGCCTGC TCACCGAGCT GATGCGACGA
CAGCTCACTG GCGTACGCGA CGCCGGCCAG GAGCCGGTCG CGCTGCTGTG GGCCAGTGAA
GGGCAGATCT ACCCACGCTT CGGCTACGGC CTGGCGTGCC ACCGGTTGTT GGTGGATTGC
GATACAACCG AGCTGCGCCT GCCGGATCCG ACCGCCGCCG AGGGCCGACT CCGCCTGGCC
AAGCCGGATC GGTGCCGGGC CGAGCTGGCC AGGGTGTACG ACGGGGTTCG GCCCGACCGG
CCCGGCTGGT CCAGCCGCAA CGAGGCGTGG TGGAACTTCG TCCTGTCCGA TCCGGTCGCT
CGCCGTGGCG GTGCCACCGA ACGTCGAGTT CTGCTGCACG AGGGCCCCGA CGGCGTCAAC
GGGTACGCAC TCTTCCGCAC CCGGAACGAC TGGGACATGG TCGGGCCGAG GGCCATCACC
ATCGTTGACG AGGTGGTGAC CGACGATCAG GCCGGCTACC TGGCGATGTG GCGCCTGCTG
CTCACCCTCG ACCTGACCCG ACGGCTCACG GTCCGCGGCG CGGCACTGGA CGAGCCGCTG
CTGCGGCTGG TCAACGAACC ACGTCGGCTC GACGCACGCC TGCACGACGC GCTCTGGCTG
CGGGTGGTGG ATCTACCGGC CGCCCTCACC GCCCGGCGGT ACGCAACCTC GATCGACGTG
GTGCTGGGGG TCACCGACGA TCTTCTGCCG GAGAACACCG GCCGGTGGCG ACTGGCCGGC
GGGCCGGACG GGGCCAACTG CGCCCCCAGC ACCAGGGCGG CCGACCTGGC CTGCGACATC
CGGGCCCTCG GTGAGCTGTA CCTGGGTGGA GCCGGACTGG GCGAACTCGC CGCTGCCGGC
CGGGTCCGTG AACTTCGGCC CGGCGCCCTG GCGACCGCCG CGCCGGCCTT CGGTTGGCAT
CGCGCGCCCG CCGGCATGGA AGTGTTCTGA
 
Protein sequence
MTFVPKFGVA RTEDLDELGE LLGNLFHGGL DAESWAVERS VLEPERTLIV RDGAELVATA 
TAFTRELTVP GARLPAAHVS MVGVAPTHRR RGLLTELMRR QLTGVRDAGQ EPVALLWASE
GQIYPRFGYG LACHRLLVDC DTTELRLPDP TAAEGRLRLA KPDRCRAELA RVYDGVRPDR
PGWSSRNEAW WNFVLSDPVA RRGGATERRV LLHEGPDGVN GYALFRTRND WDMVGPRAIT
IVDEVVTDDQ AGYLAMWRLL LTLDLTRRLT VRGAALDEPL LRLVNEPRRL DARLHDALWL
RVVDLPAALT ARRYATSIDV VLGVTDDLLP ENTGRWRLAG GPDGANCAPS TRAADLACDI
RALGELYLGG AGLGELAAAG RVRELRPGAL ATAAPAFGWH RAPAGMEVF