Gene Sare_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0223 
Symbol 
ID5705985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp254896 
End bp256101 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content69% 
IMG OID641269752 
Producthypothetical protein 
Protein accessionYP_001535149 
Protein GI159035896 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0103293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGT GGGAGATCCT GCGTTTCGCC GGCCAGGGCG TCGCCGGCAA CAAGCTGCGC 
TCGGCACTGA CCATGCTGGG CATCCTGATC GGTGTCGCGG CGGTGATTCT GCTGGTCGCC
GTCGGCAACG GTTCGGCCCA GGCCGTCAAC CGCAGCATCG AGACGCTGGG CACGAACACG
ATCACCGTGT CGAGCACCGC TCGCGGTGGC TCCACGCCGT CCGCGCTCAC CGTCGACATC
GCCGAGGCGC TGTACGACCC GGCGCTGGCT CCTGACGTAC GGGCGGTGTC ACCGGTGGTC
ACCACCGCGC CGACGATCAC TCACGATGGC GCTGATCACC AGGTCGCCCA GTTCCTCGGC
ACGTATCCGA CCTACTTCGG TTCCTCGAAC AGCCAGGTTG CCAGCGGTGC CGGCTTCACC
GACGAGGACG TGACGCAGGG CCGTCGGGTG GTGGTGCTCG GCCGGACCGT CGCCGCTGAG
CTGTTCGTCG ACGCCGATCC AGTCGGTCGA CAGGTCACCG TCGGCGGTGC CCTTTACACG
GTGGTCGGTG TGCTCGCCGA GAAGCCTGCG GCCGGAGGAC TCACGGACTC CAACGACGTG
GCGATCGCTC CGCTGACCGC CGTACAGCAG ACGCTGACCG GGTACGGTGC GGTCAACTCG
ATCCTGGTCG AGGCCGCCGG GGCGGATCGG GTGAACGCCG CCCAGGACCA GGTCACCCGG
ATCCTCGACC AGCGGCTCAA CGATCCGACG GGCGCCACCG CCGCGGCGCC GTATCGCATC
CAGAACGCCA GCCAGCTACT CGCCACCCGT ACCGAGACCG CGCGGACCTT CACCGTGCTG
CTCGGTACCG TCGCCGGCAT AAGCCTGCTC GTCGGTGGAA TCGGCATCAC CAACATCATG
CTGGTGACGG TCACCGAACG GACCCGGGAG ATCGGTATCC GGAAGGCCCT CGGCGCCCCT
CGGCGCACCA TCGCGACCCA GTTCCTCGCC GAGGCGACCC TGCTCAGCGT GCTCGGCGGC
GGCCTCGGTG TGGCCGTGGC GCTGATCGGC AGCCGCTTCA CCATCGTCGG CGTGCAGCCG
GTGATCGTGC CCAGCTCCGT CGCGCTGGCG CTGGGCGTCT CAGTCGCCAT CGGGCTCTTC
TTCGGCAGCG TCCCCGCCAA CCGGGCCGCC GGGCTACGTC CTATCGAGGC ACTTCGCTAC
GAATGA
 
Protein sequence
MSLWEILRFA GQGVAGNKLR SALTMLGILI GVAAVILLVA VGNGSAQAVN RSIETLGTNT 
ITVSSTARGG STPSALTVDI AEALYDPALA PDVRAVSPVV TTAPTITHDG ADHQVAQFLG
TYPTYFGSSN SQVASGAGFT DEDVTQGRRV VVLGRTVAAE LFVDADPVGR QVTVGGALYT
VVGVLAEKPA AGGLTDSNDV AIAPLTAVQQ TLTGYGAVNS ILVEAAGADR VNAAQDQVTR
ILDQRLNDPT GATAAAPYRI QNASQLLATR TETARTFTVL LGTVAGISLL VGGIGITNIM
LVTVTERTRE IGIRKALGAP RRTIATQFLA EATLLSVLGG GLGVAVALIG SRFTIVGVQP
VIVPSSVALA LGVSVAIGLF FGSVPANRAA GLRPIEALRY E