Gene Sare_3343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3343 
Symbol 
ID5708298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3858662 
End bp3862942 
Gene Length4281 bp 
Protein Length1426 aa 
Translation table11 
GC content65% 
IMG OID641272770 
Producthypothetical protein 
Protein accessionYP_001538137 
Protein GI159038884 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00184093 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000724597 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTTTCA CGGCACCTGA AACCCCGTGT CCGCCGCAAG TAGCCCGACA TCCCACTATC 
TACCGCCTCG TCGAGGATCT TGAATTCTTC GAACGTGGCT ACGGAGTGAT CATGAGATCC
CCTACCGACT TGGAAACTCC TGAATTGCCT CGGTGGCGTA GTACGTCACA CCGAGGCTTA
CTTATCGGAC TAGTCATCCT GTCCCTGCTG GCGAGCTTGC TGCCGGGCAA TCCGGCGCGA
GCTAACGAAC CATCACCTCT TGCTGTCGAT CGGTCGCCGG TGGTGCAGGC ATGGCTGGCC
GGTGGTTCTC AGGTCCGCAC TGCGGCAGAG CGGGCCCTGA TCGGTTCGGA TGAGGACATC
CAAACGTTCC TCGACGAGGG TTGGGAACAG GCGCAGCGGC TGGACGAGCG TGACGCGCTG
GTGGCGGTGA TCGCTGAAGG CGGGCCGGCG CTCCGGAGGG CTGCGGAGCA GGCTCTAACC
GCCGCCGACG ATGGTGATCA GTCCGCCTTG CGAATGTTCC TAGACTCCGG CTGGCAACAG
CCGTCGAACA CCGACACCCG GTTGCGTGTT AACCAGCTGA TGGCAACTGG TGGGAATGAG
GTGAAGGCCG CCGCGCAGGA GGCGTTGGAC TCACCGGATC CATTCGTGTG GCACGAATTT
CTGGAGTCAG GATGGCAGTC ACGGTGGCTG ATTGACCAGC GGATCCGGGT CAACCAGGCG
ATGGCCGCGG GCGGGCCGCA GGTGAAAGCA GCAGGTCAGA AGGCGCTGGA TGCCGGGACA
CCCGAGGCGT TCGAAACGTT CCTGGGTTAC GGCTGGTCAG TGGCGGCGGC CCAAGACCAG
GAGACCGAAA CGCTGACCTC ACTGCAGGCG CAGGCACAGG CACAGGGTGA CCTGGCCGCG
CAGGAAACTC AGCGGGCCGA GGAGGAGGCG GCACGAGCAA AGGAGGCCGC GGAGGCGGCT
CGTCGGTCGG CTCAGGAGGC CGCCGCGGCG ACGGACGCGG CGCGGCAGGA TACGGCTGAG
GCGGCGGCGC AGGCGAAGCG GGCCGCGGTG GCCGCGCAGA AGGCGGCGTC GGCTGCGAAG
GTGGCGGTGC GGGCGGCGGC GTCGGCGAAG CGGGCGGCGC GGGCAGCCGC AGCTGCGGCA
CAGCGCGCGG CGTCGGCGGC AGCGCAGGCG GATCGGGCGG CGACGAAGGC GTACAACGCC
GCGGCACAGG CGGCGACGGA TGAGTCGAAG GCCGACGAAG CGCGAGCGAC CGCGCAGAAG
GCAAGGGAGC AAGCCGGCCT AGCACGGGAG CTTGGCACCA TCGCCGATCT GGCAGGTAAG
GCCATCCAGG CAGGTTCGGA TGCGATAGAT GCCGTTAAGG CGGCGGTGGC GCAGGCGAGG
CTGGCAGCCG CCGCCAACGA CGAGGCAGTC CAGTACGCCA ACGCGGCCGG CGCGAACGCC
TCGGCAGCGG TGGCGGCGGC CGAGCAGGCG CGGGCGGACG CGGACCGGGC GCTGCGCGCG
GCAAACGCGG CCCAGAAGTA CCTGAACGTG GCGGCGCAGG CGGCCTTCGC TGCGCGGGAC
GCGGCGAATC GGGCGGCACA GCATGCGGAA GATGCGGCGG ATGCCGCGAT CGACGCGGCG
AACCACGCCG GTGACGCGGC GACGGCGGCA CAACGTGCTA GCGAGGCCGC CAACTCAGCA
ACCATAGCGG CACAGGCGGC GGTGGAGACC GCCAGCCAGG CGATCGAGGT GTATGACGCG
GCGCGGGAGG CAGACACCGA GCGGCTCGCC GTCTTTCAGG ATGAGCGTGT CGAGGCGGCT
CACCAGGCCG CCGCGCAGTA TGACGAGGCG CAGGCCGCGG CCACATGGGA CGCGCTGCAG
GCCGAGCAAC GGGACGCGGA GACCGATCAG CTGATCGCCG AAGCGCTGAA CCCGGCAACC
GAAACCGCTG CCGCAGTAAC CGCTGCCCGC AAGGTGGCGA TGAACCTCGT GCATGCCTCC
GGCACCTGGA CGCGGCAGGC CGCGCAGGCA GCGCTCGGTG GCAGTGATGC CCAGGTCATG
GAGTTCGTTC GGACTGGTAT CGCAGAGGCA GCCGGGCAAG ACGACCGGAT CGTGGTTGGC
GAACTGGCAA TCAGTGAAAA CACCTCGCTG CGTGATGCGG CACTCGCGGC GTTGGAGGGC
AGCGACGCCG AGGTATCTCA GTTCCTCGCT ACGCAGGACT ACCCCGGGCG CTACATCCAA
GACCGGCTCA AGGTCAACCA GATCATGGCC GCCGCGAAGG ACTCCGGCGA TACCCACCTG
GCGCAGAAGG CGCAGGAGGC CCTCAGCAAC GGCGACGGGC AGGTATTGCG CACGTTCATC
GCGTCAGGCC AGCACACGGC GGCCGCCATC AGCCAGCGGA TCCAGGTCAA CCAAATCCTG
GCCAGTGCGG AAAGCGGCCC GGAGGTCAAG GCTGCCGCAC AGATTGCCCT CACTGGGCCG
CCGCCCGGGC TCCACAAGTT CCTCACCGAG GGGCGCTACG CCGCCGCCGA GCGGGACCAA
AGCGCCGCCG CGCACCTGGC AGTGGTCGCC AGCCTCGTGG AACGGATCAA CGAGGTGGCC
CAGACCGCGA CACAGGATGC GATGAACGCG CAGGCGGTCG CGGCCGAAGC CCGCGACGAT
GCGGCATCCG CGGCGGACTA CGCGAACCAG GCCGCCCAGT CAGCGCAGGC TGCGGCTACT
TACGCCGCAC AGGCAGCGCA GTACGCCAAT CAGGCCAGTA AGTCAGTGGC CGAGGCCGAA
GCCGCAGTCC AGACCGCCAA GACGGCAGCG ACCCAGGCAG TCGACTCGGC ACGCAGCGCA
ATCCGCTCTG CTTCCTGGGC GATTCTCTCC CACGAACGGG CCGTCCAATC TGCCAGGGAA
GCGCAAGCCT TCGCGCAAGC TGCGTACGAC TCCGCCATCG CCGCCGGTAA GAGCGCTGAG
GAAGCAGCCA AGGCGGCAGA AGACGCGCGT CGTGAATACA AGCTGGCCGA GGGCCGCGAG
GTCGCCATAT GCACTTTCAA TTACAACGAC GCTGGAGATG ATTGGGACAA GTTCCTTACC
GACGAATCGA ACGATGCCGC CGAGAATTGC GTCCGAAACG TCATCGTCAA CCCGTCCGAA
CTGACCAGCC GGGCGTACAT CAACGCCGGG TACTGCGACG TCTATACCAG CGGTAGCCAG
TACTACAAGA ACTGCATTGC ATCGGTCCTG TCTCCCACCT TCCTGCGTGA TCAGACACTG
ACCGTTATCA CGGCAATTAT CGAAGATATT ACCGCGTTCC TAATCCCGGT TGCCGGAGCC
CTCGCGATCG GCTGCCTGCT GACGGTCGTT TGCGGCACGG TAGCCCTCAC TCTATTGACC
ATCGGTGAGG TCGGATTCAA CTTTGAGCAA TACATAAGCG GTGACCAAGA TCTAGCTGAC
ACCCTGCTAG ACCTGGGCAC CCTAGCGCTA GAAGCCCTGG CCTTCGCCGG AGCCGCTAAA
CTCGTCGGCA CCGGCTTCCA AGCCGCTAAA CAGCTCTATA CCATCAGCCG CGCCGCCAAA
CAGGCTGAGG AAAACTTGGC AGCGGCCAAC ACTTCGCGAG GTCACCTGTT GGTCACCTCG
TGCCTAACTG GGAACAGCTT CTCCGCCAAC ACGCTCAGAG ATGGCGGCAG CGAACCCATC
GCAAACGTTC TCGAGGGTGA CCGCGTCCTG GGAACCGATC CCACCACCCG TGCCACCATT
GCGCGGCCCG TGATGAACGT CATCCGTAAT ACCGGCACAA GGCTCACCGG CGCGGCGGTC
GCGTCGGCCA CCGAATCGGC GACGCACAAC CTGGCGGTCG CCGAGCCCCA CACGTACCAT
GTGCTCGCCG GCGCCATGCC GGCGCTGGCC AACAACTGCG ACTTGATCAG GGTGGTGCAT
GCAGAGTACG AGAAAATTAC TACGGTTGGA TCACCCGACT ATCTTTCGAT CAGAAGTCGA
GGGCCAGTCC TCACGGGCGT CAAGGACGAG ACTACCGGCG ATATTGTCAC CTCGCTAAAC
CATTCAGATT CGATCGAGAA TTTACACCCT AGCCTCGCCG CGCGACTGGG TCCGGACATT
GGATCGCTAT ACCCTGGCGG TTCAGGAATC CACGGTGAAG TACACGGTCT CAACGAATTG
CTCTGGAGGC GGGAGTCCGC AGGGTTAAGT ACTCAGATCG ACGACAGCTT TAGCTATTAT
AGCGTCAGAC TTCGAGGAGC GAAGCAAGGA ATGCTAATCC CACCGTGTCC AGTCTGCTCA
CGACTCACTC CCTGGCTGTA G
 
Protein sequence
MPFTAPETPC PPQVARHPTI YRLVEDLEFF ERGYGVIMRS PTDLETPELP RWRSTSHRGL 
LIGLVILSLL ASLLPGNPAR ANEPSPLAVD RSPVVQAWLA GGSQVRTAAE RALIGSDEDI
QTFLDEGWEQ AQRLDERDAL VAVIAEGGPA LRRAAEQALT AADDGDQSAL RMFLDSGWQQ
PSNTDTRLRV NQLMATGGNE VKAAAQEALD SPDPFVWHEF LESGWQSRWL IDQRIRVNQA
MAAGGPQVKA AGQKALDAGT PEAFETFLGY GWSVAAAQDQ ETETLTSLQA QAQAQGDLAA
QETQRAEEEA ARAKEAAEAA RRSAQEAAAA TDAARQDTAE AAAQAKRAAV AAQKAASAAK
VAVRAAASAK RAARAAAAAA QRAASAAAQA DRAATKAYNA AAQAATDESK ADEARATAQK
AREQAGLARE LGTIADLAGK AIQAGSDAID AVKAAVAQAR LAAAANDEAV QYANAAGANA
SAAVAAAEQA RADADRALRA ANAAQKYLNV AAQAAFAARD AANRAAQHAE DAADAAIDAA
NHAGDAATAA QRASEAANSA TIAAQAAVET ASQAIEVYDA AREADTERLA VFQDERVEAA
HQAAAQYDEA QAAATWDALQ AEQRDAETDQ LIAEALNPAT ETAAAVTAAR KVAMNLVHAS
GTWTRQAAQA ALGGSDAQVM EFVRTGIAEA AGQDDRIVVG ELAISENTSL RDAALAALEG
SDAEVSQFLA TQDYPGRYIQ DRLKVNQIMA AAKDSGDTHL AQKAQEALSN GDGQVLRTFI
ASGQHTAAAI SQRIQVNQIL ASAESGPEVK AAAQIALTGP PPGLHKFLTE GRYAAAERDQ
SAAAHLAVVA SLVERINEVA QTATQDAMNA QAVAAEARDD AASAADYANQ AAQSAQAAAT
YAAQAAQYAN QASKSVAEAE AAVQTAKTAA TQAVDSARSA IRSASWAILS HERAVQSARE
AQAFAQAAYD SAIAAGKSAE EAAKAAEDAR REYKLAEGRE VAICTFNYND AGDDWDKFLT
DESNDAAENC VRNVIVNPSE LTSRAYINAG YCDVYTSGSQ YYKNCIASVL SPTFLRDQTL
TVITAIIEDI TAFLIPVAGA LAIGCLLTVV CGTVALTLLT IGEVGFNFEQ YISGDQDLAD
TLLDLGTLAL EALAFAGAAK LVGTGFQAAK QLYTISRAAK QAEENLAAAN TSRGHLLVTS
CLTGNSFSAN TLRDGGSEPI ANVLEGDRVL GTDPTTRATI ARPVMNVIRN TGTRLTGAAV
ASATESATHN LAVAEPHTYH VLAGAMPALA NNCDLIRVVH AEYEKITTVG SPDYLSIRSR
GPVLTGVKDE TTGDIVTSLN HSDSIENLHP SLAARLGPDI GSLYPGGSGI HGEVHGLNEL
LWRRESAGLS TQIDDSFSYY SVRLRGAKQG MLIPPCPVCS RLTPWL