Gene Sare_5078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_5078 
Symbol 
ID5704213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5752445 
End bp5755384 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content68% 
IMG OID641274470 
Productglycosyl transferase family protein 
Protein accessionYP_001539811 
Protein GI159040558 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0744] Membrane carboxypeptidase (penicillin-binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000111802 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACTCGT ACGGCGACCC CAGTTCCCCG CGCGGGCGGG CCCGGATACC GGGCCAGCAC 
GACCCCGGGC TGGCCGACGA CGCGTACCGT GCCCCGAATG ACGAGGTGCG TGGCCGGGCG
GCCGCGCCGG AGGCATCGGC GGGCCGTGCC TCGGTAACAC CCGGCGGCGG TGCCTCCAGC
GGGCGGGCCT CGGTTGGCGG CTCGGCGTCC GTCCCCTCCC CGGCTGCCGC CGGCACGGCC
TCGGTCGGTC GCGCCTCGGC GTCGGTATCG GCTGCGCCCG GCCGTGCATC TGTGCCGGCA
CCGGCCGCGC CCGGTCGCGC CTCCGTGCCG GCACCGTCGA CGCCCGGTCG CGCATCGGTG
CCGACATCCC CGGCGCCGGG TCGTGCCTCC GTGCCGGTCT CTCCGGCGCC GGGCGGCTCT
ACTGGGCGCG CGACCGTCGG TGCGGCCTCC GTGGGAACGG CCTCGGCCGG CCGGGCTCGG
GTCGGCACGG CGGCGGTCGA GGGGCGGGCC TCCGTGGCGC GGGCCAGCGT CGGACCGGCG
TCCGCCGGTC CTGGTGGTCC GGGCGGCCCG AGTGGTCCGG GCAGGTCCAG GTCCGGCGGC
CGTGATCCGA ACGCCGCCGC GCGGGCGAAG AAGCGGAAGC GGGCGAACAT GCTGATCGCC
GCGTGCGCGG TCTTCATCAT GCTCGCCGGC GTGGGTGTAG TCGGCTTCGC CTATTACTCC
ACCAACGTCG TTCTACCCAA CCAGATTCCA CTTCCGCAGT CGACGACGGT CTACACGAGT
GACGGCAAGG GGCTGGTCGC CAAGCTTGGC AACGAGAACC GGACGCTCAT TACCGTCAGC
CAGATGCCGC AGCACGTTCG CCACGCGGTG GCCGCTGCCG AGGACCGTAA CTTCTACCGG
CACTCCGGCG TCGACTACAA GGGCATCGCC CGAGCAGCGT GGAACAACTT CACCGGTGGC
CACCGACAGG GCGCGTCGAC GATCACGCAG CAGTACGCGC GTGGCGCCTA CGAGAGCCTC
GAGGACGACA CCTACACCCG GAAGGTGCGG GAGGCGATCT TCGCCTCGAA GCTAAACGAC
GATTTCAGCA AGGAAAAGAT CATGGAGAAC TATCTCAACG TGATCTATTT CGGACGCGGA
GCGTACGGGA TCGAGGCCGC GGCGCAGACC TTCTTCGGCA AGGCCGCCAG CAAGTTGACC
GTCGGCGAGG GCGCGGTGCT GGCTGGGATC ATCAAGCAGC CGGAGCCCTC CGCCACCCAC
AACGGGTTCG ACCCGGCGAC CGCCCCGGAC GACGCGAAGT CGCGATGGGA CTACGTTCTC
GACGGGATGG TGGCCGAGGG CTGGCTCGAC GCGGCGGAGC GGCCGACCGA ATATCCGAAG
GTGAAGCCGC CGGCCGAGGG CGGCAACGGC TTCGGTGTGG CGACCCCACG CGGCAACGTC
ATCAACTATG TGCGGGCGGA AATGGAGCAG TGGGGACTCT GCACCAACAC GGGTGCCGAC
GAGGTCAAGC CCTCCTGTGC GGATGAGCTA CGCAAGGGCG GCTACAAGAT CACCACAACG
ATCGACGACA AGATGCAGAC CGCTCTGGAG AAGGCGGCAC GAGCGGGGGT AAAGGGTTCG
GTCCTCGACG GCCAGCCGGA CAATCTGATG GCTGCCGGAG TCGCGATAGA CCCCAAGACG
GGCCGGGTGC TCGCCTACTA CGGTGGGGAA AGCGGTGGTG ACATCGACTT GGCCGGCAAG
AACACCACAG ACGGCATCCT CTATGGTGGC CATCCCCCTG CCTCGTCCTT CAAGGTCTAC
ACTCTTGCGG CCGCCATCGA GGCCGGCATC TCCGTCAACT CCCGGTGGGA CGCGACGCCC
TTCACCCCCG AGGGGTACGA CGACAAGATC CATAACGCGA GCCGGAACGC GCACTGTGGT
AAGTCCTGCA CCCTGGACGA GTCGACGGTC AAGTCGTACA ACGTTCCGTT CTTCCATGTG
GCGGAGCAGA TCGGCCCGGA CAAGGTGGTC AGCATGGCCC GGCAGGCGGG TATCACCACC
ATGTGGAACA ACGACGACCC CGCCACGCCG TTCAACCTGT TGGCGGAGAA GCCGGAGGAC
CTGGCACCGA AGCAGTTCGA CCACGTGGTG GGCTATGGCG CGTACCCGGT CACCGTCCTC
GACCATGCCA ACGGTCTCGC GACGCTGGCG AACGACGGTC TATACCATAA GGCGCATTTC
GTGCTCAAGG TCGAACAGAA GGACAAAACG ACCGGCGAGT GGAAGGTCGT CCGTGGGACC
GGCGAGAAGC TGGACGGGCA GCAGCGGATC CGGAAAGGGG TCACCGACGA GGTGACCGCG
GTGCTCAAGC AGATCCCGAG TGAGAACGGC GATGCCCTGT CCGGTCGTCG GCAAGCGGCC
GGGAAGACCG GCACCTGGGA ACTCATCGGG ACCCCCCACA ACTCGAATGC CTGGATGGTC
GGCTACGACG ACAACCTGGC GACAGCGATA TGGATCGGCG CTAACCCCGA GGCAGAAAGT
AAAGCGATTC TCACCAAGAA CAAGAAGAAC ATCGGCGGTA GCGGTCTCCC GGCGGACCTG
TGGAAGCGGT TCATGGACGA CGCGCTCAAC GGTAAGCCCA AGTCCGACCT GCCGCGCATC
ACCGGAGTCG GCGACGACAC GGTCGGCAAC GGCGAGCAGC CGAAACCGGA GCCGCCGGAC
TGCGGGTGGC TCGGCGGCCT GTTCTGCCCG GACGACGACG ACGATGATGA CGACAACGGC
GGTGGTGGCG ACAACGGCGG TGGCGGTAAC AACGGCGGTG GCGGTAACAA CGGCGGTGGC
GGTAACAACG GCGGTGGCGG TAACAACGGC GGTGGCGGTA ACAACGGCGG TGGCGGCGGT
GGAGACATCG GGTTCCCGCC GCCGCCAACC GGAAACACCG AACGACCCAG GCGGGACTAG
 
Protein sequence
MNSYGDPSSP RGRARIPGQH DPGLADDAYR APNDEVRGRA AAPEASAGRA SVTPGGGASS 
GRASVGGSAS VPSPAAAGTA SVGRASASVS AAPGRASVPA PAAPGRASVP APSTPGRASV
PTSPAPGRAS VPVSPAPGGS TGRATVGAAS VGTASAGRAR VGTAAVEGRA SVARASVGPA
SAGPGGPGGP SGPGRSRSGG RDPNAAARAK KRKRANMLIA ACAVFIMLAG VGVVGFAYYS
TNVVLPNQIP LPQSTTVYTS DGKGLVAKLG NENRTLITVS QMPQHVRHAV AAAEDRNFYR
HSGVDYKGIA RAAWNNFTGG HRQGASTITQ QYARGAYESL EDDTYTRKVR EAIFASKLND
DFSKEKIMEN YLNVIYFGRG AYGIEAAAQT FFGKAASKLT VGEGAVLAGI IKQPEPSATH
NGFDPATAPD DAKSRWDYVL DGMVAEGWLD AAERPTEYPK VKPPAEGGNG FGVATPRGNV
INYVRAEMEQ WGLCTNTGAD EVKPSCADEL RKGGYKITTT IDDKMQTALE KAARAGVKGS
VLDGQPDNLM AAGVAIDPKT GRVLAYYGGE SGGDIDLAGK NTTDGILYGG HPPASSFKVY
TLAAAIEAGI SVNSRWDATP FTPEGYDDKI HNASRNAHCG KSCTLDESTV KSYNVPFFHV
AEQIGPDKVV SMARQAGITT MWNNDDPATP FNLLAEKPED LAPKQFDHVV GYGAYPVTVL
DHANGLATLA NDGLYHKAHF VLKVEQKDKT TGEWKVVRGT GEKLDGQQRI RKGVTDEVTA
VLKQIPSENG DALSGRRQAA GKTGTWELIG TPHNSNAWMV GYDDNLATAI WIGANPEAES
KAILTKNKKN IGGSGLPADL WKRFMDDALN GKPKSDLPRI TGVGDDTVGN GEQPKPEPPD
CGWLGGLFCP DDDDDDDDNG GGGDNGGGGN NGGGGNNGGG GNNGGGGNNG GGGNNGGGGG
GDIGFPPPPT GNTERPRRD